From: Yuri Khan
Newsgroups: gmane.emacs.help
Subject: Re: upcase-word, &c. for non-latin alphabets
Date: Thu, 10 Apr 2014 09:55:40 +0700
To: Eli Zaretskii
Cc: "help-gnu-emacs@gnu.org"
References: <83a9bu5utv.fsf@gnu.org> <831tx65sjy.fsf@gnu.org>

On Thu, Apr 10, 2014 at 12:19 AM, Eli Zaretskii wrote:

>> Does it also do the right thing for the Turkic dotted and dotless I
>> (which is about the only thing in Unicode that cannot be up/downcased
>> without knowing the language)?
>
> No, Emacs doesn't yet support titlecase.

As far as I understand, titlecase is when you capitalize each word in a sentence (in English; in other languages rules differ). My question was not about that.

In Turkic languages (including but not limited to Turkish), the lowercase of ?I is not ?i but ?ı (U+0131 Latin small letter dotless i), and the uppercase of ?i is ?İ (U+0130 Latin capital letter I with dot above).

$ emacs --version
GNU Emacs 24.3.1
[…]

$ emacs -Q
M-x set-language-environment Turkish
(in *scratch*)
M-< M-u ==> ;; THIS buffer… (wrong for Turkish)
M-l ==> ;; if you want… (also wrong for Turkish)

Further, if I paste the above paragraph and try M-u and M-l on those funny letters, Emacs does nothing to them.

This happens even when I do “sudo locale-gen tr_TR.utf8” and/or start Emacs as “LANG=tr_TR.utf8 emacs -Q”.
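
To make the expected mapping concrete outside Emacs, here is a minimal Python sketch of the rule described above. It is only an illustration and has nothing to do with how Emacs implements case conversion. The helper names turkic_upper and turkic_lower are made up for this example. Python's built-in str.upper and str.lower apply the language-independent default mappings, so the sketch swaps the dotted and dotless pairs first and then falls back to the defaults for everything else.

```python
# Illustration only: Turkic-aware case conversion using explicit tables.
# turkic_upper / turkic_lower are hypothetical helpers, not a library API.

# Applied before the default mapping so the dotted/dotless pairs are kept.
_TURKIC_UPPER = str.maketrans({"i": "\u0130", "\u0131": "I"})  # i -> İ, ı -> I
_TURKIC_LOWER = str.maketrans({"I": "\u0131", "\u0130": "i"})  # I -> ı, İ -> i


def turkic_upper(text: str) -> str:
    """Uppercase text, treating i/ı the way Turkic languages do."""
    return text.translate(_TURKIC_UPPER).upper()


def turkic_lower(text: str) -> str:
    """Lowercase text, treating I/İ the way Turkic languages do."""
    return text.translate(_TURKIC_LOWER).lower()


if __name__ == "__main__":
    # Default (language-independent) mapping versus the Turkic one.
    print("This".upper(), turkic_upper("This"))
    print("If".lower(), turkic_lower("If"))
```

With these tables, "This" uppercases to "THİS" and "If" lowercases to "ıf", which is what I would expect M-u and M-l to produce in a Turkish language environment.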