From: Emmanuel Bigler
Newsgroups: gmane.emacs.bugs
Subject: bug#7962: 23.2; capitalize letters ISO-8859-1 with diacritic signs in emacs 23.2.1
Date: Fri, 04 Feb 2011 18:08:51 +0100
Message-ID: <4D4C32A3.3030906@ens2m.fr>
References: <4D493388.2070801@ens2m.fr> <4D4AAEE7.9040509@ens2m.fr> <4D4AF8B2.6090108@ens2m.fr> <83oc6sbyum.fsf@gnu.org> <87hbckn4ph.fsf@turtle.gmx.de> <834o8jc0k7.fsf@gnu.org>
In-Reply-To: <834o8jc0k7.fsf@gnu.org>
To: Eli Zaretskii, Lennart Borgman, schwab@linux-m68k.org, 7962@debbugs.gnu.org
Cc: svenjoac@gmx.de

>>
>> I see this:
>> buffer-file-coding-system is a variable defined in `C source code'.
>> Its value is iso-latin-1-dos
>
> See "M-: (coding-system-priority-list) RET".
>
> The highest-priority encoding is set from your locale, but look what
> is the next one.
>

Hello again.

I think I'm starting to understand what is going on.

A long time ago I created a unibyte file containing the 1-byte
characters I want to test within emacs. The file was created with a
program over which I have total byte-by-byte control, so I know exactly
what is inside the file.
I have attached the file to this mail. I am not sure that this is
allowed on the gnu-debug mailing list, but it is a simple and very short
.txt file that reads as follows (this mail itself is typeset and
displayed here as iso-8859-1):

------- mytestchars-224-255-iso-8859.txt ---------------------
224 \340 à 225 \341 á 226 \342 â 227 \343 ã
228 \344 ä 229 \345 å 230 \346 æ 231 \347 ç
232 \350 è 233 \351 é 234 \352 ê 235 \353 ë
236 \354 ì 237 \355 í 238 \356 î 239 \357 ï
240 \360 ð 241 \361 ñ 242 \362 ò 243 \363 ó
244 \364 ô 245 \365 õ 246 \366 ö 247 \367 ÷
248 \370 ø 249 \371 ù 250 \372 ú 251 \373 û
252 \374 ü 253 \375 ý 254 \376 þ 255 \377 ÿ
éèçàù   < test strings to see how they behave
Éèçàù
----------------------------------------------------------

I started

/usr/local/bin/emacs -Q mytestchars-224-255-iso-8859.txt

under emacs 23.2.93.1 (i686-pc-linux-gnu).

The file displays perfectly correctly. (describe-char (point)) gives me
exactly what I want, i.e. an extended ASCII decimal code between 224 and
255.

Almost all operations (except capitalize, see below) work exactly as I
wish and exactly as in older emacs versions. This is no mystery, since
the priority list

M-: (coding-system-priority-list) RET

reads as:

(iso-latin-1 utf-8 iso-2022-7bit iso-2022-7bit-lock iso-2022-8bit-ss2
emacs-mule raw-text iso-2022-jp in-is13194-devanagari chinese-iso-8bit
utf-8-auto utf-8-with-signature ...)

Again I'm perfectly happy, since I see that iso-latin-1 comes first. But
is this what I want? Certainly yes; my locale environment variables look
like this:

LC_ALL=fr_FR.ISO8859-1
LC_COLLATE=fr_FR.ISO8859-1
LANG=fr_FR.ISO8859-1
GDM_LANG=fr_FR.iso88591
LC_CTYPE=fr_FR.ISO8859-1
XTERM_LOCALE=fr_FR.ISO8859-1

However, in this emacs -Q session, with a correct unibyte display of a
unibyte file, *capitalize does not work*.

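For reference, the case mapping expected for these characters can be checked outside emacs. The sketch below is plain Python, not emacs code, and the names `latin1_upper_byte` and `latin1_capitalize` are made up for illustration. It relies only on the ISO-8859-1 layout itself: the lowercase letters 224 to 254 (except 247, the division sign) have their uppercase form 32 positions lower, while 255 (ÿ) has no uppercase form inside ISO-8859-1. Unlike M-c, the helper does not lowercase the rest of the word, which is enough for the all-lowercase test string.

```python
def latin1_upper_byte(code: int) -> int:
    """Return the ISO-8859-1 code of the uppercase form of `code`.

    Codes without an uppercase form in ISO-8859-1 are returned unchanged.
    """
    if 97 <= code <= 122:                     # ASCII a-z
        return code - 32
    if 224 <= code <= 254 and code != 247:    # à-þ, skipping ÷ (247)
        return code - 32
    return code                               # e.g. ÷ (247) and ÿ (255)


def latin1_capitalize(data: bytes) -> bytes:
    """Uppercase the first byte of `data`, leaving the other bytes untouched."""
    if not data:
        return data
    return bytes([latin1_upper_byte(data[0])]) + data[1:]


# The test string from the file, éèçàù, as raw ISO-8859-1 bytes.
word = bytes([0xE9, 0xE8, 0xE7, 0xE0, 0xF9])
print(latin1_capitalize(word).decode("iso-8859-1"))   # Éèçàù
```

The result is still one byte per character: only the first byte changes, from 0xE9 to 0xC9, which is the second test string of the file.
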
At the beginning of this discussion, Sven explained that capitalize
would only work on 2-byte characters. I tested that, of course, and of
course it works, but I simply wish I could continue to capitalize
unibyte words with M-c like in the good old iso-8859 days!

Additional info: when applying the M-c command to a letter above
decimal ASCII 224, nothing happens on the display, as reported,
*although the buffer is marked as being changed.*

Incidentally, in a good ol' xterm window (fitted with GNU readline and
obeying my locale preferences as listed above), M-c works perfectly, as
it should. If I cut and paste from the xterm to the emacs buffer,
everything looks fine and unibyte ... except that I can no longer
change the case of the pasted string with 'capitalize' or a similar
'case' command.

Bug, or UTF-8 emacs 23.2 feature?

-- 
Emmanuel

[Attachment: mytestchars-224-255-iso-8859.txt (text/plain); same character table and test strings as in the message body]
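
The distinction between raw single bytes and decoded characters that comes up in this thread has a rough analogue outside emacs. The following Python sketch is offered only as an analogy, not as a description of emacs internals: case conversion applied to raw bytes touches ASCII letters only, while the same bytes decoded as ISO-8859-1 text are capitalized as expected.

```python
# éèçàù as raw ISO-8859-1 bytes (the first test string of the file).
raw = bytes([0xE9, 0xE8, 0xE7, 0xE0, 0xF9])

# On raw bytes only ASCII letters have case, so nothing changes.
print(raw.capitalize() == raw)            # True

# Decoded as ISO-8859-1 text, the first letter is capitalized.
text = raw.decode("iso-8859-1")
capitalized = text.capitalize()
print(capitalized)                        # Éèçàù

# Re-encoding gives single bytes again, with 0xE9 replaced by 0xC9.
encoded = capitalized.encode("iso-8859-1")
print(encoded.hex())                      # c9e8e7e0f9
```
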