From mboxrd@z Thu Jan 1 00:00:00 1970 Path: main.gmane.org!not-for-mail From: "Stephen J. Turnbull" Newsgroups: gmane.emacs.devel,gmane.mail.mew.general Subject: Re: [mew-int 01593] Re: windows 1252 Date: Wed, 05 Nov 2003 14:00:44 +0900 Organization: The XEmacs Project Sender: emacs-devel-bounces+emacs-devel=quimby.gnus.org@gnu.org Message-ID: <87he1jquir.fsf@tleepslib.sk.tsukuba.ac.jp> References: <20031105.005502.58143677.kazu@iijlab.net> <878ymwrmzm.fsf@tleepslib.sk.tsukuba.ac.jp> <20031105.105912.246010891.kazu@iijlab.net> NNTP-Posting-Host: deer.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=iso-2022-jp X-Trace: sea.gmane.org 1068008608 32071 80.91.224.253 (5 Nov 2003 05:03:28 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Wed, 5 Nov 2003 05:03:28 +0000 (UTC) Cc: mew-int@mew.org, emacs-devel@gnu.org Original-X-From: emacs-devel-bounces+emacs-devel=quimby.gnus.org@gnu.org Wed Nov 05 06:03:25 2003 Return-path: Original-Received: from quimby.gnus.org ([80.91.224.244]) by deer.gmane.org with esmtp (Exim 3.35 #1 (Debian)) id 1AHFpF-0000hl-00 for ; Wed, 05 Nov 2003 06:03:25 +0100 Original-Received: from monty-python.gnu.org ([199.232.76.173]) by quimby.gnus.org with esmtp (Exim 3.35 #1 (Debian)) id 1AHFpF-0002ht-00 for ; Wed, 05 Nov 2003 06:03:25 +0100 Original-Received: from localhost ([127.0.0.1] helo=monty-python.gnu.org) by monty-python.gnu.org with esmtp (Exim 4.24) id 1AHFna-0001JR-9v for emacs-devel@quimby.gnus.org; Wed, 05 Nov 2003 00:01:42 -0500 Original-Received: from list by monty-python.gnu.org with tmda-scanned (Exim 4.24) id 1AHFnF-0001IU-Cv for emacs-devel@gnu.org; Wed, 05 Nov 2003 00:01:21 -0500 Original-Received: from mail by monty-python.gnu.org with spam-scanned (Exim 4.24) id 1AHFmi-00013M-9N for emacs-devel@gnu.org; Wed, 05 Nov 2003 00:01:20 -0500 Original-Received: from [130.158.98.109] (helo=tleepslib.sk.tsukuba.ac.jp) by monty-python.gnu.org with esmtp (Exim 4.24) id 1AHFmh-00012k-Ki for emacs-devel@gnu.org; Wed, 05 Nov 2003 00:00:47 -0500 Original-Received: from steve by tleepslib.sk.tsukuba.ac.jp with local (Exim 3.36 #1 (Debian)) id 1AHFme-0004Xy-00; Wed, 05 Nov 2003 14:00:44 +0900 Original-To: Kazu Yamamoto (~~~~) In-Reply-To: <20031105.105912.246010891.kazu@iijlab.net> (Kazu Yamamoto's message of "Wed, 05 Nov 2003 10:59:12 +0900 (JST)") User-Agent: Gnus/5.1002 (Gnus v5.10.2) XEmacs/21.5 (celeriac, linux) X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.2 Precedence: list List-Id: Emacs development discussions. List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+emacs-devel=quimby.gnus.org@gnu.org Xref: main.gmane.org gmane.emacs.devel:17694 gmane.mail.mew.general:520 X-Report-Spam: http://spam.gmane.org/gmane.emacs.devel:17694 >>>>> "Kazu" == Kazu Yamamoto <(山本和彦) > writes: Kazu> Q1) According to the page above, 0x8f is undefined, and 0x9e Kazu> is defined as LATIN SMALL LETTER Z WITH CARON. Kazu> But Emacs 21.3.50 treated 0x8f as LATIN SMALL LETTER Z Kazu> WITH CARON and 0x9e as undefined. Kazu> Is this a bug? Yes. CP1252.TXT from the Unicode Consortium also has 0x9E as LATIN SMALL LETTER Z WITH CARON. Kazu> Q2) However the encoding is different from the one above. Is Kazu> this encoding correct? The encoding below "ESC $ - 1" uses a multibyte private charset in the GR register, then switches back to Latin-1 with "ESC - A". Since Windows 1252 is just Latin-1 in 0xA0--0xFF, this is the expected result. I don't recognize the encoding offhand, but I guess it's some transform of Unicode, probably the mule-unicode charset. Kazu> Note that I verified that Emacs can read the ctext file Kazu> correctly. Then the obvious advice is that mew should just use that encoding, and recommend GNU Emacs to those who insist on Windows 1252 (etc) instead of ISO 8859/1. I doubt that XEmacs will use this private charset; it's really too late to add to 21.4, and 21.5 will take Stefan's advice and recommend Unicode (UTF-8) for this purpose. We recognize the backward compatibility problem that mew faces, but prefer all standard-with-XEmacs external encodings to be standard to improve portability to other apps, and to leave all ISO 2022 private charsets to the user. It's a tough choice. The only real question is will emacs-unicode continue to support the mule-unicode charsets as external encodings. I suppose so, but you'd have to ask Handa-san and/or Dave Love. If not, then I suggest you follow Stefan's advice and start moving to UTF-8 immediately. -- Institute of Policy and Planning Sciences http://turnbull.sk.tsukuba.ac.jp University of Tsukuba Tennodai 1-1-1 Tsukuba 305-8573 JAPAN Ask not how you can "do" free software business; ask what your business can "do for" free software.