From mboxrd@z Thu Jan 1 00:00:00 1970 Path: main.gmane.org!not-for-mail From: Kazu Yamamoto (=?iso-2022-jp?B?GyRCOzNLXE9CSScbKEI=?=) Newsgroups: gmane.mail.mew.general,gmane.emacs.devel Subject: [mew-int 01594] Re: windows 1252 Date: Wed, 05 Nov 2003 10:59:12 +0900 (JST) Message-ID: <20031105.105912.246010891.kazu@iijlab.net> References: <20031105.005502.58143677.kazu@iijlab.net> <878ymwrmzm.fsf@tleepslib.sk.tsukuba.ac.jp> NNTP-Posting-Host: deer.gmane.org Mime-Version: 1.0 Content-Type: Text/Plain; charset=us-ascii Content-Transfer-Encoding: 7bit X-Trace: sea.gmane.org 1067997587 13832 80.91.224.253 (5 Nov 2003 01:59:47 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Wed, 5 Nov 2003 01:59:47 +0000 (UTC) Original-X-From: mew-int-return-1594-gmmg-mew-int=m.gmane.org@mew.org Wed Nov 05 02:59:45 2003 Return-path: Original-Received: from mew2.iijlab.net ([202.232.15.102]) by deer.gmane.org with smtp (Exim 3.35 #1 (Debian)) id 1AHCxU-00036T-00 for ; Wed, 05 Nov 2003 02:59:44 +0100 Original-Received: (qmail 8537 invoked by uid 7800); 5 Nov 2003 01:59:40 -0000 Mailing-List: contact mew-int-help@mew.org; run by ezmlm Precedence: bulk List-Unsubscribe: Original-Received: (qmail 8525 invoked from network); 5 Nov 2003 01:59:40 -0000 Original-Received: from unknown (HELO omgo.iij.ad.jp) (202.232.30.157) by 202.232.15.102 with SMTP; 5 Nov 2003 01:59:40 -0000 Original-Received: from jc-smtp.iij.ad.jp ([192.168.176.35]) by omgo.iij.ad.jp (8.12.10/8.12.9) with ESMTP id hA51xVb0028157; Wed, 5 Nov 2003 10:59:31 +0900 (JST) Original-Received: from localhost (jc-ssh.iij.ad.jp [192.168.174.22]) by jc-smtp.iij.ad.jp (8.12.10/8.12.9) with ESMTP id hA51x2el025419; Wed, 5 Nov 2003 10:59:02 +0900 (JST) Original-To: emacs-devel@gnu.org, mew-int@mew.org In-Reply-To: <878ymwrmzm.fsf@tleepslib.sk.tsukuba.ac.jp> X-Mailer: Mew version 4.0.61 on Emacs 21.3.50 / Mule 5.0 (SAKAKI) X-ML-Name: mew-int X-Mail-Count: 01594 Xref: main.gmane.org gmane.mail.mew.general:519 gmane.emacs.devel:17693 X-Report-Spam: http://spam.gmane.org/gmane.emacs.devel:17693 Hello Stephen, # ordering was changed. > ctext can, because in the extended segment the characters will be > represented as themselves. Whether Mule can or not is a different > story. However, I'm fairly sure that all of the characters that > Windows 125x put into the C1 space are encodable by Mule. See > > http://www.microsoft.com/globaldev/reference/sbcs/1252.htm > > for example. Thank you for this information. I produced 0x80-0xff into a file and let Emacs read it as Windows 1252. Q1) According to the page above, 0x8f is undefined, and 0x9e is defined as LATIN SMALL LETTER Z WITH CARON. But Emacs 21.3.50 treated 0x8f as LATIN SMALL LETTER Z WITH CARON and 0x9e as undefined. Is this a bug? > You don't even need to do that with an extended segment. The > Windows-125x sets are all IANA-registered, which should be enough for > global uniqueness. To represent the text, you just use the name of > the character set: ESC % / 1 Windows-1252 STX ... where > and encode the length of the segment and ESC and STX are the ASCII > control characters 0x1B and 0x02. I save the buffer as ctext. The result file is attached below. All characters in Windows 1252 can be encoded with ctext. :-) Q2) However the encoding is different from the one above. Is this encoding correct? Note that I verified that Emacs can read the ctext file correctly. --Kazu ESC $ - 1 0xf4 0xcc ESC - A ESC $ - 1 0xf2 0xfa ESC - A ESC $ - 1 0xa1 0xd2 ESC - A ESC $ - 1 0xf2 0xfe ESC - A ESC $ - 1 0xf3 0xa6 ESC - A ESC $ - 1 0xf3 0xa0 ESC - A ESC $ - 1 0xf3 0xa1 ESC - A ESC $ - 1 0xa4 0xe6 ESC - A ESC $ - 1 0xf3 0xb0 ESC - A ESC $ - 1 0xa1 0xa0 ESC - A ESC $ - 1 0xf3 0xb9 ESC - A ESC $ - 1 0xa0 0xf2 ESC - A ESC $ - 1 0xa1 0xbd ESC - A ESC $ - 1 0xa1 0xbe ESC - A ESC $ - 1 0xf2 0xf8 ESC - A ESC $ - 1 0xf2 0xf9 ESC - A ESC $ - 1 0xf2 0xfc ESC - A ESC $ - 1 0xf2 0xfd ESC - A ESC $ - 1 0xf3 0xa2 ESC - A ESC $ - 1 0xf2 0xf3 ESC - A ESC $ - 1 0xf2 0xf4 ESC - A ESC $ - 1 0xa4 0xfc ESC - A ESC $ - 1 0xf5 0xe2 ESC - A ESC $ - 1 0xa1 0xa1 ESC - A ESC $ - 1 0xf3 0xba ESC - A ESC $ - 1 0xa0 0xf3 ESC - A ESC $ - 1 0xa1 0xb8 ESC - A 0xa0 0xa1 ...