From: Kazu Yamamoto (山本和彦)
Newsgroups: gmane.mail.mew.general,gmane.emacs.devel
Subject: [mew-int 01607] Re: windows 1252
Date: Mon, 10 Nov 2003 16:11:23 +0900 (JST)
Message-ID: <20031110.161123.49979847.kazu@iijlab.net>
References: <87llqzuvaj.fsf@tleepslib.sk.tsukuba.ac.jp> <20031104.111334.60445673.kazu@iijlab.net> <200311070713.QAA24793@etlken.m17n.org>
In-Reply-To: <200311070713.QAA24793@etlken.m17n.org>
Original-To: emacs-devel@gnu.org, mew-int@mew.org
X-Mailer: Mew version 4.0.61 on Emacs 21.3.50 / Mule 5.0 (SAKAKI)

Hello Handa-san,

Thank you for your explanation.

> (2) ctext (alias of compound-text)
>
> On conversion, it works not fully compatible with the
> specification of X Compound Text because it encodes any
> Emacs characters while using an designation sequence for
> private character sets (please note that all Emacs charasets
> have a iso-final-char). So, Big5 characters are preceded by
> ESC $ ( 0 or 1, mule-unicode-0100-24ff characters are
> preceded by ESC - 1.
              ^^^^^^^

Let me clarify.

Q1) It seems to me that Emacs encodes mule-unicode-0100-24ff with
ESC $ - 1. But the explanation above says ESC - 1. Which one is
correct according to Emacs's spec?

Q2) I don't think it's a good idea to expose the internal
representation "mule-unicode-0100-24ff" in a file. According to the
spec of ctext provided with XFree86, it has an extension for UTF-8:

---
7. The UTF-8 encoding

Unicode characters that are not contained in one of the approved
standard encodings can be encoded using the UTF-8 encoding. The
following escape sequences are used:

    01/11 02/05 04/07    switch into UTF-8 mode
    01/11 02/05 04/00    return from UTF-8 mode

The first is the ISO registered sequence for UTF-8 (ISO-IR-196),
the second is the ISO-2022 ``standard return'' sequence.

While in UTF-8 mode, the UTF-8 encoding replaces the currently
designated GL and GR encodings. After return from UTF-8 mode, the
previously designated GL and GR encodings are reactivated.
---

How about using this to encode mule-unicode-0100-24ff?

> When it runs under emacs-unicode version, on writing the
> file, if all the characters can be encoded by ctext, keep
> using it. If not (because, in emacs-unicode, some character
> doesn't belong to any charset that has iso-final-char), use
> utf-8. And in both cases, add a coding tag. On reading,
> check the coding tag at first.
> If no coding tag, read by ctext, otherwise, read by the coding
> system specified in the tag.

I remember that, some years ago, Handa-san said to me, "The current
Emacs is using mule-unicode but will migrate to Unicode". But I
don't know exactly what emacs-unicode refers to. Which versions? Or
is it a different source tree?

--Kazu
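
For illustration only, here is a minimal Python sketch of the UTF-8 extension quoted above from the XFree86 ctext spec. The column/row notation 01/11 02/05 04/07 corresponds to the bytes ESC % G, and 01/11 02/05 04/00 to ESC % @. The names `UTF8_ENTER`, `UTF8_LEAVE`, `col_row`, and `wrap_utf8` are hypothetical and are not part of Emacs, Mew, or Xlib; the sketch only frames a string with the two escape sequences and does not implement a full ctext encoder.

```python
# Sketch: frame text with the compound-text UTF-8 escape sequences.
# All names here are hypothetical helpers, not an Emacs or Xlib API.

ESC = b"\x1b"                # 01/11
UTF8_ENTER = ESC + b"%G"     # 01/11 02/05 04/07: switch into UTF-8 mode
UTF8_LEAVE = ESC + b"%@"     # 01/11 02/05 04/00: return from UTF-8 mode


def col_row(byte: int) -> str:
    """Render a byte in the ISO 2022 column/row notation used by the spec."""
    return f"{byte >> 4:02d}/{byte & 0x0F:02d}"


def wrap_utf8(text: str) -> bytes:
    """Encode text as UTF-8 and bracket it with the enter/leave sequences."""
    return UTF8_ENTER + text.encode("utf-8") + UTF8_LEAVE


if __name__ == "__main__":
    # Show the escape sequences in the spec's notation, then a framed sample.
    print(" ".join(col_row(b) for b in UTF8_ENTER))
    print(" ".join(col_row(b) for b in UTF8_LEAVE))
    print(wrap_utf8("\u00e9"))
```

A reader of such a stream would switch to UTF-8 decoding on seeing `UTF8_ENTER` and restore the previously designated GL and GR encodings on seeing `UTF8_LEAVE`, as the quoted section describes.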