From mboxrd@z Thu Jan 1 00:00:00 1970 Path: main.gmane.org!not-for-mail From: Kenichi Handa Newsgroups: gmane.emacs.gnus.general,gmane.emacs.devel Subject: Re: MML charset tag regression Date: Thu, 22 May 2003 09:58:22 +0900 (JST) Sender: ding-owner@lists.math.uh.edu Message-ID: <200305220058.JAA06943@etlken.m17n.org> References: <8B17870A-8BA8-11D7-8E1F-00039363E640@swipnet.se> NNTP-Posting-Host: main.gmane.org Mime-Version: 1.0 (generated by SEMI 1.14.3 - "Ushinoya") Content-Type: text/plain; charset=US-ASCII X-Trace: main.gmane.org 1053565064 13650 80.91.224.249 (22 May 2003 00:57:44 GMT) X-Complaints-To: usenet@main.gmane.org NNTP-Posting-Date: Thu, 22 May 2003 00:57:44 +0000 (UTC) Cc: rms@gnu.org, emacs-devel@gnu.org, jas@extundo.com, ding@gnus.org Original-X-From: ding-owner+M1333@lists.math.uh.edu Thu May 22 02:57:42 2003 Return-path: Original-Received: from malifon.math.uh.edu ([129.7.128.13]) by main.gmane.org with esmtp (Exim 3.35 #1 (Debian)) id 19IeOs-0003Y0-00 for ; Thu, 22 May 2003 02:57:42 +0200 Original-Received: from localhost ([127.0.0.1] helo=lists.math.uh.edu) by malifon.math.uh.edu with smtp (Exim 3.20 #1) id 19IePm-0002wt-00; Wed, 21 May 2003 19:58:38 -0500 Original-Received: from sclp3.sclp.com ([64.157.176.121]) by malifon.math.uh.edu with smtp (Exim 3.20 #1) id 19IePh-0002wo-00 for ding@lists.math.uh.edu; Wed, 21 May 2003 19:58:33 -0500 Original-Received: (qmail 33598 invoked by alias); 22 May 2003 00:58:32 -0000 Original-Received: (qmail 33593 invoked from network); 22 May 2003 00:58:32 -0000 Original-Received: from tsukuba.m17n.org (192.47.44.130) by sclp3.sclp.com with SMTP; 22 May 2003 00:58:32 -0000 Original-Received: from fs.m17n.org (fs.m17n.org [192.47.44.2]) by tsukuba.m17n.org (8.11.6p2/3.7W-20010518204228) with ESMTP id h4M0wNu11678; Thu, 22 May 2003 09:58:23 +0900 (JST) (envelope-from handa@m17n.org) Original-Received: from etlken.m17n.org (etlken.m17n.org [192.47.44.125]) by fs.m17n.org (8.11.6/3.7W-20010823150639) with ESMTP id h4M0wM918476; Thu, 22 May 2003 09:58:22 +0900 (JST) Original-Received: (from handa@localhost) by etlken.m17n.org (8.8.8+Sun/3.7W-2001040620) id JAA06943; Thu, 22 May 2003 09:58:22 +0900 (JST) Original-To: jan.h.d@swipnet.se In-reply-to: <8B17870A-8BA8-11D7-8E1F-00039363E640@swipnet.se> (jan.h.d@swipnet.se) User-Agent: SEMI/1.14.3 (Ushinoya) FLIM/1.14.2 (Yagi-Nishiguchi) APEL/10.2 Emacs/21.2.92 (sparc-sun-solaris2.6) MULE/5.0 (SAKAKI) Precedence: bulk Xref: main.gmane.org gmane.emacs.gnus.general:52789 gmane.emacs.devel:14070 X-Report-Spam: http://spam.gmane.org/gmane.emacs.devel:14070 In article <8B17870A-8BA8-11D7-8E1F-00039363E640@swipnet.se>, "Jan D." writes: > I do not understand the issues involved in detail, I do not know what > we need and why. In my naive reasoning, Emacs would be fine if it > requested UTF8_STRING first and COMPOUND_TEXT second always. This is > what the free desktop documents seems to recommend. Or prehaps request > TARGETS first to check for UTF8_STRING, and use that if available and > something else if not (COMPOUND_STRING, STRING or TEXT in that order?). The current Emacs still don't unify Unicode and the other legacy charsets (e.g. iso-8859-2, jisx0208, gb2312) automatically. So, for instance, if iso-8859-2 characters arrive at Emacs with UTF8_STRING, they are decoded into the charset mule-unicode-0100-24ff and treated differently (e.g. in searching) than the characters of the charset iso-8859-2. --- Ken'ichi HANDA handa@m17n.org