From mboxrd@z Thu Jan 1 00:00:00 1970 Path: main.gmane.org!not-for-mail From: "James H. Cloos Jr." Newsgroups: gmane.emacs.gnus.general,gmane.emacs.devel Subject: Re: MML charset tag regression Date: 26 Apr 2003 06:50:34 -0400 Sender: ding-owner@lists.math.uh.edu Message-ID: References: <8465p3kgpl.fsf@lucy.is.informatik.uni-duisburg.de> <84bryuogke.fsf@lucy.is.informatik.uni-duisburg.de> NNTP-Posting-Host: main.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit X-Trace: main.gmane.org 1051354531 24114 80.91.224.249 (26 Apr 2003 10:55:31 GMT) X-Complaints-To: usenet@main.gmane.org NNTP-Posting-Date: Sat, 26 Apr 2003 10:55:31 +0000 (UTC) Cc: emacs-devel@gnu.org Original-X-From: ding-owner+M377@lists.math.uh.edu Sat Apr 26 12:55:29 2003 Return-path: Original-Received: from malifon.math.uh.edu ([129.7.128.13]) by main.gmane.org with esmtp (Exim 3.35 #1 (Debian)) id 199NL7-0006Go-00 for ; Sat, 26 Apr 2003 12:55:29 +0200 Original-Received: from localhost ([127.0.0.1] helo=lists.math.uh.edu) by malifon.math.uh.edu with smtp (Exim 3.20 #1) id 199NJM-0002oc-00; Sat, 26 Apr 2003 05:53:40 -0500 Original-Received: from sclp3.sclp.com ([64.157.176.121]) by malifon.math.uh.edu with smtp (Exim 3.20 #1) id 199NJD-0002oU-00 for ding@lists.math.uh.edu; Sat, 26 Apr 2003 05:53:31 -0500 Original-Received: (qmail 54176 invoked by alias); 26 Apr 2003 10:53:30 -0000 Original-Received: (qmail 54171 invoked from network); 26 Apr 2003 10:53:30 -0000 Original-Received: from ore.jhcloos.com (64.240.156.239) by sclp3.sclp.com with SMTP; 26 Apr 2003 10:53:30 -0000 Original-Received: from lugabout.jhcloos.org (ppp528.tc-1.buf-ch.ny.localnet.com [207.251.210.211]) (using TLSv1 with cipher EDH-RSA-DES-CBC3-SHA (168/168 bits)) (Client CN "lugabout.jhcloos.org", Issuer "ca.jhcloos.com" (verified OK)) by ore.jhcloos.com (Postfix) with ESMTP id 779381C2D7; Sat, 26 Apr 2003 05:52:59 -0500 (CDT) Original-Received: from lugabout.jhcloos.org (localhost [127.0.0.1]) by lugabout.jhcloos.org (Postfix on SuSE Linux 7.3 (i386)) with ESMTP id 06F4525D10; Sat, 26 Apr 2003 10:50:35 +0000 (GMT) Original-To: ding@gnus.org In-Reply-To: Original-Lines: 29 User-Agent: Gnus/5.09 (Gnus v5.9.0) Emacs/21.3.50 Precedence: bulk Xref: main.gmane.org gmane.emacs.gnus.general:51834 gmane.emacs.devel:13467 X-Report-Spam: http://spam.gmane.org/gmane.emacs.devel:13467 >>>>> "Simon" == Simon Josefsson writes: Simon> For me, when I yanked the string into emacs from galeon it Simon> becomes double-width. It is single-width in galeon though. I also see that; any pasting of cyrillic text via pasting X's primary or from the clipboard. The wide cyrillic is from the japanese-jisx0208 charset. Eg, Cyrillic т gets buffer code 0x92 0xA7 0xE4 when pasted, but 0x8C 0xE2 and charset cyrillic-iso8859-5 when typed directly, or inserted from a utf-8 encoded file. In both cases, (describe-char) shows the same value one the Unicode: line, eg 0442 for т. The next line is typed directly: тхис ис а тест This line is pasted: (this probably won't be visible after I send) The first issue is to get emacs to prefer 8859-5 over jisx0208 when pasting cyrillic utf8. The next is getting the cyrillic in jisx0208 to properly convert to utf8. I'm using GNU Emacs 21.3.50.1 (i686-pc-linux-gnu, X toolkit, Xaw3d scroll bars) of 2003-03-06 in en_US.UTF-8. -JimC