From: Katsumi Yamaoka
Newsgroups: gmane.emacs.devel,gmane.emacs.gnus.general
Subject: Re: [Unicode-2] `read' always returns multibyte symbol
Date: Thu, 15 Nov 2007 20:41:33 +0900
Organization: Emacsen advocacy group
To: Kenichi Handa
Cc: ding@gnus.org, emacs-devel@gnu.org
User-Agent: Gnus/5.110007 (No Gnus v0.7) Emacs/23.0.60 (gnu/linux)

>>>>> Kenichi Handa wrote:

> In article ,
> Katsumi Yamaoka writes:

>> I realized a network process that is created by
>> `open-network-stream' in Emacs 21 breaks encoded non-ASCII group
>> names if the process buffer is in the multibyte mode even if the
>> process coding system is binary.
>> It behaves as if
>> `toggle-enable-multibyte-characters' modifies binary data when
>> turning on the multibyteness of a buffer.

(The changes that I made in nntp.el have been archived in .)

> If "modifies" means that 8-bit bytes are converted to
> multibyte characters as what string-as-multibyte does, it's
> an expected behaviour.

What I observed was different.  The group name "テスト" is
encoded in utf-8 by the nntp server into:

"\343\203\206\343\202\271\343\203\210"

After it is transferred to Gnus, in the nntp process buffer it
is modified into:

"\343\203XY\343\203\210"

where X is (make-char 'greek-iso8859-7 99) and Y is
(make-char 'latin-iso8859-2 57).  Since Gnus treats a group name
as a unibyte string, it finally becomes:

"\343\203\343\271\343\203\210"

> I long ago proposed a facility that turns on the
> multibyteness of a buffer while converting 8-bit bytes to
> multibyte characters as what string-to-multibyte does, but
> not accepted.

But modern Emacsen do so, don't they?
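
To make the byte sequences above easier to follow, here is a rough Python model of the corruption. It is only a sketch, not Emacs code. It assumes that a byte which is a charset leading code, followed by a byte of 0xA0 or above, is read as one two-byte character when the buffer is multibyte, and that the leading byte is lost when the text goes back to a unibyte string. The names LEADING_CODES, as_multibyte and to_unibyte are made up for this illustration, and the table lists only the two charsets that appear in this message.

```python
# Simplified model of the reported corruption (an assumption, not Emacs source).
# Only the two leading codes relevant to this message are listed.
LEADING_CODES = {0x82: "latin-iso8859-2", 0x86: "greek-iso8859-7"}


def as_multibyte(raw: bytes) -> list:
    """Reinterpret raw bytes the way a multibyte buffer appears to have done.

    A known leading code followed by a byte >= 0xA0 becomes a
    (charset, code) pair; every other byte is kept as a plain integer.
    """
    out = []
    i = 0
    while i < len(raw):
        b = raw[i]
        if b in LEADING_CODES and i + 1 < len(raw) and raw[i + 1] >= 0xA0:
            out.append((LEADING_CODES[b], raw[i + 1] & 0x7F))
            i += 2
        else:
            out.append(b)
            i += 1
    return out


def to_unibyte(items: list) -> bytes:
    """Flatten back to bytes; each character keeps only its trailing byte."""
    return bytes(x if isinstance(x, int) else (x[1] | 0x80) for x in items)


wire = "テスト".encode("utf-8")   # what the nntp server sends
in_buffer = as_multibyte(wire)    # what the process buffer holds
damaged = to_unibyte(in_buffer)   # what Gnus ends up with

print(in_buffer)
print(damaged)
```

Under these assumptions, \206\343 is taken as the greek-iso8859-7 character with code 99 and \202\271 as the latin-iso8859-2 character with code 57, so two of the nine original bytes are missing after the round trip.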