From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Kenichi Handa Newsgroups: gmane.emacs.bugs Subject: Re: bites assumed set mid UTF-8 Date: Tue, 07 Mar 2006 13:52:33 +0900 Message-ID: References: <87r75fignl.fsf@jidanni.org> NNTP-Posting-Host: main.gmane.org Mime-Version: 1.0 (generated by SEMI 1.14.3 - "Ushinoya") Content-Type: text/plain; charset=US-ASCII X-Trace: sea.gmane.org 1141749028 3127 80.91.229.2 (7 Mar 2006 16:30:28 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Tue, 7 Mar 2006 16:30:28 +0000 (UTC) Cc: bug-gnu-emacs@gnu.org, handa@m17n.org Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Tue Mar 07 17:30:22 2006 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by ciao.gmane.org with esmtp (Exim 4.43) id 1FGf4T-0000N3-ND for geb-bug-gnu-emacs@m.gmane.org; Tue, 07 Mar 2006 17:30:06 +0100 Original-Received: from localhost ([127.0.0.1] helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1FGf4S-0003Dv-KL for geb-bug-gnu-emacs@m.gmane.org; Tue, 07 Mar 2006 11:30:00 -0500 Original-Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1FGVeG-0003Xg-2w for bug-gnu-emacs@gnu.org; Tue, 07 Mar 2006 01:26:20 -0500 Original-Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43) id 1FGVeE-0003Ws-Ce for bug-gnu-emacs@gnu.org; Tue, 07 Mar 2006 01:26:19 -0500 Original-Received: from [199.232.76.173] (helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1FGVIG-0001tV-0r for bug-gnu-emacs@gnu.org; Tue, 07 Mar 2006 01:03:36 -0500 Original-Received: from [192.47.44.130] (helo=tsukuba.m17n.org) by monty-python.gnu.org with esmtps (TLS-1.0:DHE_RSA_AES_256_CBC_SHA:32) (Exim 4.52) id 1FGUE7-0005t0-ER for bug-gnu-emacs@gnu.org; Mon, 06 Mar 2006 23:55:15 -0500 Original-Received: from nfs.m17n.org (nfs.m17n.org [192.47.44.7]) by tsukuba.m17n.org (8.13.4/8.13.4/Debian-3) with ESMTP id k274qZDA023584; Tue, 7 Mar 2006 13:52:35 +0900 Original-Received: from etlken (etlken.m17n.org [192.47.44.125]) by nfs.m17n.org (8.13.4/8.13.4/Debian-3) with ESMTP id k274qYK9032259; Tue, 7 Mar 2006 13:52:35 +0900 Original-Received: from handa by etlken with local (Exim 3.36 #1 (Debian)) id 1FGUBV-00068M-00; Tue, 07 Mar 2006 13:52:33 +0900 Original-To: Dan Jacobson In-reply-to: <87r75fignl.fsf@jidanni.org> (message from Dan Jacobson on Tue, 07 Mar 2006 02:15:10 +0800) User-Agent: SEMI/1.14.3 (Ushinoya) FLIM/1.14.2 (Yagi-Nishiguchi) APEL/10.2 Emacs/22.0.50 (i686-pc-linux-gnu) MULE/5.0 (SAKAKI) X-Mailman-Approved-At: Tue, 07 Mar 2006 10:47:21 -0500 X-BeenThere: bug-gnu-emacs@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.bugs:14945 Archived-At: In article <87r75fignl.fsf@jidanni.org>, Dan Jacobson writes: > Bad bad bad. Emacs 21.4.1 shows the same Chinese character ("nuclear") > even though the second bit string is not valid UTF-8. Cc'd Handa. > 11100110 10100000 10111000 > 11100110 00100000 10111000 Thank you for the report. It is already fixed in the latest CVS code. With it, the second byte sequence (invalid utf-8) is decoded into "\346 \270" (i.e. 8-bit-char #xe6, SPC, 8-bit-char #xb8). --- Kenichi Handa handa@m17n.org