From: Kenichi Handa
Newsgroups: gmane.emacs.devel
Subject: Re: Reporting UTF-8 related problems?
Date: Tue, 30 Jul 2002 17:30:44 +0900 (JST)
Message-ID: <200207300830.RAA06122@etlken.m17n.org>
To: schwab@suse.de
Cc: keichwa@gmx.net, eliz@is.elta.co.il, emacs-devel@gnu.org

Andreas Schwab writes:

> |> „Die Familie Schroffenstein“
> |>
> |> I thought that the notation &#NUMBER is for transmitting
> |> Unicode character of code NUMBER. But, 132 and 147 are
> |> control codes in Unicode, not any kind of quotings. Do you
> |> know a proper web page describing the meaning of them?

> The numbers are supposed to be ISO 8859-1 characters codes. I'd guess the
> page has been written with some broken (a.k.a. W*nd*ws) software (the use
> of *.htm makes this apparent). There is no hope for being compliant to
> any standard. I tried to validate it through the W3.org validator, but no
> document type matches.

Ah, I see. I found that windows-125X maps 132 and 147 to
U+201E and U+201C. So, perhaps those systems (galeon and
lynx) parse them as U+201E and U+201C.

Anyway, how to encode them in X selection is their problem
and Emacs can't do anything about it.

---
Ken'ichi HANDA
handa@etl.go.jp
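
The mapping mentioned above (132 and 147 read as windows-125X bytes rather than as Unicode code points) can be checked with a short Python sketch. This is only an illustration: the helper name interpret_ncr is hypothetical, and nothing here is claimed to be how galeon or lynx actually handle such references; it uses Python's standard cp1252 codec as one representative of the windows-125X family.

```python
import unicodedata


def interpret_ncr(number, codec="cp1252"):
    """Return (strict, lenient) readings of the numeric reference &#number;.

    interpret_ncr is a hypothetical helper for illustration only.
    strict  -- the Unicode code point the reference literally names
    lenient -- the same number treated as a single byte in a Windows code page
    """
    strict = chr(number)
    lenient = bytes([number]).decode(codec)
    return strict, lenient


for n in (132, 147):
    strict, lenient = interpret_ncr(n)
    # The strict reading is a C1 control character (category "Cc");
    # the lenient reading is a quotation mark.
    print(n,
          unicodedata.category(strict),
          "U+%04X" % ord(lenient),
          unicodedata.name(lenient))
```

Under the strict reading both numbers name control characters, which matches the observation in the quoted text; under the code-page reading they come out as the low and high double quotation marks.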