From mboxrd@z Thu Jan 1 00:00:00 1970 Path: main.gmane.org!not-for-mail From: Miles Bader Newsgroups: gmane.emacs.devel Subject: Re: utf-8 cjk translation bug? Date: 06 Oct 2003 11:29:25 +0900 Sender: emacs-devel-bounces+emacs-devel=quimby.gnus.org@gnu.org Message-ID: References: <200309301259.VAA01304@etlken.m17n.org> <200310020108.KAA03803@etlken.m17n.org> <3F7DA52B.2060208@gnu.org> Reply-To: Miles Bader NNTP-Posting-Host: deer.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Trace: sea.gmane.org 1065407480 6510 80.91.224.253 (6 Oct 2003 02:31:20 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Mon, 6 Oct 2003 02:31:20 +0000 (UTC) Cc: Dave Love , emacs-devel@gnu.org Original-X-From: emacs-devel-bounces+emacs-devel=quimby.gnus.org@gnu.org Mon Oct 06 04:31:18 2003 Return-path: Original-Received: from quimby.gnus.org ([80.91.224.244]) by deer.gmane.org with esmtp (Exim 3.35 #1 (Debian)) id 1A6L9a-0006Z9-00 for ; Mon, 06 Oct 2003 04:31:18 +0200 Original-Received: from monty-python.gnu.org ([199.232.76.173]) by quimby.gnus.org with esmtp (Exim 3.35 #1 (Debian)) id 1A6L9a-0004QT-00 for ; Mon, 06 Oct 2003 04:31:18 +0200 Original-Received: from localhost ([127.0.0.1] helo=monty-python.gnu.org) by monty-python.gnu.org with esmtp (Exim 4.24) id 1A6L9Y-0006pk-V3 for emacs-devel@quimby.gnus.org; Sun, 05 Oct 2003 22:31:16 -0400 Original-Received: from list by monty-python.gnu.org with tmda-scanned (Exim 4.24) id 1A6L9T-0006n1-HN for emacs-devel@gnu.org; Sun, 05 Oct 2003 22:31:11 -0400 Original-Received: from mail by monty-python.gnu.org with spam-scanned (Exim 4.24) id 1A6L99-0006Ug-Cu for emacs-devel@gnu.org; Sun, 05 Oct 2003 22:31:10 -0400 Original-Received: from [202.32.8.214] (helo=TYO201.gate.nec.co.jp) by monty-python.gnu.org with esmtp (Exim 4.24) id 1A6L7y-0005Mu-Sz; Sun, 05 Oct 2003 22:29:39 -0400 Original-Received: from mailgate3.nec.co.jp (mailgate53.nec.co.jp [10.7.69.194]) by TYO201.gate.nec.co.jp (8.11.7/3.7W01080315) with ESMTP id h962TTg04313; Mon, 6 Oct 2003 11:29:30 +0900 (JST) Original-Received: from mailsv.nec.co.jp (mailgate51.nec.co.jp [10.7.69.196]) by mailgate3.nec.co.jp (8.11.7/3.7W-MAILGATE-NEC) with ESMTP id h962TSf28018; Mon, 6 Oct 2003 11:29:28 +0900 (JST) Original-Received: from edtmg01.lsi.nec.co.jp ([10.26.16.201]) by mailsv.nec.co.jp (8.11.7/3.7W-MAILSV-NEC) with ESMTP id h962TRA06792; Mon, 6 Oct 2003 11:29:27 +0900 (JST) Original-Received: from mcsss2.ucom.lsi.nec.co.jp (localhost [127.0.0.1]) by edtmg01.lsi.nec.co.jp (8.9.3p2+3.2W/3.7W_EDC_Ver.1.0) with ESMTP id LAA24441; Mon, 6 Oct 2003 11:29:27 +0900 (JST) Original-Received: from mcspd15.ucom.lsi.nec.co.jp (mcspd15 [10.30.114.174]) by mcsss2.ucom.lsi.nec.co.jp (8.12.10/8.12.8/EDcg v2.01-mc/1046780839) with ESMTP id h962TP7Q011788; Mon, 6 Oct 2003 11:29:26 +0900 (JST) Original-Received: by mcspd15.ucom.lsi.nec.co.jp (Postfix, from userid 31295) id 65633370D; Mon, 6 Oct 2003 11:29:25 +0900 (JST) Original-To: Jason Rumney System-Type: i686-pc-linux-gnu Blat: Foop In-Reply-To: <3F7DA52B.2060208@gnu.org> Original-Lines: 36 X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.2 Precedence: list List-Id: Emacs development discussions. List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+emacs-devel=quimby.gnus.org@gnu.org Xref: main.gmane.org gmane.emacs.devel:16955 X-Report-Spam: http://spam.gmane.org/gmane.emacs.devel:16955 Jason Rumney writes: > > I would have expected them to have iso10646 fonts if they are using > > utf-8 (for the sake of applications other than Emacs) but maybe that > > isn't the case. >=20 > I think the problem is not that they don't have iso10646 fonts, it is > that the iso10646 fonts they do have do not contain any of the double > width characters, including double width roman that is in the > 2500-33ff range. Yeah, that's definitely the case, and it's not just a problem with double-width characters -- the coverage of many iso10646 fonts seems completely crap. E.g., see a post by `Danilo Segan' on this list. It apparently contains cyrillic characters encoded in UTF-8, which emacs dutifully tries to render using an iso10646 font, but show up as square boxes on my system... Here's the output of `C-u C-x =3D', in case anyone is interested: character: =D1=81 (01212141, 332897, 0x51461, U+0441) charset: mule-unicode-0100-24ff (Unicode characters of the range U+0100..U+24FF.) code point: 40 97 syntax: w which means: word category: y:Cyrillic=20=20 buffer code: 0x9C 0xF4 0xA8 0xE1 file code: 0x9C 0xF4 0xA8 0xE1 (encoded by coding system raw-text-unix) display: by this font (glyph code) -bitstream-bitstream vera sans mono-medium-r-normal--16-122-95-95-c= -100-iso10646-1 (0x441) -Miles --=20 `Suppose Korea goes to the World Cup final against Japan and wins,' Moon sa= id. `All the past could be forgiven.' [NYT]