From mboxrd@z Thu Jan 1 00:00:00 1970
From: Simon Josefsson
Newsgroups: gmane.emacs.devel
Subject: Re: Cyrillic vs UTF-8
Date: Fri, 25 Apr 2003 18:54:21 +0200
Original-To: emacs-devel@gnu.org
In-Reply-To: (Simon Josefsson's message of "Fri, 25 Apr 2003 18:12:17 +0200")
User-Agent: Gnus/5.090019 (Oort Gnus v0.19) Emacs/21.3.50 (gnu/linux)
Xref: main.gmane.org gmane.emacs.devel:13451

Simon Josefsson writes:

> I think there are two problems.  Opening the file the first time
> should guess it is a utf-8 file.  Secondly, emacs should be able to
> find a font that contains the characters -- I have all font packages
> from Debian installed.  The following works fine:
>
> -Misc-Fixed-Medium-R-Normal--18-120-100-100-C-90-ISO10646-1

It seems the second problem is documented in PROBLEMS (see below).
Sorry.  Still, I don't see similar behaviour with, e.g., Mozilla, so
wouldn't it be possible to check which characters exist within the
font, and perhaps change font as appropriate?

It would be nice if there were some more information on how to set the
suggested fontset.  Reading the manual I get the impression that
'emacs -fn mule-unicode-...' should work, but it doesn't.  I also
tried setting the emacs.font X resource, but I get the same problem.
Starting emacs says:

No fonts match `mule-unicode-2500-33ff:-gnu-unifont-*-iso10646-1,mule-unicode-e000-ffff:-gnu-unifont-*-iso10646-1,mule-unicode-0100-24ff:-gnu-unifont-*-iso10646-1'

I do have GNU unifont (from Debian unstable) installed.

* Characters from the mule-unicode charsets aren't displayed under X.

XFree86 4 contains many fonts in iso10646-1 encoding which have
minimal character repertoires (whereas the encoding is meant to be a
reasonable indication of the repertoire).  Emacs may choose one of
these to display characters from the mule-unicode charsets and then
typically won't be able to find the glyphs to display many characters.
(Check with C-u C-x = .)  To avoid this, you may need to use a fontset
which sets the font for the mule-unicode sets explicitly.  E.g. to use
GNU unifont, include in the fontset spec:

    mule-unicode-2500-33ff:-gnu-unifont-*-iso10646-1,\
    mule-unicode-e000-ffff:-gnu-unifont-*-iso10646-1,\
    mule-unicode-0100-24ff:-gnu-unifont-*-iso10646-1
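
The PROBLEMS entry gives only the charset:font fragments to "include in the fontset spec", and the "No fonts match" error above is consistent with those fragments being passed on their own as if they were a font name. As far as I understand the fontset section of the Emacs manual, a complete spec begins with a base font pattern whose last two fields name the fontset, followed by the comma-separated charset:font pairs. Below is a minimal Python sketch that assembles such a spec and the matching X resource line. The base pattern, the alias "fontset-unifont", and both function names are illustrative assumptions, not something taken from the message or from PROBLEMS, and I have not verified that the result fixes the problem described here.

```python
# Sketch only: assemble a complete fontset spec from charset:font fragments.
# BASE_PATTERN and the alias "fontset-unifont" are assumptions made for
# illustration; only the three mule-unicode entries come from PROBLEMS.

BASE_PATTERN = "-*-fixed-medium-r-normal-*-18-*-*-*-*-*-fontset-unifont"

UNIFONT = "-gnu-unifont-*-iso10646-1"

# Charset name -> font pattern, in the order given in PROBLEMS.
OVERRIDES = {
    "mule-unicode-2500-33ff": UNIFONT,
    "mule-unicode-e000-ffff": UNIFONT,
    "mule-unicode-0100-24ff": UNIFONT,
}


def build_fontset_spec(base_pattern, overrides):
    """Return the base pattern followed by comma-separated charset:font pairs."""
    parts = [base_pattern]
    parts.extend(f"{charset}:{font}" for charset, font in overrides.items())
    return ",".join(parts)


def as_x_resource(spec, index=0):
    """Format a fontset spec as an Emacs.Fontset-N line for ~/.Xresources."""
    return f"Emacs.Fontset-{index}: {spec}"


if __name__ == "__main__":
    print(as_x_resource(build_fontset_spec(BASE_PATTERN, OVERRIDES)))
```

If this reading of the manual is right, the printed line would go into the X resources, and the fontset would then be selected by its alias (for example 'emacs -fn fontset-unifont') rather than by passing the mule-unicode fragments to -fn directly.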