From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Eli Zaretskii Newsgroups: gmane.emacs.devel Subject: Re: utf8 char display in buffer Date: Sat, 13 Jun 2009 09:23:59 -0400 Message-ID: References: <7I2dndeTy7sqkLLXnZ2dnUVZ_gmdnZ2d@sysmatrix.net> <4A32D54D.1040405@mousecar.com> <4A32E6F6.5080501@mousecar.com> <4A339BED.4020105@mousecar.com> Reply-To: Eli Zaretskii NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=iso-8859-15 Content-Transfer-Encoding: 8bit X-Trace: ger.gmane.org 1244899893 27878 80.91.229.12 (13 Jun 2009 13:31:33 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Sat, 13 Jun 2009 13:31:33 +0000 (UTC) Cc: emacs-devel@gnu.org To: gebser@mousecar.com Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Sat Jun 13 15:31:30 2009 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by lo.gmane.org with esmtp (Exim 4.50) id 1MFTKS-0003Md-Nj for ged-emacs-devel@m.gmane.org; Sat, 13 Jun 2009 15:31:29 +0200 Original-Received: from localhost ([127.0.0.1]:60514 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1MFTKS-0002YT-57 for ged-emacs-devel@m.gmane.org; Sat, 13 Jun 2009 09:31:28 -0400 Original-Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1MFTJY-000220-TQ for emacs-devel@gnu.org; Sat, 13 Jun 2009 09:30:32 -0400 Original-Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43) id 1MFTJR-0001xP-Jd for emacs-devel@gnu.org; Sat, 13 Jun 2009 09:30:30 -0400 Original-Received: from [199.232.76.173] (port=53922 helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1MFTJR-0001xH-2k for emacs-devel@gnu.org; Sat, 13 Jun 2009 09:30:25 -0400 Original-Received: from fencepost.gnu.org ([140.186.70.10]:60362) by monty-python.gnu.org with esmtp (Exim 4.60) (envelope-from ) id 1MFTJQ-0006ut-IN for emacs-devel@gnu.org; Sat, 13 Jun 2009 09:30:24 -0400 Original-Received: from eliz by fencepost.gnu.org with local (Exim 4.67) (envelope-from ) id 1MFTDD-0006e1-As; Sat, 13 Jun 2009 09:23:59 -0400 In-reply-to: <4A339BED.4020105@mousecar.com> (message from ken on Sat, 13 Jun 2009 08:30:37 -0400) X-detected-operating-system: by monty-python.gnu.org: GNU/Linux 2.6, seldom 2.4 (older, 4) X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:111490 Archived-At: > Date: Sat, 13 Jun 2009 08:30:37 -0400 > From: ken > Reply-To: gebser@mousecar.com > > $-1 ¡ is 'a' with a horizontal bar over it. On first inputting it > (after doing "set-input-method latin-4-postfix" and before changing the > input method to anything else), it appears correctly and "C-u C-x =" yields: > > ============================================= > > character: $-1 ¡ (05140, 2656, 0xa60) > charset: latin-iso8859-4 > (Right-Hand Part of Latin Alphabet 4 (ISO/IEC 8859-4): ISO-IR-110) > code point: 96 > syntax: word > category: l:Latin > buffer code: 0x84 0xE0 > file code: 0xC4 0x81 (encoded by coding system mule-utf-8-unix) > font: -ETL-Fixed-Medium-R-Normal--16-160-72-72-C-80-ISO8859-4 > > ============================================= > > When I reload the file (revisit the file), the same character is > replaced with a little box. Doing "C-u C-x =" here yields: > > ============================================= > > character: $-1 ¡ (01210041, 331809, 0x51021) > charset: mule-unicode-0100-24ff > (Unicode characters of the range U+0100..U+24FF.) > code point: 32 33 > syntax: word > category: l:Latin > buffer code: 0x9C 0xF4 0xA0 0xA1 > file code: 0xC4 0x81 (encoded by coding system mule-utf-8-unix) > font: -- none -- > > ============================================= So I think everything is clear now: you have a font that covers this characters when they are from the 8859-4 character set, but you do not have a font that covers them in Unicode. You should install the Unicode font that supports these characters. > As for the meaning of the two outputs above, all that I can confidently > glean is that, if I want to use non-English characters in emacs, I have > to be an expert emacs developer. :) That's exaggeration, I think. You can use the "C-u C-x =" command, just as you did above, to find out what Emacs thinks about each character that is displayed as an empty box. You can then look for fonts that cover these characters. "C-u C-x =" is a user-level command, and one of its uses is precisely this: to find out what fonts are missing on your machine.