From: ken
Newsgroups: gmane.emacs.help,gmane.emacs.devel
Subject: Re: utf8 char display in buffer
Date: Sat, 13 Jun 2009 08:30:37 -0400
Message-ID: <4A339BED.4020105@mousecar.com>
References: <7I2dndeTy7sqkLLXnZ2dnUVZ_gmdnZ2d@sysmatrix.net> <4A32D54D.1040405@mousecar.com> <4A32E6F6.5080501@mousecar.com>
Reply-To: gebser@mousecar.com
To: Eli Zaretskii, GNU Emacs List, emacs-devel@gnu.org

On 06/13/2009 12:11 AM Eli Zaretskii wrote:
>> ....
>
> Please provide the output of "C-u C-x =" on these characters, both
> when they are displayed correctly and when they are displayed as empty
> boxes.

In a similar post on the same thread Eli Zaretskii wrote:
> Please post here the full output of "C-u C-x =" (from a buffer popped
> up by Emacs) for these characters, both when you type them using the
> appropriate input method and they are displayed correctly (as in 1)
> above), and when you see them as empty boxes after revisiting the
> file. The differences between these two cases should give you a hint
> what is wrong; if not, someone else here might have ideas.

Eli, thanks for your response. Here it is:

$-1 ¡ is 'a' with a horizontal bar over it.
On first inputting it (after doing "set-input-method latin-4-postfix" and before changing the input method to anything else), it appears correctly and "C-u C-x =" yields:

=============================================
  character: $-1 ¡ (05140, 2656, 0xa60)
    charset: latin-iso8859-4 (Right-Hand Part of Latin Alphabet 4 (ISO/IEC 8859-4): ISO-IR-110)
 code point: 96
     syntax: word
   category: l:Latin
buffer code: 0x84 0xE0
  file code: 0xC4 0x81 (encoded by coding system mule-utf-8-unix)
       font: -ETL-Fixed-Medium-R-Normal--16-160-72-72-C-80-ISO8859-4
=============================================

When I reload (revisit) the file, the same character is replaced with a little box. Doing "C-u C-x =" here yields:

=============================================
  character: $-1 ¡ (01210041, 331809, 0x51021)
    charset: mule-unicode-0100-24ff (Unicode characters of the range U+0100..U+24FF.)
 code point: 32 33
     syntax: word
   category: l:Latin
buffer code: 0x9C 0xF4 0xA0 0xA1
  file code: 0xC4 0x81 (encoded by coding system mule-utf-8-unix)
       font: -- none --
=============================================

Note: For some reason, possibly related, I had difficulty copying the above text from Emacs into the clipboard (i.e., "M-w" didn't do anything), so I had to use a workaround. It seems that this workaround altered the character in question, the one following each of the two instances of "character:" above.

As for the meaning of the two outputs above, all that I can confidently glean is that, if I want to use non-English characters in Emacs, I have to be an expert Emacs developer. :)
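
The byte values in the two outputs can be cross-checked outside Emacs. What follows is a minimal Python sketch, not part of the original report. It assumes the intended character is the one that the reported file code 0xC4 0x81 decodes to as UTF-8. It also assumes that mule-unicode-0100-24ff numbers its characters on a 96x96 grid starting at U+0100, with 32 added to each position; that layout is an assumption here, not something the outputs above state.

```python
import unicodedata

# File code reported by "C-u C-x =" in both cases above.
file_bytes = bytes([0xC4, 0x81])

# Decode the on-disk bytes as UTF-8 and look up the character's name.
char = file_bytes.decode("utf-8")
name = unicodedata.name(char)

# The same character encoded in ISO 8859-4 (Latin-4).
latin4_byte = char.encode("iso8859_4")[0]

# "Right-hand part" code point: the Latin-4 byte with the high bit dropped.
latin4_code_point = latin4_byte & 0x7F

# ASSUMPTION: mule-unicode-0100-24ff is laid out as a 96x96 grid that
# starts at U+0100, with 32 added to each of the two positions.
offset = ord(char) - 0x100
mule_code_point = (offset // 96 + 32, offset % 96 + 32)

print(f"U+{ord(char):04X} {name}")
print(f"Latin-4 byte: 0x{latin4_byte:02X}, code point: {latin4_code_point}")
print(f"mule-unicode-0100-24ff code point: {mule_code_point[0]} {mule_code_point[1]}")
```

If those assumptions hold, the Latin-4 byte and code point match the first output (0xE0 and 96) and the computed pair matches the "32 33" in the second. Both outputs would then describe the same character with identical bytes on disk, so the visible difference would be only the charset Emacs assigns on revisiting the file and the lack of a font for it ("font: -- none --").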