From mboxrd@z Thu Jan 1 00:00:00 1970 Path: main.gmane.org!not-for-mail From: Oliver Scholz Newsgroups: gmane.emacs.help Subject: Re: Problem with UTF-8 Date: Wed, 03 Dec 2003 14:33:23 +0100 Sender: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Message-ID: References: NNTP-Posting-Host: deer.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit X-Trace: sea.gmane.org 1070459078 27668 80.91.224.253 (3 Dec 2003 13:44:38 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Wed, 3 Dec 2003 13:44:38 +0000 (UTC) Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Wed Dec 03 14:44:35 2003 Return-path: Original-Received: from monty-python.gnu.org ([199.232.76.173]) by deer.gmane.org with esmtp (Exim 3.35 #1 (Debian)) id 1ARXIx-0002Ak-00 for ; Wed, 03 Dec 2003 14:44:35 +0100 Original-Received: from localhost ([127.0.0.1] helo=monty-python.gnu.org) by monty-python.gnu.org with esmtp (Exim 4.24) id 1ARYG5-0003y1-78 for geh-help-gnu-emacs@m.gmane.org; Wed, 03 Dec 2003 09:45:41 -0500 Original-Path: shelby.stanford.edu!newsfeed.stanford.edu!news-spur1.maxwell.syr.edu!news.maxwell.syr.edu!fu-berlin.de!uni-berlin.de!dialin-145-254-193-187.arcor-ip.NET!not-for-mail Original-Newsgroups: gnu.emacs.help Original-Lines: 53 Original-NNTP-Posting-Host: dialin-145-254-193-187.arcor-ip.net (145.254.193.187) Original-X-Trace: news.uni-berlin.de 1070458573 70626857 145.254.193.187 ([87814]) X-Attribution: os X-Face: "HgH2sgK|bfH$; PiOJI6|qUCf.ve<51_Od(%ynHr?=>znn#~#oS>",F%B8&\vus),2AsPYb -n>PgddtGEn}s7kH?7kH{P_~vu?]OvVN^qD(L)>G^gDCl(U9n{:d>'DkilN!_K"eNzjrtI4Ya6; Td% IZGMbJ{lawG+'J>QXPZD&TwWU@^~A}f^zAb[Ru;CT(UA]c& User-Agent: Gnus/5.1002 (Gnus v5.10.2) Emacs/22.0.0 (windows-nt) Cancel-Lock: sha1:UsO/woUOAH3DlFN1R8O4wqA3w6XDsyRIw7h1w7s= Original-Xref: shelby.stanford.edu gnu.emacs.help:118943 Original-To: help-gnu-emacs@gnu.org X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.2 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Xref: main.gmane.org gmane.emacs.help:14884 X-Report-Spam: http://spam.gmane.org/gmane.emacs.help:14884 Konstantinos Efstathiou writes: > When writing the file: > > character: .[alpha] (05541, 2913, 0xb61, U+03B1).-A > charset: greek-iso8859-7 [...] > display: by this font (glyph code) > -etl-fixed-medium-r-normal--14-140-75-75-m-140-iso8859-7 (0xE1) > > After saving and reopening the file: > > character: .[square] (01211661, 332721, 0x513b1, U+03B1).-A > charset: mule-unicode-0100-24ff [...] > display: no font avairable I see. It is a font problem. You need to create a fontset in order to solve it. Look for an appropriate ISO 10646-1 font and set Emacs up to use it for the charset mule-unicode-0100-24ff. For example this should work on GNU/Linux: (create-fontset-from-fontset-spec "\ -b&h-lucidatypewriter-medium-r-*-*-14-*-*-*-*-*-fontset-konstantinos,\ latin-iso8859-15:-b&h-lucidatypewriter-medium-r-*-*-14-*-*-*-*-*-iso8859-1,\ mule-unicode-0100-24ff:-*-fixed-medium-r-*-*-14-*-*-*-*-*-iso10646-1") (set-face-font 'default "fontset-konstantinos") Maybe there is a ISO 10646 variant of the ETL font that you are using? Some people prefer to do it via X-ressources, but I don't use that and I am too lazy to look the syntax up in the manual. What is happening? Emacs internal encoding (emacs-mule) is divided into several so-called "charsets". Every character in a buffer belongs to a charset and some charsets contain the same (abstract) characters. The Greek input method you use obviously inserts chars from `greek-iso8859-7'. (Some or all (?) of the input methods for classical Greek, for example, use `mule-unicode-0100-24ff'.) Font resolving and similar things depend on those charsets. When you save the file as UTF-8, then Emacs knows how to generate the appropriate UTF-8 octets for the charset `greek-iso8859-7'. But when you visit it again, then Emacs does not decode those UTF-8 octets to characters belonging to `greek-iso8859-7', but to chars from `mule-unicode-0100-24ff'. This is actually a good thing. It's just a font issue that you don't see the right glyphs for those characters. Oliver -- 13 Frimaire an 212 de la Révolution Liberté, Egalité, Fraternité!