From mboxrd@z Thu Jan 1 00:00:00 1970
From: Bruce Ingalls
Newsgroups: gmane.emacs.help
Subject: Re: recommended russian encoding
Date: Sat, 17 Jul 2004 14:57:38 GMT
References: <40F570B2.5010500@yahoo.com> <65wJc.6722$oD5.4219@nwrdny03.gnilink.net> <40F6A69D.5040500@yahoo.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=ISO-8859-1; format=flowed
Content-Transfer-Encoding: 8bit
List-Id: Users list for the GNU Emacs text editor
Xref: main.gmane.org gmane.emacs.help:19693

Stefan Monnier wrote:
>>>>However, when I tried opening the file in Emacs on Linux, the UTF-8
>>>>encoded Russian characters displayed as ... empty boxes.
>
> So it seems that Emacs correctly detected the utf-8 encoding but just
> can't find the chars in the unicode font.
> You can check with C-u C-x = when point is on one of those empty boxes.

I get:

  character: а (01212120, 332880, 0x51450)
    charset: mule-unicode-0100-24ff
             (Unicode characters of the range U+0100..U+24FF.)
 code point: 40 80
     syntax: word
   category: y:Cyrillic
buffer code: 0x9C 0xF4 0xA8 0xD0
  file code: 0xD0 0xB0 (encoded by coding system mule-utf-8)
       font: -Adobe-Courier-Medium-R-Normal--12-120-75-75-M-70-ISO10646-1

(the encoded char above might not have survived copy & pasting)

>>>bogus glyphs indicate a problem with the encoding. Did you visit the
>>>file with `C-x RET c utf-8 RET C-x C-f'?
>>I think you got the above syntax wrong.
>
> Care to tell us what was wrong?

I discovered that the better syntax, which worked for me, is:
C-x C-m c utf-8 RET C-x C-f
-------------^
C-m is not the same as RET.

>>C-h h does display Russian "hello" properly.
>>Emacs also works fine in -nw mode in an xterm.
>
>>This seems to demonstrate that the font is available...
>
> Probably that you have a font for the koi-8 characters but not for the
> russian unicode characters (and your Emacs doesn't realize that they are
> the same).

That's it. koi-8-r encoding worked. Is there some way to tell Emacs
to map the Cyrillic fonts that it has, to UTF-8 encodings?

Now, I'm going through all this again with XEmacs.
The precompiled Cygwin binary of XEmacs v21.4.13 (that is the XEmacs
version, not the Cygwin distro's) seems not to have LEIM/MULE support
compiled in. When I type 'C-x C-m c', the only completion choices are
'default' and 'raw-text'.
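
As a sanity check outside Emacs, the "file code" reported by C-u C-x = (0xD0 0xB0) can be compared with what a UTF-8 encoder produces for the Cyrillic small letter a (U+0430), alongside its single-byte KOI8-R form. The following is only a rough sketch in Python, which is not part of the original exchange; the helper name `describe_char` is made up for illustration and is not an Emacs or Python API.

```python
# Illustrative only: compare the UTF-8 and KOI8-R byte forms of one character.
# describe_char is a hypothetical helper introduced for this sketch.

def describe_char(ch):
    """Return the code point and encoded bytes of a single character."""
    return {
        "code_point": "U+%04X" % ord(ch),
        "utf-8": " ".join("0x%02X" % b for b in ch.encode("utf-8")),
        "koi8-r": " ".join("0x%02X" % b for b in ch.encode("koi8_r")),
    }

info = describe_char("\u0430")  # CYRILLIC SMALL LETTER A
for key, value in info.items():
    print("%s: %s" % (key, value))
```

If the UTF-8 bytes agree with the "file code" line, the file itself is probably encoded correctly, which would be consistent with Stefan's suggestion that the empty boxes come from font selection rather than from decoding.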