From mboxrd@z Thu Jan 1 00:00:00 1970
Path: news.gmane.org!not-for-mail
From: Eli Zaretskii
Newsgroups: gmane.emacs.devel
Subject: Re: Russian letters
Date: Thu, 06 Jul 2006 23:11:11 +0300
Message-ID:
References: <200607052110.21816.pogonyshev@gmx.net> <200607061859.01046.pogonyshev@gmx.net> <200607062117.25013.pogonyshev@gmx.net>
Reply-To: Eli Zaretskii
NNTP-Posting-Host: main.gmane.org
X-Trace: sea.gmane.org 1152216714 25344 80.91.229.2 (6 Jul 2006 20:11:54 GMT)
X-Complaints-To: usenet@sea.gmane.org
NNTP-Posting-Date: Thu, 6 Jul 2006 20:11:54 +0000 (UTC)
Cc: schwab@suse.de, emacs-devel@gnu.org
Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Thu Jul 06 22:11:51 2006
Return-path:
Envelope-to: ged-emacs-devel@m.gmane.org
Original-Received: from lists.gnu.org ([199.232.76.165]) by ciao.gmane.org with esmtp (Exim 4.43) id 1FyaCF-0000oc-3x for ged-emacs-devel@m.gmane.org; Thu, 06 Jul 2006 22:11:35 +0200
Original-Received: from localhost ([127.0.0.1] helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1FyaCE-0006Yw-GW for ged-emacs-devel@m.gmane.org; Thu, 06 Jul 2006 16:11:34 -0400
Original-Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1FyaC0-0006YJ-Kd for emacs-devel@gnu.org; Thu, 06 Jul 2006 16:11:20 -0400
Original-Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43) id 1FyaBz-0006Xz-RE for emacs-devel@gnu.org; Thu, 06 Jul 2006 16:11:20 -0400
Original-Received: from [199.232.76.173] (helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1FyaBz-0006Xw-NB for emacs-devel@gnu.org; Thu, 06 Jul 2006 16:11:19 -0400
Original-Received: from [192.114.186.20] (helo=nitzan.inter.net.il) by monty-python.gnu.org with esmtp (Exim 4.52) id 1FyaCF-0007vS-JP for emacs-devel@gnu.org; Thu, 06 Jul 2006 16:11:35 -0400
Original-Received: from HOME-C4E4A596F7 (IGLD-83-130-192-59.inter.net.il [83.130.192.59]) by nitzan.inter.net.il (MOS 3.7.3-GA) with ESMTP id EAQ85349 (AUTH halo1); Thu, 6 Jul 2006
 23:11:11 +0300 (IDT)
Original-To: Paul Pogonyshev
In-reply-to: <200607062117.25013.pogonyshev@gmx.net> (message from Paul Pogonyshev on Thu, 6 Jul 2006 21:17:24 +0300)
X-BeenThere: emacs-devel@gnu.org
X-Mailman-Version: 2.1.5
Precedence: list
List-Id: "Emacs development discussions."
List-Unsubscribe: ,
List-Archive:
List-Post:
List-Help:
List-Subscribe: ,
Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org
Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org
Xref: news.gmane.org gmane.emacs.devel:56651
Archived-At:

> From: Paul Pogonyshev
> Date: Thu, 6 Jul 2006 21:17:24 +0300
> Cc: Andreas Schwab
>
> Andreas Schwab wrote:
> > Paul Pogonyshev writes:
> >
> > > The file is UTF-8 and mentions its coding in `Local variables'. Again,
> > > the file is read just fine. The problems begin when I type new
> > > characters into the buffer: they are treated differently than the same
> > > characters read from the file.
> >
> > I can't reproduce that here. Whenever I read a file with russian letters
> > that is encoded in utf-8 the letters are decoded into
> > mule-unicode-0100-24ff.
>
> I said many times that the problems begin when I _type_ characters, not
> when they are read from file.

No, you said, and I quote:

  Russian letters loaded from file and newly typed are different
  character no matter if `unify-8859-on-...-mode's are active or
  not.

  Characters loaded from file:

    character: a (3664, #o7120, #xe50, U+0430)
      charset: cyrillic-iso8859-5
               (Right-Hand Part of Latin/Cyrillic Alphabet (ISO/IEC 8859-5): ISO-IR-144.)
   code point: #x50
       syntax: w  which means: word
     category: y:Cyrillic
  buffer code: #x8C #xD0
    file code: #xD0 #xB0 (encoded by coding system mule-utf-8-unix)
      display: by this font (glyph code)
       -cronyx-Fixed-Medium-R-Normal--18-120-100-100-C-90-ISO8859-5 (#xD0)

  Newly typed characters:

    character: a (332880, #o1212120, #x51450, U+0430)
      charset: mule-unicode-0100-24ff
               (Unicode characters of the range U+0100..U+24FF.)
   code point: #x28 #x50
       syntax: w  which means: word
     category: y:Cyrillic
  buffer code: #x9C #xF4 #xA8 #xD0
    file code: #xD0 #xB0 (encoded by coding system mule-utf-8-unix)
      display: by this font (glyph code)
       -Adobe-Courier-Medium-R-Normal--17-120-100-100-M-100-ISO10646-1 (#x430)

That is, you said that characters read from a file are decoded into
cyrillic-iso8859-5, while characters you type are decoded into
mule-unicode-0100-24ff. Now it sounds like it's the other way around,
especially since you say that the file is encoded in UTF-8 (which is
_always_ decoded into mule-unicode-0100-24ff, AFAIR).

Please clarify which one it is.

Also, please try the same file and keyboard keys in "emacs -Q";
perhaps something in your .emacs has unpleasant side effects.
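
As a cross-check of the numbers in the two dumps quoted above, here is
a minimal Python sketch (not Emacs code) that recomputes them for
U+0430. The UTF-8 and ISO 8859-5 bytes come from Python's standard
codecs. The helper mule_unicode_0100_24ff is hypothetical and rests on
an assumption: that the charset is a 96x96 table indexed by
(code point - #x100), with each position offset by 32. That assumption
does reproduce the "#x28 #x50" shown in the dump, but it is not taken
from the Emacs sources.

```python
# U+0430 CYRILLIC SMALL LETTER A, the character in both describe-char dumps.
ch = "\u0430"

# "file code" in both dumps: the UTF-8 encoding of the character.
utf8_bytes = ch.encode("utf-8")

# Glyph code in the ISO8859-5 font; the 7-bit form is the charset code point.
iso_bytes = ch.encode("iso8859_5")
iso_code_point = iso_bytes[0] & 0x7F


def mule_unicode_0100_24ff(cp):
    """Hypothetical helper: position codes under the assumed 96x96 layout."""
    if not 0x100 <= cp <= 0x24FF:
        raise ValueError("code point outside U+0100..U+24FF")
    row, col = divmod(cp - 0x100, 96)
    return row + 32, col + 32


def fmt(values):
    # Render bytes the way describe-char does, e.g. "#xD0 #xB0".
    return " ".join("#x%02X" % v for v in values)


print("file code (utf-8):       ", fmt(utf8_bytes))
print("glyph code (iso8859-5):  ", fmt(iso_bytes))
print("code point (iso8859-5):  ", fmt([iso_code_point]))
print("code point (mule-unicode):", fmt(mule_unicode_0100_24ff(ord(ch))))
```

Both dumps show the same file code because the character is the same
Unicode code point either way; only the internal charset differs.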