From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Paul Pogonyshev Newsgroups: gmane.emacs.devel Subject: Russian numero sign Date: Sun, 19 Feb 2006 14:31:36 +0200 Message-ID: <200602191431.36541.pogonyshev@gmx.net> NNTP-Posting-Host: main.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: quoted-printable X-Trace: sea.gmane.org 1140384249 24148 80.91.229.2 (19 Feb 2006 21:24:09 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Sun, 19 Feb 2006 21:24:09 +0000 (UTC) Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Sun Feb 19 22:24:07 2006 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by ciao.gmane.org with esmtp (Exim 4.43) id 1FAw2D-0008Ha-FH for ged-emacs-devel@m.gmane.org; Sun, 19 Feb 2006 22:24:01 +0100 Original-Received: from localhost ([127.0.0.1] helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1FAw24-0005M0-J8 for ged-emacs-devel@m.gmane.org; Sun, 19 Feb 2006 16:23:52 -0500 Original-Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1FAqoe-0007Sp-5v for emacs-devel@gnu.org; Sun, 19 Feb 2006 10:49:41 -0500 Original-Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43) id 1FAqgt-0005vA-0U for emacs-devel@gnu.org; Sun, 19 Feb 2006 10:42:19 -0500 Original-Received: from [199.232.76.173] (helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1FAqWu-0003zv-6c for emacs-devel@gnu.org; Sun, 19 Feb 2006 10:31:21 -0500 Original-Received: from [213.165.64.20] (helo=mail.gmx.net) by monty-python.gnu.org with smtp (Exim 4.52) id 1FAqcj-0004nj-Jx for emacs-devel@gnu.org; Sun, 19 Feb 2006 10:37:21 -0500 Original-Received: (qmail invoked by alias); 19 Feb 2006 15:31:17 -0000 Original-Received: from dialup.194.158.192.63.belpak.by (EHLO dialup.194.158.192.63.belpak.by) [194.158.192.63] by mail.gmx.net (mp026) with SMTP; 19 Feb 2006 16:31:17 +0100 X-Authenticated: #16844820 Original-To: emacs-devel@gnu.org User-Agent: KMail/1.7.2 Content-Disposition: inline X-Y-GMX-Trusted: 0 X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:50754 Archived-At: Activate `russian-computer' input method and type S-3 (i.e. the hash sign on US English layout.) This inputs Russian numero sign, which looks like capital latin `N' with a small underlined circle by the right side. (AFAIK, it was also used in French typography before.) Now, Emacs puts this character in the buffer (output of `describe-char'): character: =EF=BF=BD (3696, #o7160, #xe70) charset: cyrillic-iso8859-5 (Right-Hand Part of Latin/Cyrillic Alphabet= (ISO/IEC 8859-5): ISO-IR-144.) code point: #x70 syntax: . which means: punctuation category: y:Cyrillic to input: type "#" with russian-computer buffer code: #x8C #xF0 file code: not encodable by coding system mule-utf-8-unix display: by this font (glyph code) -ETL-fixed-medium-r-normal--16-160-72-72-C-80-ISO8859-5 (#xF0) Which means that you cannot save this character into an UTF-8 file even though UTF-8 is meant to be able to encode anything in Unicode. There is actually a Unicode character `numero sign', U+2116, which I can type in other desktop applications (probably through XIM) by the same Shift-3 combo with Russian layout. When copied into Emacs, this character is displayed as a box (i.e. no glyph for it is found) and is described like this: character: =E2=84=96 (342742, #o1235326, #x53ad6, U+2116) charset: mule-unicode-0100-24ff (Unicode characters of the range U+0100= =2E.U+24FF.) code point: #x75 #x56 syntax: w which means: word buffer code: #x9C #xF4 #xF5 #xD6 file code: #xE2 #x84 #x96 (encoded by coding system mule-utf-8-unix) display: by this font (glyph code) -Adobe-Courier-Medium-R-Normal--17-120-100-100-M-100-ISO10646-1 (#x211= 6) So, is there an easy way to fix Emacs to input the Unicode character instead of that 3696 or make them same character somehow? It is kind of annoying that I cannot input the numero sign in an UTF-8 buffer even though I have a key for it on the keyboard. I don't really care if the glyph is not found, it is probably a deficiency in the font, not Emacs, but I think UTF-8 unencodable characters is something that should not exist at all, especially when you have the corresponding Unicode character. Paul