From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Peter Dyballa Newsgroups: gmane.emacs.help Subject: Re: Cocoa emacs renders Unicode combining diacritics improperly Date: Tue, 17 Jul 2012 23:15:20 +0200 Message-ID: <911968B3-5CD4-42F0-8227-122D0235D9B3@web.de> References: <89CCB64C-8963-45D5-866C-A6B208F007E2@Web.DE> <26966270-2EEA-42B0-A669-D946E1927F3E@web.de> NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 (Apple Message framework v1084) Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Trace: dough.gmane.org 1342559738 11020 80.91.229.3 (17 Jul 2012 21:15:38 GMT) X-Complaints-To: usenet@dough.gmane.org NNTP-Posting-Date: Tue, 17 Jul 2012 21:15:38 +0000 (UTC) Cc: help-gnu-emacs@gnu.org To: Dan Maftei Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Tue Jul 17 23:15:36 2012 Return-path: Envelope-to: geh-help-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1SrF7G-0001UN-8S for geh-help-gnu-emacs@m.gmane.org; Tue, 17 Jul 2012 23:15:34 +0200 Original-Received: from localhost ([::1]:58770 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1SrF7F-0006QT-8N for geh-help-gnu-emacs@m.gmane.org; Tue, 17 Jul 2012 17:15:33 -0400 Original-Received: from eggs.gnu.org ([208.118.235.92]:35654) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1SrF78-0006QM-EB for help-gnu-emacs@gnu.org; Tue, 17 Jul 2012 17:15:28 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1SrF76-0007P5-Sy for help-gnu-emacs@gnu.org; Tue, 17 Jul 2012 17:15:26 -0400 Original-Received: from mout.web.de ([212.227.17.11]:53885) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1SrF76-0007Oq-Id for help-gnu-emacs@gnu.org; Tue, 17 Jul 2012 17:15:24 -0400 Original-Received: from [192.168.178.8] ([81.210.160.98]) by smtp.web.de (mrweb003) with ESMTPA (Nemesis) id 0LopdT-1TXAZe2qWR-00gGxv; Tue, 17 Jul 2012 23:15:21 +0200 In-Reply-To: X-Mailer: Apple Mail (2.1084) X-Provags-ID: V02:K0:5mbTmDRbAC4Py8Fy6DFZnWDSTSTF5Y1E9GYRgJk08nr ZZFgk2w6EiAMQMCaCWV0TofacMAoSShQSE61YLrH0OsfqlyNQK bnsv5f42PK3SpeOENuzndW+Rd+BABZobkSHVVkQ270Lc2cHDhD BwjfqwgsEiJsLqLMiGwanx61+7ElPSu3HAHrn4kj8D2aK5OY4z jjYYAdTlYDfZYYFmYnc6g== X-detected-operating-system: by eggs.gnu.org: Genre and OS details not recognized. X-Received-From: 212.227.17.11 X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Original-Sender: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.help:85907 Archived-At: Am 17.07.2012 um 14:52 schrieb Dan Maftei: >=20 > Here's how to make =C3=B1 compositionally: >=20 > n C-x 8 0303 I perform this much simple: ~n. ~ is on me German keyboard combining. = The same is true for =C2=B4,`, ^, =C2=A8. >=20 > Could you run describe-char on a compositional character and post the > results? I want to see how it differs from my output. (Presuming, of > course, that your emacs renders them correctly :-) This is from the NS variant of GNU Emacs 23.4: character: =C3=B1 (241, #o361, #xf1) preferred charset: iso-8859-1 (Latin-1 (ISO/IEC 8859-1)) code point: 0xF1 syntax: w which means: word category: .:Base, j:Japanese, l:Latin buffer code: #xC3 #xB1 file code: #xC3 #xB1 (encoded by coding system = utf-8-unix) display: by this font (glyph code) = nil:-apple-Lucida_Sans_Typewriter-medium-normal-normal-*-9-*-*-*-m-0-iso10= 646-1 (#x78) =09 Character code properties: customize what to show name: LATIN SMALL LETTER N WITH TILDE general-category: Ll (Letter, Lowercase) canonical-combining-class: 0 (Spacing, split, enclosing, = reordrant, and Tibetan subjoined) decomposition: (110 771) ('n' '=CC=83') =09 There are text properties here: fontified t and this is from the NS variant of GNU Emacs 24.1: character: =C3=B1 (displayed as =C3=B1) (codepoint = 241, #o361, #xf1) preferred charset: iso-8859-1 (Latin-1 (ISO/IEC 8859-1)) code point in charset: 0xF1 syntax: w which means: word category: .:Base, L:Left-to-right (strong), = j:Japanese, l:Latin buffer code: #xC3 #xB1 file code: #xC3 #xB1 (encoded by coding system = utf-8-unix) display: by this font (glyph code) = nil:-apple-Menlo-medium-normal-normal-*-9-*-*-*-m-0-iso10646-1 (#xB3) =09 Character code properties: customize what to show name: LATIN SMALL LETTER N WITH TILDE general-category: Ll (Letter, Lowercase) canonical-combining-class: 0 (Spacing, split, enclosing, = reordrant, and Tibetan subjoined) decomposition: (110 771) ('n' '=CC=83') =09 There are text properties here: fontified t You can see the different "character:" lines and font (type) = descriptions. This comes from the "AppKit Emacs": character: =C3=B1 (displayed as =C3=B1) (codepoint = 241, #o361, #xf1) preferred charset: iso-8859-1 (Latin-1 (ISO/IEC 8859-1)) code point in charset: 0xF1 syntax: w which means: word category: .:Base, L:Left-to-right (strong), = j:Japanese, l:Latin buffer code: #xC3 #xB1 file code: #x6E #xCC #x83 (encoded by coding system = utf-8-hfs-unix) display: by this font (glyph code) = mac-ct:-*-Monaco-normal-normal-normal-*-10-*-*-*-m-0-iso10646-1 (#x78) =09 Character code properties: customize what to show name: LATIN SMALL LETTER N WITH TILDE general-category: Ll (Letter, Lowercase) canonical-combining-class: 0 (Spacing, split, enclosing, = reordrant, and Tibetan subjoined) decomposition: (110 771) ('n' '=CC=83') =09 There are text properties here: fontified t You can see that the two 24.1 versions use different coding systems. >=20 > Thanks for the patches. I've applied them to the 24.1.1 source but = make > segfaults when compiling profile.c. I don't have the time to fix this > unfortunately. I wrote "GNU Emacs 24.1" and YAMAMOTO Mitsuharu mentions in NEWS-mac at = its top: * emacs-24.1-mac-3.0 (2012-06-10) Based on Emacs 24.1. So using the sources for GNU Emacs 24.1.1 is not correct. Use the = sources from the official GNU Emacs 24.1 release! >=20 > I presume you use emacs on OS X? Did you build it using this patch? Do > compositional characters work? Three times: yes. > Further, if you have the time, could you build the regular source = --with-ns and see if they work there? Perhaps the issue is with my OS. It works. Your fault is that you try to use an Emacs input method, which = is not necessary. Just use your keyboard and its own dead (combining) = accents! If I try to use your input method I get: character: n (displayed as n) (codepoint 110, #o156, = #x6e) preferred charset: ascii (ASCII (ISO646 IRV)) code point in charset: 0x6E syntax: w which means: word category: .:Base, L:Left-to-right (strong), = a:ASCII, l:Latin, r:Roman buffer code: #x6E file code: #x6E (encoded by coding system = utf-8-unix) display: composed to form "=C3=B1" (see below) =09 Composed with the following character(s) "=CC=83" using this = font: nil:-apple-Menlo-medium-normal-normal-*-9-*-*-*-m-0-iso10646-1 by these glyphs: [0 1 110 81 5 0 4 5 0 nil] [0 1 771 648 5 0 3 1 0 [-4 0 0]] =09 Character code properties: customize what to show name: LATIN SMALL LETTER N general-category: Ll (Letter, Lowercase) canonical-combining-class: 0 (Spacing, split, enclosing, = reordrant, and Tibetan subjoined) decomposition: (110) ('n') =09 There are text properties here: fontified t The combined character looks quite good with Menlo on Snow Leopard but = as awful as your screenshot with Monaco (differently awful with Lucida = Sans Typewriter). In the "AppKit Emacs" with Monaco the accented = character looks exactly like the ~n composed character and is described = as: character: n (displayed as n) (codepoint 110, #o156, = #x6e) preferred charset: ascii (ASCII (ISO646 IRV)) code point in charset: 0x6E syntax: w which means: word category: .:Base, L:Left-to-right (strong), = a:ASCII, l:Latin, r:Roman buffer code: #x6E file code: #x6E (encoded by coding system = utf-8-hfs-unix) display: composed to form "=C3=B1" (see below) =09 Composed with the following character(s) "=CC=83" using this = font: = mac-ct:-*-Monaco-normal-normal-normal-*-10-*-*-*-m-0-iso10646-1 by these glyphs: [0 1 110 120 6 0 6 8 0 nil] =09 Character code properties: customize what to show name: LATIN SMALL LETTER N general-category: Ll (Letter, Lowercase) canonical-combining-class: 0 (Spacing, split, enclosing, = reordrant, and Tibetan subjoined) decomposition: (110) ('n') =09 There are text properties here: fontified t -- Greetings Pete The best way to accelerate a PC is 9.8 m/s=C2=B2