all messages for Emacs-related lists mirrored at yhetil.org
 help / color / mirror / code / Atom feed
From: Peter Dyballa <Peter_Dyballa@web.de>
To: Dan Maftei <ninestraycats@gmail.com>
Cc: help-gnu-emacs@gnu.org
Subject: Re: Cocoa emacs renders Unicode combining diacritics improperly
Date: Tue, 17 Jul 2012 23:15:20 +0200	[thread overview]
Message-ID: <911968B3-5CD4-42F0-8227-122D0235D9B3@web.de> (raw)
In-Reply-To: <CABxY=vhoLp9m7aU+P=FcTvka6bb53d32_DFcyt1Q4RPyFrAN9Q@mail.gmail.com>


Am 17.07.2012 um 14:52 schrieb Dan Maftei:

> 
> Here's how to make ñ compositionally:
> 
> n C-x 8 <RET> 0303 <RET>

I perform this much simple: ~n. ~ is on me German keyboard combining. The same is true for ´,`, ^, ¨.

> 
> Could you run describe-char on a compositional character and post the
> results? I want to see how it differs from my output. (Presuming, of
> course, that your emacs renders them correctly :-)

This is from the NS variant of GNU Emacs 23.4:

	        character: ñ (241, #o361, #xf1)
	preferred charset: iso-8859-1 (Latin-1 (ISO/IEC 8859-1))
	       code point: 0xF1
	           syntax: w 	which means: word
	         category: .:Base, j:Japanese, l:Latin
	      buffer code: #xC3 #xB1
	        file code: #xC3 #xB1 (encoded by coding system utf-8-unix)
	          display: by this font (glyph code)
	    nil:-apple-Lucida_Sans_Typewriter-medium-normal-normal-*-9-*-*-*-m-0-iso10646-1 (#x78)
	
	Character code properties: customize what to show
	  name: LATIN SMALL LETTER N WITH TILDE
	  general-category: Ll (Letter, Lowercase)
	  canonical-combining-class: 0 (Spacing, split, enclosing, reordrant, and Tibetan subjoined)
	  decomposition: (110 771) ('n' '̃')
	
	There are text properties here:
	  fontified            t

and this is from the NS variant of GNU Emacs 24.1:

	            character: ñ (displayed as ñ) (codepoint 241, #o361, #xf1)
	    preferred charset: iso-8859-1 (Latin-1 (ISO/IEC 8859-1))
	code point in charset: 0xF1
	               syntax: w 	which means: word
	             category: .:Base, L:Left-to-right (strong), j:Japanese, l:Latin
	          buffer code: #xC3 #xB1
	            file code: #xC3 #xB1 (encoded by coding system utf-8-unix)
	              display: by this font (glyph code)
	    nil:-apple-Menlo-medium-normal-normal-*-9-*-*-*-m-0-iso10646-1 (#xB3)
	
	Character code properties: customize what to show
	  name: LATIN SMALL LETTER N WITH TILDE
	  general-category: Ll (Letter, Lowercase)
	  canonical-combining-class: 0 (Spacing, split, enclosing, reordrant, and Tibetan subjoined)
	  decomposition: (110 771) ('n' '̃')
	
	There are text properties here:
	  fontified            t

You can see the different "character:" lines and font (type) descriptions.


This comes from the "AppKit Emacs":

	            character: ñ (displayed as ñ) (codepoint 241, #o361, #xf1)
	    preferred charset: iso-8859-1 (Latin-1 (ISO/IEC 8859-1))
	code point in charset: 0xF1
	               syntax: w 	which means: word
	             category: .:Base, L:Left-to-right (strong), j:Japanese, l:Latin
	          buffer code: #xC3 #xB1
	            file code: #x6E #xCC #x83 (encoded by coding system utf-8-hfs-unix)
	              display: by this font (glyph code)
	    mac-ct:-*-Monaco-normal-normal-normal-*-10-*-*-*-m-0-iso10646-1 (#x78)
	
	Character code properties: customize what to show
	  name: LATIN SMALL LETTER N WITH TILDE
	  general-category: Ll (Letter, Lowercase)
	  canonical-combining-class: 0 (Spacing, split, enclosing, reordrant, and Tibetan subjoined)
	  decomposition: (110 771) ('n' '̃')
	
	There are text properties here:
	  fontified            t

You can see that the two 24.1 versions use different coding systems.


> 
> Thanks for the patches. I've applied them to the 24.1.1 source but make
> segfaults when compiling profile.c. I don't have the time to fix this
> unfortunately.

I wrote "GNU Emacs 24.1" and YAMAMOTO Mitsuharu mentions in NEWS-mac at its top:

	* emacs-24.1-mac-3.0 (2012-06-10)
	Based on Emacs 24.1.

So using the sources for GNU Emacs 24.1.1 is not correct. Use the sources from the official GNU Emacs 24.1 release!

> 
> I presume you use emacs on OS X? Did you build it using this patch? Do
> compositional characters work?

Three times: yes.

> Further, if you have the time, could you build the regular source --with-ns and see if they work there? Perhaps the issue is with my OS.

It works. Your fault is that you try to use an Emacs input method, which is not necessary. Just use your keyboard and its own dead (combining) accents! If I try to use your input method I get:

	            character: n (displayed as n) (codepoint 110, #o156, #x6e)
	    preferred charset: ascii (ASCII (ISO646 IRV))
	code point in charset: 0x6E
	               syntax: w 	which means: word
	             category: .:Base, L:Left-to-right (strong), a:ASCII, l:Latin, r:Roman
	          buffer code: #x6E
	            file code: #x6E (encoded by coding system utf-8-unix)
	              display: composed to form "ñ" (see below)
	
	Composed with the following character(s) "̃" using this font:
	  nil:-apple-Menlo-medium-normal-normal-*-9-*-*-*-m-0-iso10646-1
	by these glyphs:
	  [0 1 110 81 5 0 4 5 0 nil]
	  [0 1 771 648 5 0 3 1 0 [-4 0 0]]
	
	Character code properties: customize what to show
	  name: LATIN SMALL LETTER N
	  general-category: Ll (Letter, Lowercase)
	  canonical-combining-class: 0 (Spacing, split, enclosing, reordrant, and Tibetan subjoined)
	  decomposition: (110) ('n')
	
	There are text properties here:
	  fontified            t

The combined character looks quite good with Menlo on Snow Leopard but as awful as your screenshot with Monaco (differently awful with Lucida Sans Typewriter). In the "AppKit Emacs" with Monaco the accented character looks exactly like the ~n composed character and is described as:

	            character: n (displayed as n) (codepoint 110, #o156, #x6e)
	    preferred charset: ascii (ASCII (ISO646 IRV))
	code point in charset: 0x6E
	               syntax: w 	which means: word
	             category: .:Base, L:Left-to-right (strong), a:ASCII, l:Latin, r:Roman
	          buffer code: #x6E
	            file code: #x6E (encoded by coding system utf-8-hfs-unix)
	              display: composed to form "ñ" (see below)
	
	Composed with the following character(s) "̃" using this font:
	  mac-ct:-*-Monaco-normal-normal-normal-*-10-*-*-*-m-0-iso10646-1
	by these glyphs:
	  [0 1 110 120 6 0 6 8 0 nil]
	
	Character code properties: customize what to show
	  name: LATIN SMALL LETTER N
	  general-category: Ll (Letter, Lowercase)
	  canonical-combining-class: 0 (Spacing, split, enclosing, reordrant, and Tibetan subjoined)
	  decomposition: (110) ('n')
	
	There are text properties here:
	  fontified            t


--
Greetings

  Pete

The best way to accelerate a PC is 9.8 m/s²




  reply	other threads:[~2012-07-17 21:15 UTC|newest]

Thread overview: 12+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2012-07-16 17:09 Cocoa emacs renders Unicode combining diacritics improperly Dan Maftei
2012-07-16 20:16 ` Peter Dyballa
2012-07-16 22:15   ` Dan Maftei
2012-07-16 23:09     ` Peter Dyballa
2012-07-16 23:23       ` Dan Maftei
2012-07-17  9:51         ` Peter Dyballa
2012-07-17 12:52           ` Dan Maftei
2012-07-17 21:15             ` Peter Dyballa [this message]
2012-07-17 22:16               ` Dan Maftei
2012-07-17 23:10                 ` Peter Dyballa
2012-07-18  0:03                   ` Dan Maftei
2012-07-18  8:48                     ` Peter Dyballa

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=911968B3-5CD4-42F0-8227-122D0235D9B3@web.de \
    --to=peter_dyballa@web.de \
    --cc=help-gnu-emacs@gnu.org \
    --cc=ninestraycats@gmail.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
Code repositories for project(s) associated with this external index

	https://git.savannah.gnu.org/cgit/emacs.git
	https://git.savannah.gnu.org/cgit/emacs/org-mode.git

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.