unofficial mirror of emacs-devel@gnu.org 
 help / color / mirror / code / Atom feed
* 23.0.60; GNU Emacs does not handle composed characters
@ 2007-12-26 16:36 Peter Dyballa
  2007-12-26 17:10 ` Andreas Schwab
  2008-01-08  6:29 ` Kenichi Handa
  0 siblings, 2 replies; 8+ messages in thread
From: Peter Dyballa @ 2007-12-26 16:36 UTC (permalink / raw)
  To: emacs-pretest-bug

Hello!

When pasting a copy of a filled in text from OmniWeb (a Web browser  
coming from the NeXT) into a file buffer in GNU Emacs (mode-line  
starts with -U:), which was launched with -Q, the German umlauts ä,  
ö, ü, Ä, Ö, Ü are stripped off their diaereses and mutate to a, o, u,  
A, O, U. Typing C-u C-x = on such a Marcel Duchamps character leads  
to an error message in echo area:

	Format specifier doesn't match argument type

In *Messages* buffer a ``describe-char: ´´is prepended the text.

In dired buffers the same umlauts in file names are stripped, while  
the date fields can display an abbreviated month's name as ``Mär´´.  
Typing on this ä C-c C-x = I get:

	        character: ä (228, #o344, #xe4)
	preferred charset: iso-8859-1 (Latin-1 (ISO/IEC 8859-1))
	       code point: 0xE4
	           syntax: w 	which means: word
	         category: j:Japanese l:Latin
	      buffer code: #xC3 #xA4
	        file code: #xC3 #xA4 (encoded by coding system utf-8-unix)
	          display: by this font (glyph code)
	     -B&H-LucidaTypewriter-Medium-R-Normal-Sans-10-100-75-75-M-60- 
ISO8859-1 (#xE4)
	
	Character code properties are not shown: customize what to show
	
	There are text properties here:
	  auto-composed        t
	  fontified            t

instead of an error message. The error message is output when I type  
on as tripped umlaut in a file name!


Looking with GNU Emacs 23.0.50 into the file saved with its stripped  
umlauts I see instead of them ``{äöüÄÖÜ}<an open box>´. This box is  
described in GNU Emacs 23.0.50 as:

	        character:  (332488, #o1211310, #x512c8, U+0308)
	          charset: mule-unicode-0100-24ff
			   (Unicode characters of the range U+0100..U+24FF.)
	       code point: #x25 #x48
	           syntax: w 	which means: word
	         category: ^:Combining diacritic or mark
	      buffer code: #x9C #xF4 #xA5 #xC8
	        file code: #xCC #x88 (encoded by coding system mule-utf-8-unix)
	          display: by this font (glyph code)
	     -b&h-lucida sans typewriter-medium-r-normal--10-98-74-74-m-60- 
iso10646-1 (#x308)
	     Unicode data:
	             Name: COMBINING DIAERESIS
	         Category: non-spacing mark
	  Combining class: Above
	    Bidi category: Non-Spacing Mark
	         Old name: NON-SPACING DIAERESIS
	ISO 10646 comment: Dialytika


In GNU Emacs 23.0.60.1 (powerpc-apple-darwin8.11.0, GTK+ Version 2.6.10)
  of 2007-12-25 on Latsche.local
Windowing system distributor `The XFree86 Project, Inc', version  
11.0.40400000
configured using `configure  '--enable-font-backend' '--with-x- 
toolkit=gtk' '--with-dbus' '--without-sound' '--without-pop' '--with- 
xpm' '--with-jpeg' '--with-tiff' '--with-gif' '--with-png' '--enable- 
locallisppath=/Library/Application Support/Emacs/calendar22:/Library/ 
Application Support/Emacs/caml:/Library/Application Support/Emacs:/sw/ 
share/emacs21/site-lisp/elib' 'PKG_CONFIG_PATH=/sw/lib/freetype219/ 
lib/pkgconfig:/sw/lib/fontconfig2/lib/pkgconfig:/sw/lib/pkgconfig:/sw/ 
lib/system-openssl/lib/pkgconfig:/sw/share/pkgconfig:/usr/lib/ 
pkgconfig:/usr/local/lib/pkgconfig:/usr/local/clamXav/lib/pkgconfig:/ 
usr/local/lib/pkgconfig' 'CPPFLAGS=-no-cpp-precomp -D__BIND_NOSTATIC - 
I/usr/include/openssl -I/sw/include/pango-1.0 -I/sw/lib/fontconfig2/ 
include -I/sw/lib/freetype219/include -I/sw/lib/freetype219/include/ 
freetype2 -I/sw/include -I/usr/local/include -idirafter /usr/X11R6/ 
include' 'CXXFLAGS=-no-cpp-precomp -I/usr/include/openssl -I/sw/ 
include/pango-1.0 -I/sw/lib/fontconfig2/include -I/sw/lib/freetype219/ 
include -I/sw/lib/freetype219/include/freetype2 -I/sw/include -I/usr/ 
local/include' 'CFLAGS=-ggdb -g3 -gfull -H -bind_at_load -pipe -fPIC - 
mcpu=7450 -mtune=7450 -O0' 'LDFLAGS=-dead_strip -multiply_defined  
suppress -L/sw/lib/ncurses -L/sw/lib/fontconfig2/lib -L/sw/lib/ 
freetype219/lib -L/sw/lib -L/usr/local/lib -L/usr/X11R6/lib''

Important settings:
   value of $LC_ALL: nil
   value of $LC_COLLATE: nil
   value of $LC_CTYPE: de_DE.UTF-8
   value of $LC_MESSAGES: nil
   value of $LC_MONETARY: nil
   value of $LC_NUMERIC: nil
   value of $LC_TIME: nil
   value of $LANG: de_DE.UTF-8
   value of $XMODIFIERS: nil
   locale-coding-system: utf-8-unix
   default-enable-multibyte-characters: t

Major mode: Fundamental

Minor modes in effect:
   tooltip-mode: t
   mouse-wheel-mode: t
   menu-bar-mode: t
   file-name-shadow-mode: t
   global-font-lock-mode: t
   font-lock-mode: t
   blink-cursor-mode: t
   global-auto-composition-mode: t
   auto-composition-mode: t
   auto-compression-mode: t
   line-number-mode: t

Recent input:
<down-mouse-1> <mouse-1> C-x C-f D a t e i <return>
<down-mouse-2> <mouse-2> <down-mouse-1> <mouse-1> C-x


--
Greetings

   Pete

America believes in education: the average professor earns more money  
in a year than a professional athlete earns in a whole week.
				­ Evan Esar

^ permalink raw reply	[flat|nested] 8+ messages in thread

end of thread, other threads:[~2008-01-09  1:56 UTC | newest]

Thread overview: 8+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2007-12-26 16:36 23.0.60; GNU Emacs does not handle composed characters Peter Dyballa
2007-12-26 17:10 ` Andreas Schwab
2007-12-26 18:53   ` Peter Dyballa
2007-12-30 15:04   ` Peter Dyballa
2007-12-30 21:50     ` James Cloos
2008-01-08  6:29 ` Kenichi Handa
2008-01-08 15:08   ` Peter Dyballa
2008-01-09  1:56     ` Kenichi Handa

Code repositories for project(s) associated with this public inbox

	https://git.savannah.gnu.org/cgit/emacs.git

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).