From mboxrd@z Thu Jan 1 00:00:00 1970
From: Peter Dyballa
Newsgroups: gmane.emacs.devel,gmane.emacs.pretest.bugs
Subject: 23.0.60; GNU Emacs does not handle composed characters
Date: Wed, 26 Dec 2007 17:36:25 +0100
To: emacs-pretest-bug@gnu.org
X-Mailer: Apple Mail (2.753)
Xref: news.gmane.org gmane.emacs.devel:85477 gmane.emacs.pretest.bugs:20478

Hello!

When pasting a copy of a filled-in text from OmniWeb (a Web browser
coming from the NeXT) into a file buffer in GNU Emacs (mode-line
starts with -U:), which was launched with -Q, the German umlauts ä,
ö, ü, Ä, Ö, Ü are stripped of their diaereses and mutate to a, o, u,
A, O, U.
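
A minimal sketch of one possible reading of this symptom, assuming (this is not confirmed anywhere in the report) that the pasted text arrives in decomposed form, i.e. each umlaut as a base letter followed by U+0308 COMBINING DIAERESIS, the character that the describe-char output further down names. The helper names are hypothetical and the code only illustrates the Unicode side, not what Emacs does internally:

```python
import unicodedata

# The six umlauts from the report, written as escapes: ä ö ü Ä Ö Ü
UMLAUTS = "\u00e4\u00f6\u00fc\u00c4\u00d6\u00dc"
COMBINING_DIAERESIS = "\u0308"


def decompose(text):
    """Return text in NFD: base letters followed by combining marks."""
    return unicodedata.normalize("NFD", text)


def compose(text):
    """Return text in NFC: precomposed characters where they exist."""
    return unicodedata.normalize("NFC", text)


def strip_marks(text):
    """Drop every combining mark, which mimics the 'stripped' umlauts."""
    return "".join(
        ch for ch in decompose(text) if not unicodedata.combining(ch)
    )


if __name__ == "__main__":
    # A precomposed ä decomposes into 'a' plus U+0308.
    print([f"U+{ord(ch):04X}" for ch in decompose("\u00e4")])
    # UTF-8 bytes of U+0308 and of precomposed ä.
    print(COMBINING_DIAERESIS.encode("utf-8").hex(" "))  # cc 88
    print("\u00e4".encode("utf-8").hex(" "))             # c3 a4
    # Losing the combining marks leaves the bare vowels.
    print(strip_marks(UMLAUTS))                          # aouAOU
    # Recomposition restores the original umlauts.
    print(compose(decompose(UMLAUTS)) == UMLAUTS)        # True
```

The byte values agree with the "file code: #xCC #x88" and "buffer code: #xC3 #xA4" entries quoted in this report, which is why the decomposed-input assumption looks plausible.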
Typing C-u C-x = on such a Marcel Duchamps character leads
to an error message in the echo area:

	Format specifier doesn't match argument type

In the *Messages* buffer, ``describe-char: ´´ is prepended to the text.

In dired buffers the same umlauts in file names are stripped, while
the date fields can display an abbreviated month's name as ``Mär´´.

Typing C-u C-x = on this ä I get:

	        character: ä (228, #o344, #xe4)
	preferred charset: iso-8859-1 (Latin-1 (ISO/IEC 8859-1))
	       code point: 0xE4
	           syntax: w    which means: word
	         category: j:Japanese l:Latin
	      buffer code: #xC3 #xA4
	        file code: #xC3 #xA4 (encoded by coding system utf-8-unix)
	          display: by this font (glyph code)
	     -B&H-LucidaTypewriter-Medium-R-Normal-Sans-10-100-75-75-M-60-ISO8859-1 (#xE4)

	Character code properties are not shown: customize what to show

	There are text properties here:
	  auto-composed        t
	  fontified            t

instead of an error message. The error message is output when I type
it on a stripped umlaut in a file name!

Looking with GNU Emacs 23.0.50 into the file saved with its stripped
umlauts I see instead of them ``{äöüÄÖÜ}´. This box is
described in GNU Emacs 23.0.50 as:

	  character: (332488, #o1211310, #x512c8, U+0308)
	    charset: mule-unicode-0100-24ff (Unicode characters of the range U+0100..U+24FF.)
	 code point: #x25 #x48
	     syntax: w    which means: word
	   category: ^:Combining diacritic or mark
	buffer code: #x9C #xF4 #xA5 #xC8
	  file code: #xCC #x88 (encoded by coding system mule-utf-8-unix)
	    display: by this font (glyph code)
	     -b&h-lucida sans typewriter-medium-r-normal--10-98-74-74-m-60-iso10646-1 (#x308)

	Unicode data:
	  Name: COMBINING DIAERESIS
	  Category: non-spacing mark
	  Combining class: Above
	  Bidi category: Non-Spacing Mark
	  Old name: NON-SPACING DIAERESIS
	  ISO 10646 comment: Dialytika


In GNU Emacs 23.0.60.1 (powerpc-apple-darwin8.11.0, GTK+ Version 2.6.10)
 of 2007-12-25 on Latsche.local
Windowing system distributor `The XFree86 Project, Inc', version 11.0.40400000
configured using `configure '--enable-font-backend' '--with-x-toolkit=gtk'
  '--with-dbus' '--without-sound' '--without-pop' '--with-xpm' '--with-jpeg'
  '--with-tiff' '--with-gif' '--with-png'
  '--enable-locallisppath=/Library/Application Support/Emacs/calendar22:/Library/Application Support/Emacs/caml:/Library/Application Support/Emacs:/sw/share/emacs21/site-lisp/elib'
  'PKG_CONFIG_PATH=/sw/lib/freetype219/lib/pkgconfig:/sw/lib/fontconfig2/lib/pkgconfig:/sw/lib/pkgconfig:/sw/lib/system-openssl/lib/pkgconfig:/sw/share/pkgconfig:/usr/lib/pkgconfig:/usr/local/lib/pkgconfig:/usr/local/clamXav/lib/pkgconfig:/usr/local/lib/pkgconfig'
  'CPPFLAGS=-no-cpp-precomp -D__BIND_NOSTATIC -I/usr/include/openssl -I/sw/include/pango-1.0 -I/sw/lib/fontconfig2/include -I/sw/lib/freetype219/include -I/sw/lib/freetype219/include/freetype2 -I/sw/include -I/usr/local/include -idirafter /usr/X11R6/include'
  'CXXFLAGS=-no-cpp-precomp -I/usr/include/openssl -I/sw/include/pango-1.0 -I/sw/lib/fontconfig2/include -I/sw/lib/freetype219/include -I/sw/lib/freetype219/include/freetype2 -I/sw/include -I/usr/local/include'
  'CFLAGS=-ggdb -g3 -gfull -H -bind_at_load -pipe -fPIC -mcpu=7450 -mtune=7450 -O0'
  'LDFLAGS=-dead_strip -multiply_defined
suppress -L/sw/lib/ncurses -L/sw/lib/fontconfig2/lib -L/sw/lib/freetype219/lib -L/sw/lib -L/usr/local/lib -L/usr/X11R6/lib''

Important settings:
  value of $LC_ALL: nil
  value of $LC_COLLATE: nil
  value of $LC_CTYPE: de_DE.UTF-8
  value of $LC_MESSAGES: nil
  value of $LC_MONETARY: nil
  value of $LC_NUMERIC: nil
  value of $LC_TIME: nil
  value of $LANG: de_DE.UTF-8
  value of $XMODIFIERS: nil
  locale-coding-system: utf-8-unix
  default-enable-multibyte-characters: t

Major mode: Fundamental

Minor modes in effect:
  tooltip-mode: t
  mouse-wheel-mode: t
  menu-bar-mode: t
  file-name-shadow-mode: t
  global-font-lock-mode: t
  font-lock-mode: t
  blink-cursor-mode: t
  global-auto-composition-mode: t
  auto-composition-mode: t
  auto-compression-mode: t
  line-number-mode: t

Recent input:
C-x C-f D a t e i C-x

--
Greetings

   Pete

America believes in education: the average professor earns more money
in a year than a professional athlete earns in a whole week.
				- Evan Esar