From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: James Cloos Newsgroups: gmane.emacs.help Subject: Re: UTF-8 in path / filename Date: Mon, 28 Aug 2006 11:11:14 -0400 Message-ID: References: <7D07BEAB-2279-48C5-BB9A-3FF3A15D0FED@Web.DE> <20060826000627.b8b44e95.gregory.schmitt@free.fr> <87odu8ct0a.fsf@catnip.gol.com> <0C15C504-B711-403E-B8D1-F03234C453E3@Web.DE> <25A143BA-4E99-4FF9-B6C0-A8F42146D0C9@Web.DE> NNTP-Posting-Host: main.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Trace: sea.gmane.org 1156777943 12094 80.91.229.2 (28 Aug 2006 15:12:23 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Mon, 28 Aug 2006 15:12:23 +0000 (UTC) Cc: help-gnu-emacs@gnu.org, Miles Bader Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Mon Aug 28 17:12:18 2006 Return-path: Envelope-to: geh-help-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by ciao.gmane.org with esmtp (Exim 4.43) id 1GHimf-0001P5-LW for geh-help-gnu-emacs@m.gmane.org; Mon, 28 Aug 2006 17:12:17 +0200 Original-Received: from localhost ([127.0.0.1] helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1GHime-0004xP-Qt for geh-help-gnu-emacs@m.gmane.org; Mon, 28 Aug 2006 11:12:16 -0400 Original-Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1GHimP-0004wu-TN for help-gnu-emacs@gnu.org; Mon, 28 Aug 2006 11:12:01 -0400 Original-Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43) id 1GHimO-0004wR-6X for help-gnu-emacs@gnu.org; Mon, 28 Aug 2006 11:12:01 -0400 Original-Received: from [199.232.76.173] (helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1GHimO-0004wO-2c for help-gnu-emacs@gnu.org; Mon, 28 Aug 2006 11:12:00 -0400 Original-Received: from [64.240.156.239] (helo=ore.jhcloos.com) by monty-python.gnu.org with esmtps (TLS-1.0:DHE_RSA_3DES_EDE_CBC_SHA:24) (Exim 4.52) id 1GHivK-00023r-K4; Mon, 28 Aug 2006 11:21:15 -0400 Original-Received: from lugabout.jhcloos.org (ppp38.pm3-5.buf-ch.ny.localnet.com [69.48.11.38]) (using TLSv1 with cipher EDH-RSA-DES-CBC3-SHA (168/168 bits)) (Client CN "lugabout.jhcloos.org", Issuer "ca.jhcloos.com" (verified OK)) by ore.jhcloos.com (Postfix) with ESMTP id D7C161C57E; Mon, 28 Aug 2006 10:08:17 -0500 (CDT) Original-Received: by lugabout.jhcloos.org (Postfix, from userid 500) id 3039135923A; Mon, 28 Aug 2006 15:11:37 +0000 (GMT) Original-To: Peter Dyballa In-Reply-To: <25A143BA-4E99-4FF9-B6C0-A8F42146D0C9@Web.DE> (Peter Dyballa's message of "Sun, 27 Aug 2006 15:12:15 +0200") Copyright: Copyright 2006 James Cloos X-Hashcash: 1:21:060828:peter_dyballa@web.de::rhAw2/epGKd399g0:000000000000000000000000000000000000000007Lki X-Hashcash: 1:21:060828:miles@gnu.org::en6e3M4ktL1nkr03:0000AHET X-Hashcash: 1:21:060828:help-gnu-emacs@gnu.org::WzCQDAxVfnEreQPp:000000000000000000000000000000000000000CiX3 Original-Lines: 35 User-Agent: Gnus/5.110006 (No Gnus v0.6) Emacs/23.0.0 (gnu/linux) X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.help:36991 Archived-At: JimC> Doesn't apple by default use NFD (Normalizaion Form Decomposed) JimC> for filenames? That would explain the sequences. Peter> Yes, that's the correct term for the way file names are Peter> recorded in HFS+. So then the problem is narrowed to support for composition. I just gave it a test, running the unicode-2 branch on a linux box, using the en_US-UTF8 locale. I copied the filename you quoted (a=CC=88o=CC=88u=CC=88=C3=A6=C3=86U=CC=88= O=CC=88A=CC=88.txt), gave it a prefix to ease globbing (resulting in /tmp/xxx-a=CC=88o=CC=88u=CC=88=C3=A6=C3=86U=CC= =88O=CC=88A=CC=88.txt), and ran find-file on /tmp. It worked correctly. (Well, almost; the glyphs composed by emacs have twice the height of pre-composed glyphs. There was a time when emacs didn't do that, but it is doing it again. Including in this buffer. But that looks to be specific to --enable-font-backend and DejaVu Sans Mono. With other fonts I do not get visible accents, even though C-u C-x =3D claims it is composing. And without --e-f-b I get composed glyphs which have correct vertical metrics.) I also tested this: :; echo /tmp/xxx-a* and got the filename, showing that bash treats the code points as separate characters when globbing. (Which also means I didn't actually need the xxx- prefix, since a* will therefore match the original filename....) So. Does C-u C-x =3D claim to be composing for you? -JimC --=20 James Cloos OpenPGP: 0xED7DAEA6