From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Peter Dyballa Newsgroups: gmane.emacs.help Subject: Re: Dired doesn't decode UTF-8 file names (was: Dired confused by filenames starting with date-like strings) Date: Thu, 24 Aug 2006 21:48:51 +0200 Message-ID: <10737F7A-CC2F-4281-B611-CB3BBE791A07@Web.DE> References: <1156356934.469929.210320@i3g2000cwc.googlegroups.com> <87sljnf670.fsf@catnip.gol.com> <87mz9uyzfj.fsf@catnip.gol.com> <55FE760B-5409-4876-9E4D-0B15A0F5AEAF@Web.DE> NNTP-Posting-Host: main.gmane.org Mime-Version: 1.0 (Apple Message framework v752.2) Content-Type: text/plain; charset=WINDOWS-1252; delsp=yes; format=flowed Content-Transfer-Encoding: quoted-printable X-Trace: sea.gmane.org 1156449007 14480 80.91.229.2 (24 Aug 2006 19:50:07 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Thu, 24 Aug 2006 19:50:07 +0000 (UTC) Cc: help-gnu-emacs@gnu.org Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Thu Aug 24 21:49:57 2006 Return-path: Envelope-to: geh-help-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by ciao.gmane.org with esmtp (Exim 4.43) id 1GGLCz-0007ST-Px for geh-help-gnu-emacs@m.gmane.org; Thu, 24 Aug 2006 21:49:46 +0200 Original-Received: from localhost ([127.0.0.1] helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1GGLCy-0002mA-QA for geh-help-gnu-emacs@m.gmane.org; Thu, 24 Aug 2006 15:49:44 -0400 Original-Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1GGLCD-0001m6-Q6 for help-gnu-emacs@gnu.org; Thu, 24 Aug 2006 15:48:57 -0400 Original-Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43) id 1GGLCB-0001hZ-Qc for help-gnu-emacs@gnu.org; Thu, 24 Aug 2006 15:48:57 -0400 Original-Received: from [199.232.76.173] (helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1GGLCB-0001gx-J9 for help-gnu-emacs@gnu.org; Thu, 24 Aug 2006 15:48:55 -0400 Original-Received: from [217.72.192.227] (helo=fmmailgate02.web.de) by monty-python.gnu.org with esmtp (Exim 4.52) id 1GGLKI-0000u1-V1 for help-gnu-emacs@gnu.org; Thu, 24 Aug 2006 15:57:19 -0400 Original-Received: from smtp07.web.de (fmsmtp07.dlan.cinetic.de [172.20.5.215]) by fmmailgate02.web.de (Postfix) with ESMTP id 22595170C3C2; Thu, 24 Aug 2006 21:48:54 +0200 (CEST) Original-Received: from [84.245.174.38] (helo=[192.168.1.2]) by smtp07.web.de with asmtp (TLSv1:RC4-SHA:128) (WEB.DE 4.107 #114) id 1GGLC9-0003Qv-00; Thu, 24 Aug 2006 21:48:54 +0200 In-Reply-To: X-Image-Url: http://homepage.mac.com/sparifankal/.cv/thumbs/me.thumbnail Original-To: Kevin Rodgers X-Mailer: Apple Mail (2.752.2) X-Sender: Peter_Dyballa@web.de X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.help:36915 Archived-At: Am 24.08.2006 um 18:12 schrieb Kevin Rodgers: > Peter Dyballa wrote: >> Am 24.08.2006 um 16:27 schrieb Kevin Rodgers: >>> There is no reason to set dired-use-ls-dired to anything but t or =20= >>> nil. >>> In particular, it does not determine which ls program is run. Try: >>> >>> (setq insert-directory-program "/sw/bin/ls" >>> dired-use-ls-dired t) >> Am 24.08.2006 um 16:35 schrieb Miles Bader: >>> So I guess the following should work: >>> >>> (setq dired-use-ls-dired t) >>> (setq insert-directory-program "/sw/bin/gls") >> Oh, yes, these work! Thank you! There isn't much difference =20 >> compared to using Apple's ls; in GNU Emacs 22.0.50 UTF-8 =20 >> characters are still displayed as a series of UTF-8 octets, and in =20= >> GNU Emacs 23.0.0 I still cannot search for file names with German =20 >> umlauts etc. > > That is the first mention of UTF-8 in this thread. Mile suggested =20 > using > GNU ls --dired to solve the OP's problem with date-like filenames. I remember Miles Bader mentioned some time before that GNU ls could =20 solve UTF-8 problems. Then it did not work, for me, so I thought, =20 hoped, that now, with 'dired extra switches' it could work better. I =20 have problems with non-7 bit US-ASCII characters in the file name and =20= in the abbreviated month name (M=E4r, for March) =96 at least this one =20= *is* solved (3 M=C3=A4r 13:39 =3D> 2006-03-03 13:39). > > Is file-name-coding-system (or default-file-name-coding-system) set > to utf-8? No, it is iso-latin-1! Although my language environment in shell is =20 set to UTF-8, as it is for GNU Emacs 23. File-name-coding-system is nil, so default-file-name-coding-system =20 should be used, according to the documentation. But =96 it's still iso-=20= latin-1! Ah, here is a difference: it *came* from '(current-language-=20= environment "UTF-8") in the customisation section! (I warned about =20 its use, but had it myself still set! Arghh!!!) > Should it be? Yes, it should! The abbreviated month's name is now correctly =20 displayed! GNU Emacs 22.0.50 still has problems: German umlauts in =20 file names are displayed as [AOUaou]=A8 and because =A8 is not found in =20= the fontset I see after each vowel an empty box. The same is true for =20= other accented characters in file names. GNU ls and Apple ls show no =20 difference. Launching GNU Emacs 22.0.50 in Terminal with no windows the de-=20 composed UTF-8 characters are composed correctly! > > Is there an entry in process-coding-system-alist for ls? Should there > be? Not directly ... (("\\*shell\\*\\'" utf-8 . utf-8) ("\\*.* output\\*\\'" iso-8859-15-unix . iso-8859-15-unix)) The second line is meant for AUCTeX output buffers (although not =20 really working, probably I have to find some hook). Is there really =20 more needed? -- Greetings Pete "Let's face it; we don't want a free market economy either." James Farley, president, Coca-Cola Export Corp., 1959