From: Richard Stallman
Newsgroups: gmane.emacs.devel
Subject: Re: Strange behaviour with dired and UTF8
Date: Fri, 09 May 2003 07:19:38 -0400
Reply-To: rms@gnu.org
To: "Stefan Monnier"
Cc: emacs-devel@gnu.org
In-reply-to: <200305071609.h47G9Wfh015163@rum.cs.yale.edu>

    I don't exactly understand Handa's proposal, so could someone
    explain to me how it handles a situation such as // where [one
    component] is encoded in latin-1 and [another] in utf-8?

If you literally mean that the absolute file name in the file system
consists of a Latin-1 part and a UTF-8 part, my first reaction would
have been "give up".  But it occurs to me that if Emacs decodes the
components one by one, it might be able to handle this case correctly
without too much work.

Re-encoding such names is more difficult.  I think the only possible
method is to record the proper coding system in text properties in the
string.  We would have to make expand-file-name preserve these
properties when it makes sense, and likewise the other functions that
operate on file names.

It adds up to a fair amount of work--not impossible, but perhaps not
worth the trouble.

    I mean, of course Emacs should do better than the rest of the
    crowd, but if most/all other applications fail miserably, then
    it's unlikely that people will use such setups and it would be
    wrong for Emacs to make it easier to create such a setup.

I agree with that point.
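
Editorial illustration (not part of the original message): the two
ideas above, decoding a file name one component at a time and keeping
the coding system attached to each decoded piece so it can be
re-encoded, can be sketched outside Emacs.  The Python below is only a
rough sketch under stated assumptions.  It is not Emacs code and not
Handa's proposal.  The names CANDIDATES, decode_component, decode_path
and encode_path are hypothetical, and the "try strict UTF-8, then fall
back to Latin-1" rule is an assumed heuristic.  A (text, codec) pair
stands in for a string carrying a text property.

```python
from typing import List, Tuple

# Assumed heuristic: try strict UTF-8 first, then fall back to Latin-1.
# Latin-1 accepts any byte sequence, so it must come last.
CANDIDATES = ("utf-8", "latin-1")


def decode_component(raw: bytes) -> Tuple[str, str]:
    """Decode one path component and return (text, codec used)."""
    for codec in CANDIDATES:
        try:
            return raw.decode(codec), codec
        except UnicodeDecodeError:
            continue
    # Only reachable if CANDIDATES is changed to omit a catch-all codec.
    raise ValueError(f"cannot decode component {raw!r}")


def decode_path(raw_path: bytes) -> List[Tuple[str, str]]:
    """Decode a '/'-separated byte path one component at a time."""
    return [decode_component(part) for part in raw_path.split(b"/")]


def encode_path(parts: List[Tuple[str, str]]) -> bytes:
    """Re-encode each component with the codec recorded for it."""
    return b"/".join(text.encode(codec) for text, codec in parts)


if __name__ == "__main__":
    # A path whose first directory is Latin-1 and whose second is UTF-8.
    raw = (b"/" + "caf\u00e9".encode("latin-1")
           + b"/" + "na\u00efve".encode("utf-8") + b"/file.txt")
    parts = decode_path(raw)
    print(parts)
    print(encode_path(parts) == raw)
```

The recorded codec is what makes the round trip possible: without it,
re-encoding would have to guess a single coding system for the whole
name.  The heuristic itself can guess wrong, since some Latin-1 byte
sequences are also valid UTF-8, which is one reason the real problem
is harder than this sketch suggests.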