From mboxrd@z Thu Jan  1 00:00:00 1970
Path: main.gmane.org!not-for-mail
From: Richard Stallman <rms@gnu.org>
Newsgroups: gmane.emacs.devel
Subject: Re: Strange behaviour with dired and UTF8
Date: Fri, 09 May 2003 07:19:38 -0400
Sender: emacs-devel-bounces+emacs-devel=quimby.gnus.org@gnu.org
Message-ID: <E19E5uc-000160-00@fencepost.gnu.org>
References: <BE7150A4-80A3-11D7-B1A5-00039363E640@swipnet.se>
	<200305071609.h47G9Wfh015163@rum.cs.yale.edu>
Reply-To: rms@gnu.org
NNTP-Posting-Host: main.gmane.org
X-Trace: main.gmane.org 1052480358 20446 80.91.224.249 (9 May 2003 11:39:18 GMT)
X-Complaints-To: usenet@main.gmane.org
NNTP-Posting-Date: Fri, 9 May 2003 11:39:18 +0000 (UTC)
Cc: emacs-devel@gnu.org
Original-X-From: emacs-devel-bounces+emacs-devel=quimby.gnus.org@gnu.org Fri May 09 13:39:16 2003
Return-path: <emacs-devel-bounces+emacs-devel=quimby.gnus.org@gnu.org>
Original-Received: from quimby.gnus.org ([80.91.224.244])
	by main.gmane.org with esmtp (Exim 3.35 #1 (Debian))
	id 19E6Dc-0005Jb-00
	for <emacs-devel@main.gmane.org>; Fri, 09 May 2003 13:39:16 +0200
Original-Received: from monty-python.gnu.org ([199.232.76.173])
	by quimby.gnus.org with esmtp (Exim 3.12 #1 (Debian))
	id 19E6IX-0005OT-00
	for <emacs-devel@quimby.gnus.org>; Fri, 09 May 2003 13:44:21 +0200
Original-Received: from localhost ([127.0.0.1] helo=monty-python.gnu.org)
	by monty-python.gnu.org with esmtp (Exim 4.10.13)
	id 19E68K-0001Hc-05
	for emacs-devel@quimby.gnus.org; Fri, 09 May 2003 07:33:48 -0400
Original-Received: from list by monty-python.gnu.org with tmda-scanned (Exim 4.10.13)
	id 19E616-0007ZI-00
	for emacs-devel@gnu.org; Fri, 09 May 2003 07:26:20 -0400
Original-Received: from mail by monty-python.gnu.org with spam-scanned (Exim 4.10.13)
	id 19E5zz-0007Hw-00
	for emacs-devel@gnu.org; Fri, 09 May 2003 07:25:12 -0400
Original-Received: from fencepost.gnu.org ([199.232.76.164])
	by monty-python.gnu.org with esmtp (Exim 4.10.13)
	id 19E5ud-0005hk-00
	for emacs-devel@gnu.org; Fri, 09 May 2003 07:19:39 -0400
Original-Received: from rms by fencepost.gnu.org with local (Exim 4.10)
	id 19E5uc-000160-00; Fri, 09 May 2003 07:19:38 -0400
Original-To: "Stefan Monnier" <monnier+gnu/emacs@rum.cs.yale.edu>
In-reply-to: <200305071609.h47G9Wfh015163@rum.cs.yale.edu>
	(monnier+gnu/emacs@rum.cs.yale.edu)
Original-cc: handa@m17n.org
Original-cc: jan.h.d@swipnet.se
X-BeenThere: emacs-devel@gnu.org
X-Mailman-Version: 2.1b5
Precedence: list
List-Id: Emacs development discussions. <emacs-devel.gnu.org>
List-Help: <mailto:emacs-devel-request@gnu.org?subject=help>
List-Post: <mailto:emacs-devel@gnu.org>
List-Subscribe: <http://mail.gnu.org/mailman/listinfo/emacs-devel>,
	<mailto:emacs-devel-request@gnu.org?subject=subscribe>
List-Archive: <http://mail.gnu.org/pipermail/emacs-devel>
List-Unsubscribe: <http://mail.gnu.org/mailman/listinfo/emacs-devel>,
	<mailto:emacs-devel-request@gnu.org?subject=unsubscribe>
Errors-To: emacs-devel-bounces+emacs-devel=quimby.gnus.org@gnu.org
Xref: main.gmane.org gmane.emacs.devel:13781
X-Report-Spam: http://spam.gmane.org/gmane.emacs.devel:13781

    I don't exactly understand the Handa's proposal, so could someone
    explain to me how it handles a situation such as /<foo>/<bar>
    where <foo> is encoded in latin-1 and <bar> in utf-8 ?

If you literally mean that the absolute file name in the file system
consists of a Latin-1 part and a UTF-8 part, my first reaction would
have been "give up".  But it occurs to me that if Emacs decodes the
components one by one, it might be able to handle this case correctly
without too much work.

Re-encoding such names is more difficult.  I think the only possible method
is to record the proper coding system in text properties in the string.
We would have to make expand-file-name preserve these properties when
it makes sense; likewise other functions that operate on file names.

It adds up to a fair amount of work--not impossible, but perhaps
not worth the trouble.

    I mean, of course Emacs should do better than the rest of the crowd,
    but if most/all other applications fail miserably, then it's unlikely
    that people will use such setups and it would be wrong for Emacs to
    make it easier to create such a setup

I agree with that point.