From mboxrd@z Thu Jan 1 00:00:00 1970 Path: main.gmane.org!not-for-mail From: Karl Eichwalder Newsgroups: gmane.emacs.devel Subject: Re: ISO-8859-1 encoded file names and UTF-8 Date: Sat, 08 Mar 2003 19:25:25 +0100 Sender: emacs-devel-bounces+emacs-devel=quimby.gnus.org@gnu.org Message-ID: References: <7704-Sat08Mar2003111630+0200-eliz@elta.co.il> <8582-Sat08Mar2003190654+0200-eliz@elta.co.il> NNTP-Posting-Host: main.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Trace: main.gmane.org 1047150444 5859 80.91.224.249 (8 Mar 2003 19:07:24 GMT) X-Complaints-To: usenet@main.gmane.org NNTP-Posting-Date: Sat, 8 Mar 2003 19:07:24 +0000 (UTC) Cc: emacs-devel@gnu.org Original-X-From: emacs-devel-bounces+emacs-devel=quimby.gnus.org@gnu.org Sat Mar 08 20:07:22 2003 Return-path: Original-Received: from quimby.gnus.org ([80.91.224.244]) by main.gmane.org with esmtp (Exim 3.35 #1 (Debian)) id 18rjfG-0001WL-00 for ; Sat, 08 Mar 2003 20:07:22 +0100 Original-Received: from monty-python.gnu.org ([199.232.76.173]) by quimby.gnus.org with esmtp (Exim 3.12 #1 (Debian)) id 18rk0g-0007bF-00 for ; Sat, 08 Mar 2003 20:29:30 +0100 Original-Received: from localhost ([127.0.0.1] helo=monty-python.gnu.org) by monty-python.gnu.org with esmtp (Exim 4.10.13) id 18rjfT-0006oM-02 for emacs-devel@quimby.gnus.org; Sat, 08 Mar 2003 14:07:35 -0500 Original-Received: from list by monty-python.gnu.org with tmda-scanned (Exim 4.10.13) id 18rjf2-0006CI-00 for emacs-devel@gnu.org; Sat, 08 Mar 2003 14:07:08 -0500 Original-Received: from mail by monty-python.gnu.org with spam-scanned (Exim 4.10.13) id 18rjen-0005L4-00 for emacs-devel@gnu.org; Sat, 08 Mar 2003 14:06:57 -0500 Original-Received: from elvis.franken.de ([193.175.24.41]) by monty-python.gnu.org with esmtp (Exim 4.10.13) id 18rjei-00051r-00 for emacs-devel@gnu.org; Sat, 08 Mar 2003 14:06:48 -0500 Original-Received: from uucp by elvis.franken.de with local-rmail (Exim 3.36 #1) id 18rjeh-0004om-00; Sat, 08 Mar 2003 20:06:47 +0100 Original-Received: by tux.gnu.franken.de (Postfix, from userid 270) id DF92CA43E4; Sat, 8 Mar 2003 19:25:25 +0100 (CET) Original-To: Eli Zaretskii In-Reply-To: <8582-Sat08Mar2003190654+0200-eliz@elta.co.il> ("Eli Zaretskii"'s message of "Sat, 08 Mar 2003 19:06:54 +0200") User-Agent: Gnus/5.090016 (Oort Gnus v0.16) Emacs/21.3.50 (i686-pc-linux-gnu) X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1b5 Precedence: list List-Id: Emacs development discussions. List-Help: List-Post: List-Subscribe: , List-Archive: List-Unsubscribe: , Errors-To: emacs-devel-bounces+emacs-devel=quimby.gnus.org@gnu.org Xref: main.gmane.org gmane.emacs.devel:12180 X-Report-Spam: http://spam.gmane.org/gmane.emacs.devel:12180 "Eli Zaretskii" writes: > And what is the value of `default-file-name-coding-system'? If it's > anything but `utf-8', please try setting `file-name-coding-system' to > `utf-8' and see if that helps. I did ask for help; I reported a problem worth fixing ;) It is a reather serious problem, IMO. > This would contradict the whole purpose of a multilingual Emacs: it is > meant to seamlessly display characters from different character sets > without any special effects. This might be appropriate when you use Emacs for reading mail and news. But it's a wrong behavior when it come to "source code" > How can Emacs know that in this particular case, you want it to > display different character sets differently? When a XML file or a PO file (that's a message file used by gettext) is declared as UTF-8 encoded and Emacs detects an error within such a file, it's surely worth notifying the user. > I believe that if such a feature is added, it must be driven by > user-level settings. For example, users could define a list of > character sets or codepoints which they don't expect to see in their > buffers, and Emacs will then flag characters from those sets with some > visual cue. Yes, approximately. But instead of a list of character sets it must depend on a list of file types (.xml, .po, .java, etc.). > It's even possible that you can do that yourself right now by using > hi-lock.el or something similar, since IIRC regular expressions can be > used to express character categories. Thanks for advice :) I know how to work around the problem; for checking .po files one can use 'msgfmt' etc. I'd rather vote to change Emacs to make users happy -- that's probably not that urgent, but it should happen for the next major release coming from CVS HEAD. -- ke@suse.de (work) / keichwa@gmx.net (home): | http://www.gnu.franken.de/ke/ | ,__o Free Translation Project: | _-\_<, http://www.iro.umontreal.ca/contrib/po/HTML/ | (*)/'(*)