From: Kenichi Handa
Newsgroups: gmane.emacs.devel
Subject: Re: etags and UTF-8 encoded file names (Re: ISO-8859-1 encoded file names and UTF-8)
Date: Wed, 2 Apr 2003 10:34:42 +0900 (JST)
Message-ID: <200304020134.KAA04167@etlken.m17n.org>
In-reply-to: (message from Karl Eichwalder on Tue, 01 Apr 2003 23:17:07 +0200)
To: keichwa@gmx.net
Cc: emacs-devel@gnu.org

Karl Eichwalder writes:

> Now the next one: `tags-query-replace' does not work properly when file
> names are UTF-8 encoded. First run `etags *' on the files and then
> call `tags-query-replace'.

This is the same type of bug (but a more difficult one) as the one I
posted to emacs-devel under the subject "bad interaction with C-x RET c
and vc-cvs-registered".

A tag file contains file names plus parts of source code. The former
must be decoded by file-name-coding-system, but the latter must be
decoded by the coding system of each file. It's very hard to decide
on a coding system for the latter without actually reading the file.

Perhaps a tag file must be read as raw-text (thus in a unibyte
buffer), and if one gives a non-ASCII TAGNAME to `find-tag', it must be
encoded by the buffer-file-coding-system of the current buffer.

---
Ken'ichi HANDA
handa@m17n.org
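
The raw-text idea can be illustrated outside Emacs with a rough Python sketch. It is not how etags.el is implemented. It only shows why a tag file with mixed encodings is easier to handle as bytes. The names `FILE_NAME_CODING`, `BUFFER_CODING`, `build_sample_tags`, `parse_tags` and `find_tag` are hypothetical, the two codec choices are assumed stand-ins for file-name-coding-system and buffer-file-coding-system, and the sample data is made up.

```python
# Hypothetical sketch: keep a TAGS file as raw bytes (like a unibyte buffer)
# and apply a coding system only to the part that needs it.

FILE_NAME_CODING = "utf-8"     # assumed stand-in for file-name-coding-system
BUFFER_CODING = "iso-8859-1"   # assumed stand-in for buffer-file-coding-system


def build_sample_tags() -> bytes:
    """Build one TAGS section whose file name and source text differ in encoding."""
    file_name = "grüße.c".encode(FILE_NAME_CODING)
    # Source fragment copied from a Latin-1 file, then DEL and "line,offset".
    entry = "int größe(void)".encode(BUFFER_CODING) + b"\x7f" + b"3,20\n"
    header = file_name + b"," + str(len(entry)).encode("ascii") + b"\n"
    # Each section starts with a form feed followed by a newline.
    return b"\x0c\n" + header + entry


def parse_tags(raw: bytes):
    """Return a list of (file_name_bytes, [entry_bytes, ...]) without decoding."""
    sections = []
    for chunk in raw.split(b"\x0c\n")[1:]:
        lines = chunk.split(b"\n")
        name_bytes = lines[0].rsplit(b",", 1)[0]
        entries = [line for line in lines[1:] if line]
        sections.append((name_bytes, entries))
    return sections


def find_tag(raw: bytes, tag_name: str, buffer_coding: str = BUFFER_CODING):
    """Encode the tag name with the buffer's coding system and match on bytes."""
    needle = tag_name.encode(buffer_coding)
    hits = []
    for name_bytes, entries in parse_tags(raw):
        for entry in entries:
            pattern = entry.split(b"\x7f", 1)[0]
            if needle in pattern:
                # Decode each part with its own coding system only for display.
                hits.append((name_bytes.decode(FILE_NAME_CODING),
                             pattern.decode(buffer_coding)))
    return hits
```

Calling `find_tag(build_sample_tags(), "größe")` matches because the name is encoded the same way as the source fragment. Passing `"utf-8"` as the third argument finds nothing, and decoding the whole sample as UTF-8 fails outright, which is the mismatch described in the message.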