From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Kenichi Handa Newsgroups: gmane.emacs.devel Subject: Re: ffap not UTF-8 ready Date: Tue, 03 Oct 2006 10:20:42 +0900 Message-ID: References: <87y7ryjuml.fsf@zip.com.au> NNTP-Posting-Host: main.gmane.org Mime-Version: 1.0 (generated by SEMI 1.14.3 - "Ushinoya") Content-Type: text/plain; charset=US-ASCII X-Trace: sea.gmane.org 1159838604 31314 80.91.229.2 (3 Oct 2006 01:23:24 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Tue, 3 Oct 2006 01:23:24 +0000 (UTC) Cc: rv@gnu.org, emacs-devel@gnu.org Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Tue Oct 03 03:23:22 2006 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by ciao.gmane.org with esmtp (Exim 4.43) id 1GUYzH-0001vo-JA for ged-emacs-devel@m.gmane.org; Tue, 03 Oct 2006 03:22:25 +0200 Original-Received: from localhost ([127.0.0.1] helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1GUYzH-0004R0-1v for ged-emacs-devel@m.gmane.org; Mon, 02 Oct 2006 21:22:23 -0400 Original-Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1GUYz4-0004Qc-Gm for emacs-devel@gnu.org; Mon, 02 Oct 2006 21:22:10 -0400 Original-Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43) id 1GUYz2-0004Oq-Ph for emacs-devel@gnu.org; Mon, 02 Oct 2006 21:22:10 -0400 Original-Received: from [199.232.76.173] (helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1GUYz2-0004Of-KL for emacs-devel@gnu.org; Mon, 02 Oct 2006 21:22:08 -0400 Original-Received: from [150.29.246.133] (helo=mx1.aist.go.jp) by monty-python.gnu.org with esmtp (Exim 4.52) id 1GUZ57-0006xT-Al; Mon, 02 Oct 2006 21:28:25 -0400 Original-Received: from smtp3.aist.go.jp ([150.29.246.12]) by mx1.aist.go.jp with ESMTP id k931M3J1012906; Tue, 3 Oct 2006 10:22:03 +0900 (JST) env-from (handa@m17n.org) Original-Received: by smtp3.aist.go.jp with ESMTP id k931M268005890; Tue, 3 Oct 2006 10:22:02 +0900 (JST) env-from (handa@m17n.org) Original-Received: from handa by etlken with local (Exim 3.36 #1 (Debian)) id 1GUYxe-0001es-00; Tue, 03 Oct 2006 10:20:42 +0900 Original-To: Kevin Ryde In-reply-to: <87y7ryjuml.fsf@zip.com.au> (message from Kevin Ryde on Tue, 03 Oct 2006 07:05:54 +1000) User-Agent: SEMI/1.14.3 (Ushinoya) FLIM/1.14.2 (Yagi-Nishiguchi) APEL/10.2 Emacs/22.0.50 (i686-pc-linux-gnu) MULE/5.0 (SAKAKI) X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:60385 Archived-At: In article <87y7ryjuml.fsf@zip.com.au>, Kevin Ryde writes: > Kenichi Handa writes: > > > > It seems that this is because the variable > > ffap-string-at-point-mode-alist doesn't contain a multibyte > > character in CHARS. > Perhaps "(thing-at-point 'filename)", in thing-at-point-file-name-chars, > has the same problem. !! The variable thing-at-point-file-name-chars is defined as "-~/[:alnum:]_.${}#%,:". I've forgotten about [:XXX:] notation. I've just read src/regex.c and found that [:alnum:] also works for multibyte characters (it matches with a multibyte character whose syntax is "word"), and [:multibyte:] is available too. So, the current definition of thing-at-point-file-name-chars works in most cases. But, considering that a non-word multibyte character can also be used in a file name, perhaps, defining that as "-~/[:alnum:][:multibyte:]_.${}#%,:" is better. And, I think ffap.el should also use that kind of pattern instead of something like this: "0-9A-Za-z". --- Kenichi Handa handa@m17n.org