From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: James Cloos Newsgroups: gmane.emacs.devel Subject: Re: [jidanni@jidanni.org: ffap not UTF-8 ready] Date: Tue, 26 Dec 2006 13:58:12 -0500 Message-ID: References: NNTP-Posting-Host: dough.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Trace: sea.gmane.org 1167159551 28484 80.91.229.10 (26 Dec 2006 18:59:11 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Tue, 26 Dec 2006 18:59:11 +0000 (UTC) Cc: rms@gnu.org Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Tue Dec 26 19:59:08 2006 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by dough.gmane.org with esmtp (Exim 4.50) id 1GzHVz-0004Qx-MK for ged-emacs-devel@m.gmane.org; Tue, 26 Dec 2006 19:59:07 +0100 Original-Received: from localhost ([127.0.0.1] helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1GzHVz-0000VY-A1 for ged-emacs-devel@m.gmane.org; Tue, 26 Dec 2006 13:59:07 -0500 Original-Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1GzHVn-0000U4-Ju for emacs-devel@gnu.org; Tue, 26 Dec 2006 13:58:55 -0500 Original-Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43) id 1GzHVm-0000S5-O8 for emacs-devel@gnu.org; Tue, 26 Dec 2006 13:58:55 -0500 Original-Received: from [199.232.76.173] (helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1GzHVm-0000Rq-GL for emacs-devel@gnu.org; Tue, 26 Dec 2006 13:58:54 -0500 Original-Received: from [64.240.156.239] (helo=ore.jhcloos.com) by monty-python.gnu.org with esmtps (TLS-1.0:DHE_RSA_3DES_EDE_CBC_SHA:24) (Exim 4.52) id 1GzHVk-0004Pr-Cj; Tue, 26 Dec 2006 13:58:52 -0500 Original-Received: from lugabout.jhcloos.org (ppp43.pm3-2.buf-ch.ny.localnet.com [69.48.10.107]) (using TLSv1 with cipher EDH-RSA-DES-CBC3-SHA (168/168 bits)) (Client CN "lugabout.jhcloos.org", Issuer "ca.jhcloos.com" (verified OK)) by ore.jhcloos.com (Postfix) with ESMTP id B15DF1C57D; Tue, 26 Dec 2006 12:58:47 -0600 (CST) Original-Received: by lugabout.jhcloos.org (Postfix, from userid 500) id 255CF2B83DF; Tue, 26 Dec 2006 18:58:36 +0000 (GMT) Original-To: emacs-devel@gnu.org In-Reply-To: (Richard Stallman's message of "Tue\, 26 Dec 2006 12\:22\:29 -0500") Copyright: Copyright 2006 James Cloos X-Hashcash: 1:23:061226:emacs-devel@gnu.org::hWJNztv7LPUCnzvL:000000000000000000000000000000000000000000Ag9s X-Hashcash: 1:23:061226:rms@gnu.org::LXne89InFpIs/WWV:000000Fmrg Original-Lines: 26 User-Agent: Gnus/5.110006 (No Gnus v0.6) Emacs/23.0.0 (gnu/linux) X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:64301 Archived-At: |> ffap is not UTF-8 ready, I put the cursor on ./???? or whatever and it |> acts like the file Doesn't exist. I tried it on several filenames (all of which exist on my filesystem). On every file where the first character was ASCII and was not modified by a combining character, ffap worked as expected. But on every file where the second character was a combining character (such as the file named C=CC=B6.utf8 -- that is a C followed by U+0336, w= hich is called COMBINING LONG STROKE OVERLAY) ffap failed to recognize the string as being a filename. It also failed when the string started with a non-ASCII character, such as a kanji or a greek character. Some testing shows that (ffap-string-at-point) skips strings such as those described above. I guess this is because of the default value of ffap-string-at-point-mode-alist. For finding files it looks for strings of "--:$+<>@-Z_a-z~*?", dropping "<@" from the beginning and dropping "@>;.,!:" from the end. That first string needs to be expanded to support non-ASCII characters which might be used for filenames. -JimC --=20 James Cloos OpenPGP: 1024D/ED7DAEA6