From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Juri Linkov Newsgroups: gmane.emacs.bugs Subject: bug#13041: 24.2; diacritic-fold-search Date: Fri, 07 Dec 2012 02:58:17 +0200 Organization: JURTA Message-ID: <871uf2647i.fsf@mail.jurta.org> References: <20121130182205.C722F14B8D@panix1.panix.com> <87ip8fjzwn.fsf@gnu.org> NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Trace: ger.gmane.org 1354843977 28791 80.91.229.3 (7 Dec 2012 01:32:57 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Fri, 7 Dec 2012 01:32:57 +0000 (UTC) Cc: perin@panix.com, 13041@debbugs.gnu.org, perin@acm.org To: Kenichi Handa Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Fri Dec 07 02:33:09 2012 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1TgmoL-0003xr-1J for geb-bug-gnu-emacs@m.gmane.org; Fri, 07 Dec 2012 02:33:05 +0100 Original-Received: from localhost ([::1]:40062 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Tgmo9-0008Jb-0Y for geb-bug-gnu-emacs@m.gmane.org; Thu, 06 Dec 2012 20:32:53 -0500 Original-Received: from eggs.gnu.org ([208.118.235.92]:34770) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Tgmo5-0008FO-GG for bug-gnu-emacs@gnu.org; Thu, 06 Dec 2012 20:32:50 -0500 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Tgmo2-0002Sq-PS for bug-gnu-emacs@gnu.org; Thu, 06 Dec 2012 20:32:49 -0500 Original-Received: from debbugs.gnu.org ([140.186.70.43]:47258) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Tgmo2-0002Sm-Mf for bug-gnu-emacs@gnu.org; Thu, 06 Dec 2012 20:32:46 -0500 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.72) (envelope-from ) id 1TgmoI-0007P4-9u for bug-gnu-emacs@gnu.org; Thu, 06 Dec 2012 20:33:02 -0500 X-Loop: help-debbugs@gnu.org Resent-From: Juri Linkov Original-Sender: debbugs-submit-bounces@debbugs.gnu.org Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Fri, 07 Dec 2012 01:33:02 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 13041 X-GNU-PR-Package: emacs X-GNU-PR-Keywords: Original-Received: via spool by 13041-submit@debbugs.gnu.org id=B13041.135484396028422 (code B ref 13041); Fri, 07 Dec 2012 01:33:02 +0000 Original-Received: (at 13041) by debbugs.gnu.org; 7 Dec 2012 01:32:40 +0000 Original-Received: from localhost ([127.0.0.1]:57508 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from ) id 1Tgmnv-0007OM-Dm for submit@debbugs.gnu.org; Thu, 06 Dec 2012 20:32:40 -0500 Original-Received: from ps18281.dreamhost.com ([69.163.218.105]:39225 helo=ps18281.dreamhostps.com) by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from ) id 1Tgmnr-0007O2-3V for 13041@debbugs.gnu.org; Thu, 06 Dec 2012 20:32:35 -0500 Original-Received: from localhost (ps18281.dreamhostps.com [69.163.218.105]) by ps18281.dreamhostps.com (Postfix) with ESMTP id 9041646FA012; Thu, 6 Dec 2012 17:32:17 -0800 (PST) In-Reply-To: <87ip8fjzwn.fsf@gnu.org> (Kenichi Handa's message of "Thu, 06 Dec 2012 18:25:12 +0900") User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/24.3.50 (x86_64-pc-linux-gnu) X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.13 Precedence: list X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6.x X-Received-From: 140.186.70.43 X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Original-Sender: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.bugs:68075 Archived-At: > Emacs contains ucs-normailze package which provides various > normalization functions. For instance, > > (require 'ucs-normalize) > (ucs-normalize-NFKD-string "=C3=84ffin") =3D> "A=CC=88ffin" > > Isn't it usable? This is usable to sort and compare strings, but I don't see how ucs-normalize.el could help in the search. I suppose the searched buffer can't be normalized before starting a search. So the search function somehow should be able to skip combining characters in the buffer. But to do this, the translation table needs to contain additional information about certain characters to ignore. Also the translation table should be able to map a sequence of characters like "ss" to "=C3=9F".