From: Eli Zaretskii
Newsgroups: gmane.emacs.bugs
Subject: bug#13041: 24.2; diacritic-fold-search
Date: Fri, 07 Dec 2012 08:33:04 +0200
Message-ID: <83pq2mnzhb.fsf@gnu.org>
References: <20121130182205.C722F14B8D@panix1.panix.com> <87ip8fjzwn.fsf@gnu.org> <871uf2647i.fsf@mail.jurta.org>
In-reply-to: <871uf2647i.fsf@mail.jurta.org>
To: Juri Linkov
Cc: 13041@debbugs.gnu.org, perin@panix.com, perin@acm.org

> From: Juri Linkov
> Date: Fri, 07 Dec 2012 02:58:17 +0200
> Cc: perin@panix.com, 13041@debbugs.gnu.org, perin@acm.org
>
> > Emacs contains ucs-normalize package which provides various
> > normalization functions. For instance,
> >
> > (require 'ucs-normalize)
> > (ucs-normalize-NFKD-string "Äffin") => "Äffin"
> >
> > Isn't it usable?
>
> This is usable to sort and compare strings, but I don't see
> how ucs-normalize.el could help in the search.

I agree.

> I suppose the searched buffer can't be normalized before starting a
> search.

Yes, that's not acceptable.

> So the search function somehow should be able to skip combining
> characters in the buffer. But to do this, the translation table needs
> to contain additional information about certain characters to ignore.

Right. This is very similar to how the search primitives currently
use the case tables, except that they don't skip characters. But
adding such a skip operation should be easy.

> Also the translation table should be able to map a sequence of
> characters like "ss" to "ß".

I'd say the other way around: map ß to ss.
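
To make the intended equivalences concrete, here is a minimal string-level
sketch. It is only an illustration of the folding being discussed (ß mapped
to ss, combining characters ignored), not a proposal for how the search
primitives or their translation tables would do it. The name
my-fold-string is hypothetical, and treating U+0300..U+036F as "the
combining characters" is a simplifying assumption.

```elisp
(require 'ucs-normalize)

(defun my-fold-string (string)
  "Return STRING folded for diacritic-insensitive comparison.
Hypothetical helper: maps ß to ss, decomposes with NFKD, then
drops combining marks in the range U+0300..U+036F."
  (let* (;; NFKD leaves ß alone, so expand it explicitly first.
         (expanded (replace-regexp-in-string "ß" "ss" string t t))
         ;; Split precomposed characters into base + combining marks.
         (decomposed (ucs-normalize-NFKD-string expanded)))
    ;; "Skip" the combining marks by deleting them from the copy.
    (replace-regexp-in-string "[\u0300-\u036F]" "" decomposed t t)))

(my-fold-string "Äffin")   ; => "Affin"
(my-fold-string "Straße")  ; => "Strasse"
```

Two strings that fold to the same result would be considered a match.
A real implementation would have to apply the same folding while scanning
the buffer, without modifying or copying it, which is the part the skip
operation mentioned above would cover.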