From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: martin rudalics Newsgroups: gmane.emacs.bugs Subject: bug#13041: 24.2; diacritic-fold-search Date: Thu, 06 Dec 2012 11:34:26 +0100 Message-ID: <50C074B2.60808@gmx.at> References: <87ip8fjzwn.fsf@gnu.org> NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: quoted-printable X-Trace: ger.gmane.org 1354790108 30156 80.91.229.3 (6 Dec 2012 10:35:08 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Thu, 6 Dec 2012 10:35:08 +0000 (UTC) Cc: perin@panix.com, perin@acm.org, 13041@debbugs.gnu.org To: Kenichi Handa Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Thu Dec 06 11:35:20 2012 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1TgYnY-0005b1-0v for geb-bug-gnu-emacs@m.gmane.org; Thu, 06 Dec 2012 11:35:20 +0100 Original-Received: from localhost ([::1]:44834 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1TgYnL-0002Im-Ml for geb-bug-gnu-emacs@m.gmane.org; Thu, 06 Dec 2012 05:35:07 -0500 Original-Received: from eggs.gnu.org ([208.118.235.92]:35887) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1TgYnD-0002FY-2F for bug-gnu-emacs@gnu.org; Thu, 06 Dec 2012 05:35:04 -0500 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1TgYn3-0005K7-J6 for bug-gnu-emacs@gnu.org; Thu, 06 Dec 2012 05:34:59 -0500 Original-Received: from debbugs.gnu.org ([140.186.70.43]:45662) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1TgYn3-0005K3-G2 for bug-gnu-emacs@gnu.org; Thu, 06 Dec 2012 05:34:49 -0500 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.72) (envelope-from ) id 1TgYnF-00023v-H5 for bug-gnu-emacs@gnu.org; Thu, 06 Dec 2012 05:35:01 -0500 X-Loop: help-debbugs@gnu.org Resent-From: martin rudalics Original-Sender: debbugs-submit-bounces@debbugs.gnu.org Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Thu, 06 Dec 2012 10:35:01 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 13041 X-GNU-PR-Package: emacs X-GNU-PR-Keywords: Original-Received: via spool by 13041-submit@debbugs.gnu.org id=B13041.13547900867906 (code B ref 13041); Thu, 06 Dec 2012 10:35:01 +0000 Original-Received: (at 13041) by debbugs.gnu.org; 6 Dec 2012 10:34:46 +0000 Original-Received: from localhost ([127.0.0.1]:55913 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from ) id 1TgYn0-00023T-57 for submit@debbugs.gnu.org; Thu, 06 Dec 2012 05:34:46 -0500 Original-Received: from mailout-de.gmx.net ([213.165.64.22]:35115) by debbugs.gnu.org with smtp (Exim 4.72) (envelope-from ) id 1TgYmx-00023H-IQ for 13041@debbugs.gnu.org; Thu, 06 Dec 2012 05:34:44 -0500 Original-Received: (qmail invoked by alias); 06 Dec 2012 10:34:30 -0000 Original-Received: from 62-47-51-163.adsl.highway.telekom.at (EHLO [62.47.51.163]) [62.47.51.163] by mail.gmx.net (mp028) with SMTP; 06 Dec 2012 11:34:30 +0100 X-Authenticated: #14592706 X-Provags-ID: V01U2FsdGVkX1+qtVAEjVSqEXUZnP3oofy2aKZyW3uH8kMAGucwOF WCXXnwTtJ8B0CU In-Reply-To: <87ip8fjzwn.fsf@gnu.org> X-Y-GMX-Trusted: 0 X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.13 Precedence: list X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6.x X-Received-From: 140.186.70.43 X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Original-Sender: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.bugs:68023 Archived-At: > Emacs contains ucs-normailze package which provides various > normalization functions. For instance, > > (require 'ucs-normalize) > (ucs-normalize-NFKD-string "=C3=84ffin") =3D> "A=CC=88ffin" > > Isn't it usable? Actually, the function should do what we need. But I have no idea how to integrate it into a searching algorithm. And when sorting, it seems expensive for comparing buffer substrings. Also, the use of a temporary buffer for normalizing every single string makes its weight quite heavy. In any case, I would probably steal the entire decomposition property handling part from it. So thanks a lot for this hint. martin