From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Eli Zaretskii Newsgroups: gmane.emacs.devel Subject: Re: extending case-fold-search to remove nonspacing marks (diacritics etc.) Date: Fri, 06 Feb 2015 12:04:00 +0200 Message-ID: <838ugbe4q7.fsf@gnu.org> References: <87fvakvwbf.fsf@lifelogs.com> <87386jx2m2.fsf@mail.linkov.net> <83fvajeb03.fsf@gnu.org> <83bnl7e5ri.fsf@gnu.org> Reply-To: Eli Zaretskii NNTP-Posting-Host: plane.gmane.org X-Trace: ger.gmane.org 1423217062 31733 80.91.229.3 (6 Feb 2015 10:04:22 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Fri, 6 Feb 2015 10:04:22 +0000 (UTC) Cc: juri@linkov.net, bruce.connor.am@gmail.com, emacs-devel@gnu.org To: Eli Zaretskii Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Fri Feb 06 11:04:22 2015 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1YJflt-0001PO-Ev for ged-emacs-devel@m.gmane.org; Fri, 06 Feb 2015 11:04:21 +0100 Original-Received: from localhost ([::1]:47437 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1YJfls-0002Qv-Lj for ged-emacs-devel@m.gmane.org; Fri, 06 Feb 2015 05:04:20 -0500 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:37485) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1YJflo-0002Qp-5z for emacs-devel@gnu.org; Fri, 06 Feb 2015 05:04:17 -0500 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1YJflm-0006y4-TF for emacs-devel@gnu.org; Fri, 06 Feb 2015 05:04:15 -0500 Original-Received: from mtaout20.012.net.il ([80.179.55.166]:45093) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1YJfli-0006x9-AA; Fri, 06 Feb 2015 05:04:10 -0500 Original-Received: from conversion-daemon.a-mtaout20.012.net.il by a-mtaout20.012.net.il (HyperSendmail v2007.08) id <0NJC00D00H2E0100@a-mtaout20.012.net.il>; Fri, 06 Feb 2015 12:04:02 +0200 (IST) Original-Received: from HOME-C4E4A596F7 ([87.69.4.28]) by a-mtaout20.012.net.il (HyperSendmail v2007.08) with ESMTPA id <0NJC00CDGHAQT870@a-mtaout20.012.net.il>; Fri, 06 Feb 2015 12:04:02 +0200 (IST) In-reply-to: <83bnl7e5ri.fsf@gnu.org> X-012-Sender: halo1@inter.net.il X-detected-operating-system: by eggs.gnu.org: Solaris 10 X-Received-From: 80.179.55.166 X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:182515 Archived-At: > Date: Fri, 06 Feb 2015 11:41:37 +0200 > From: Eli Zaretskii > Cc: emacs-devel@gnu.org, juri@linkov.net > > > Yes. By "understanding" I was referring to the c code necessary, not the > > database. I've actually got quite familiar with the database. :-P > > OK, then please don't hesitate to post questions and ask for help, > including for writing some code, if needed. This doesn't have to be a > single-person effort (unless you want it to be ;-). Certainly advice > and answers to questions are abundantly available here, for any code > that is in the core, and for issues related to Unicode. Btw, an alternative idea which might be worth exploring is to use string-collate-equalp for comparison during search, or wcscoll it calls on the C level. That will use the character databases of the underlying libraries, instead of using the Emacs's own databases (and so will be less prone to customization), but that might be good enough in this case.