From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Artur Malabarba Newsgroups: gmane.emacs.devel Subject: Re: extending case-fold-search to remove nonspacing marks (diacritics etc.) Date: Fri, 6 Feb 2015 08:03:15 -0200 Message-ID: References: <87fvakvwbf.fsf@lifelogs.com> <87386jx2m2.fsf@mail.linkov.net> <83fvajeb03.fsf@gnu.org> <83bnl7e5ri.fsf@gnu.org> Reply-To: bruce.connor.am@gmail.com NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8 X-Trace: ger.gmane.org 1423217006 30841 80.91.229.3 (6 Feb 2015 10:03:26 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Fri, 6 Feb 2015 10:03:26 +0000 (UTC) Cc: emacs-devel , Juri Linkov To: Eli Zaretskii Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Fri Feb 06 11:03:24 2015 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1YJfky-0000uh-0g for ged-emacs-devel@m.gmane.org; Fri, 06 Feb 2015 11:03:24 +0100 Original-Received: from localhost ([::1]:47433 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1YJfkx-0001dC-F0 for ged-emacs-devel@m.gmane.org; Fri, 06 Feb 2015 05:03:23 -0500 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:37122) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1YJfks-0001aQ-2w for emacs-devel@gnu.org; Fri, 06 Feb 2015 05:03:19 -0500 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1YJfkr-0006gm-Aa for emacs-devel@gnu.org; Fri, 06 Feb 2015 05:03:18 -0500 Original-Received: from mail-ob0-x232.google.com ([2607:f8b0:4003:c01::232]:54232) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1YJfkq-0006gK-6k; Fri, 06 Feb 2015 05:03:16 -0500 Original-Received: by mail-ob0-f178.google.com with SMTP id uz6so12281638obc.9; Fri, 06 Feb 2015 02:03:15 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:reply-to:sender:in-reply-to:references:date:message-id :subject:from:to:cc:content-type; bh=Q9Y6vr7BMFfNz4TJhVATVdYBKrN+tYBS3ZmXZ9Pf5ok=; b=vnbneIVDou8hMxSizN41QBJCgZYfh/gu1eoVFAp6DYU5WOUPHiGMyO/3wMSzcXb4wC O2PZ2W0KpQ0mFAWMUSgyFzCqhwcsuCdSPJKb7p+U/wZZqJUjJgjy3I/veiQwWE3fVE1d UnRmyP0g3qR1SE7MZIDDA/MNPBk50LC64J8LDDm6WfKMdtiBDtYdETLRK4G5iFT3jIA7 KZPxC75KinmQcl5NuQARGSVk1RjCaVNqffhqHNlWFEPs/tK1g4Ek9nFVf+mqpM09aqeF 7ReeyCeSgi6oqdRA+8ABFFSQX7xIxXBb5tYoSTGnzbCtzS0QNcDYdUpMUP2eP3o7x5B2 8ACg== X-Received: by 10.182.230.165 with SMTP id sz5mr1912175obc.29.1423216995700; Fri, 06 Feb 2015 02:03:15 -0800 (PST) Original-Received: by 10.76.125.1 with HTTP; Fri, 6 Feb 2015 02:03:15 -0800 (PST) In-Reply-To: <83bnl7e5ri.fsf@gnu.org> X-Google-Sender-Auth: k6nazAVkIOUpcsQA_mBVyVa5Kyc X-detected-operating-system: by eggs.gnu.org: Error: Malformed IPv6 address (bad octet value). X-Received-From: 2607:f8b0:4003:c01::232 X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:182514 Archived-At: > OK, then please don't hesitate to post questions and ask for help, > including for writing some code, if needed. This doesn't have to be a > single-person effort (unless you want it to be ;-). Certainly advice > and answers to questions are abundantly available here, for any code > that is in the core, and for issues related to Unicode. Yes, I've been meaning to. I have 3 solutions which I can (as in "I have enough motivation to") implement. One of them can handle multi-char symbols, but it's only for isearch would need to be specifically extended to other searches (it's essentially an improvement of the previous regexp proposal). The other two options use char tables, so they are faster and immediately apply to any searching but only handle single-char symbols. I'll write up an email with diffs and help requests next time I have a minute.