From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: "Drew Adams" Newsgroups: gmane.emacs.help Subject: RE: diacritic-fold-search? Date: Thu, 29 Nov 2012 09:39:25 -0800 Message-ID: <6112F2D7CC7B4F01ABEACDCBD2DA50ED@us.oracle.com> References: NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit X-Trace: ger.gmane.org 1354210787 1122 80.91.229.3 (29 Nov 2012 17:39:47 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Thu, 29 Nov 2012 17:39:47 +0000 (UTC) To: "'Lewis Perin'" , Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Thu Nov 29 18:39:59 2012 Return-path: Envelope-to: geh-help-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1Te85d-0005aL-Kx for geh-help-gnu-emacs@m.gmane.org; Thu, 29 Nov 2012 18:39:57 +0100 Original-Received: from localhost ([::1]:37156 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Te85S-0002YX-CF for geh-help-gnu-emacs@m.gmane.org; Thu, 29 Nov 2012 12:39:46 -0500 Original-Received: from eggs.gnu.org ([208.118.235.92]:43727) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Te85H-0002YR-Qy for help-gnu-emacs@gnu.org; Thu, 29 Nov 2012 12:39:41 -0500 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Te85C-0005hx-0R for help-gnu-emacs@gnu.org; Thu, 29 Nov 2012 12:39:35 -0500 Original-Received: from userp1040.oracle.com ([156.151.31.81]:23838) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Te85B-0005hp-PY for help-gnu-emacs@gnu.org; Thu, 29 Nov 2012 12:39:29 -0500 Original-Received: from acsinet21.oracle.com (acsinet21.oracle.com [141.146.126.237]) by userp1040.oracle.com (Sentrion-MTA-4.2.2/Sentrion-MTA-4.2.2) with ESMTP id qATHdRN6002651 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=OK); Thu, 29 Nov 2012 17:39:28 GMT Original-Received: from acsmt356.oracle.com (acsmt356.oracle.com [141.146.40.156]) by acsinet21.oracle.com (8.14.4+Sun/8.14.4) with ESMTP id qATHdQcL029731 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO); Thu, 29 Nov 2012 17:39:27 GMT Original-Received: from abhmt119.oracle.com (abhmt119.oracle.com [141.146.116.71]) by acsmt356.oracle.com (8.12.11.20060308/8.12.11) with ESMTP id qATHdQRE000791; Thu, 29 Nov 2012 11:39:26 -0600 Original-Received: from dradamslap1 (/130.35.178.8) by default (Oracle Beehive Gateway v4.0) with ESMTP ; Thu, 29 Nov 2012 09:39:26 -0800 X-Mailer: Microsoft Office Outlook 11 X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2900.6157 In-Reply-To: Thread-Index: Ac3OVn8w3qjIrmmNT7WLlA6GoqYz1QAAEUzA X-Source-IP: acsinet21.oracle.com [141.146.126.237] X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.4.x-2.6.x [generic] X-Received-From: 156.151.31.81 X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Original-Sender: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.help:87942 Archived-At: > Is there a way to search ignoring diacritics, e.g. capturing "apres" > both with and without an accent grave over the "e"? Great question. I don't think so, but I'm guessing that lots of users could make good use of such a feature! Unless someone points out here that this is already possible, why don't you submit an enhancement request for this feature (`M-x report-emacs-bug' is also for enhancement requests): be able to toggle Isearch distinguishing certain sets of similar chars (diacritics). There could be predefined sets of equivalence classes of chars (e.g., the same letter, modulo diacritical marks). And users could be able to customize these classes. Likewise, for punctuation chars that are very similar (in purpose/visually), such as straight quotes and curly quotes, and no-break hyphen, hyphen, and the various dashes. Likewise, for whitespace chars other than the standard SPC, TAB, etc. For whitespace, I believe there might be some handling of additional chars such as no-break space, but what's needed, here too, is a simple way to toggle distinguishing them on/off. But your use case is the best one: be able to optionally ignore diacritical marks when searching.