From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: "Drew Adams" Newsgroups: gmane.emacs.bugs Subject: bug#13041: 24.2; diacritic-fold-search Date: Wed, 5 Dec 2012 10:00:14 -0800 Message-ID: References: <20121130182205.C722F14B8D@panix1.panix.com><87hao69b5r.fsf@mail.jurta.org><20665.8224.844876.619203@panix5.panix.com><87hao6zko4.fsf@mail.jurta.org><83fw3qtboc.fsf@gnu.org><87hao5jqu3.fsf@mail.jurta.org><50BB93C2.1050007@gmx.at><83y5hgs564.fsf@gnu.org><50BC7BF5.2020400@gmx.at><83hao3rskd.fsf@gnu.org><50BCE49D.6010001@gmx.at><837gozrp8f.fsf@gnu.org><50BE38F3.3030907@gmx.at><3E2D742BA0FC44B7A61665D85AAC3712@us.oracle.com><50BF1702.4020100@gmx.at><611DD154E83240D183A7B5B88691DC37@us.oracle.com> <8164D22E74F94504B41247F314787E10@us.oracle.com> NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit X-Trace: ger.gmane.org 1354730467 6526 80.91.229.3 (5 Dec 2012 18:01:07 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Wed, 5 Dec 2012 18:01:07 +0000 (UTC) Cc: perin@panix.com, 13041@debbugs.gnu.org, perin@acm.org To: "'martin rudalics'" Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Wed Dec 05 19:01:19 2012 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1TgJHZ-0001V0-VP for geb-bug-gnu-emacs@m.gmane.org; Wed, 05 Dec 2012 19:01:18 +0100 Original-Received: from localhost ([::1]:59057 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1TgJHO-0007Gi-4r for geb-bug-gnu-emacs@m.gmane.org; Wed, 05 Dec 2012 13:01:06 -0500 Original-Received: from eggs.gnu.org ([208.118.235.92]:37947) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1TgJHH-0007E7-Aw for bug-gnu-emacs@gnu.org; Wed, 05 Dec 2012 13:01:05 -0500 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1TgJHC-000115-Om for bug-gnu-emacs@gnu.org; Wed, 05 Dec 2012 13:00:59 -0500 Original-Received: from debbugs.gnu.org ([140.186.70.43]:44985) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1TgJHC-000111-LO for bug-gnu-emacs@gnu.org; Wed, 05 Dec 2012 13:00:54 -0500 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.72) (envelope-from ) id 1TgJHK-00055I-3c for bug-gnu-emacs@gnu.org; Wed, 05 Dec 2012 13:01:02 -0500 X-Loop: help-debbugs@gnu.org Resent-From: "Drew Adams" Original-Sender: debbugs-submit-bounces@debbugs.gnu.org Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Wed, 05 Dec 2012 18:01:02 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 13041 X-GNU-PR-Package: emacs X-GNU-PR-Keywords: Original-Received: via spool by 13041-submit@debbugs.gnu.org id=B13041.135473043819513 (code B ref 13041); Wed, 05 Dec 2012 18:01:02 +0000 Original-Received: (at 13041) by debbugs.gnu.org; 5 Dec 2012 18:00:38 +0000 Original-Received: from localhost ([127.0.0.1]:55236 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from ) id 1TgJGt-00054c-VE for submit@debbugs.gnu.org; Wed, 05 Dec 2012 13:00:37 -0500 Original-Received: from userp1040.oracle.com ([156.151.31.81]:20993) by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from ) id 1TgJGq-00054T-FZ for 13041@debbugs.gnu.org; Wed, 05 Dec 2012 13:00:33 -0500 Original-Received: from ucsinet22.oracle.com (ucsinet22.oracle.com [156.151.31.94]) by userp1040.oracle.com (Sentrion-MTA-4.2.2/Sentrion-MTA-4.2.2) with ESMTP id qB5I0HC4022071 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=OK); Wed, 5 Dec 2012 18:00:17 GMT Original-Received: from acsmt356.oracle.com (acsmt356.oracle.com [141.146.40.156]) by ucsinet22.oracle.com (8.14.4+Sun/8.14.4) with ESMTP id qB5I0Grm007615 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO); Wed, 5 Dec 2012 18:00:16 GMT Original-Received: from abhmt106.oracle.com (abhmt106.oracle.com [141.146.116.58]) by acsmt356.oracle.com (8.12.11.20060308/8.12.11) with ESMTP id qB5I0FH7009706; Wed, 5 Dec 2012 12:00:15 -0600 Original-Received: from dradamslap1 (/130.35.178.8) by default (Oracle Beehive Gateway v4.0) with ESMTP ; Wed, 05 Dec 2012 10:00:15 -0800 X-Mailer: Microsoft Office Outlook 11 In-Reply-To: <8164D22E74F94504B41247F314787E10@us.oracle.com> Thread-Index: Ac3SzNkIeOhfShQLQAypKA2c11bTAgAKwFpwAATZzQAAAWVdYA== X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2900.6157 X-Source-IP: ucsinet22.oracle.com [156.151.31.94] X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.13 Precedence: list X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6.x X-Received-From: 140.186.70.43 X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Original-Sender: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.bugs:67976 Archived-At: FWIW - Some more browsing on the topic tells me that what we are trying to come up with here is a predicate for the NFKD canonical ordering (as applied to a char sequence, not to a single char). IOW, a string-ordering predicate that uses the canonical ordering for a character's decomposed normal code point sequence. We are using compatibility normalization, not canonical normalization. So a search (or a string comparison test) for `f' will match the ligature `ffi' (whereas it would not match wrt canonical normalization). Someone please correct me if any of this is wrong.