From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: martin rudalics Newsgroups: gmane.emacs.bugs Subject: bug#13041: 24.2; diacritic-fold-search Date: Sun, 09 Dec 2012 18:52:17 +0100 Message-ID: <50C4CFD1.9000101@gmx.at> References: <20121130182205.C722F14B8D@panix1.panix.com> <87ip8fjzwn.fsf@gnu.org> <871uf2647i.fsf@mail.jurta.org> <50C1C6CC.9020103@gmx.at> <87ehj18l9p.fsf@mail.jurta.org> <50C322CC.1000806@gmx.at> <87ip8cz2zu.fsf@mail.jurta.org> NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: quoted-printable X-Trace: ger.gmane.org 1355075619 30593 80.91.229.3 (9 Dec 2012 17:53:39 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Sun, 9 Dec 2012 17:53:39 +0000 (UTC) Cc: 13041@debbugs.gnu.org, perin@panix.com, perin@acm.org To: Juri Linkov Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Sun Dec 09 18:53:51 2012 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1Thl4X-0006Y1-8n for geb-bug-gnu-emacs@m.gmane.org; Sun, 09 Dec 2012 18:53:49 +0100 Original-Received: from localhost ([::1]:54100 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Thl4K-0004wi-Ta for geb-bug-gnu-emacs@m.gmane.org; Sun, 09 Dec 2012 12:53:36 -0500 Original-Received: from eggs.gnu.org ([208.118.235.92]:57276) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Thl4H-0004wA-Hc for bug-gnu-emacs@gnu.org; Sun, 09 Dec 2012 12:53:34 -0500 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Thl4G-0003c5-EK for bug-gnu-emacs@gnu.org; Sun, 09 Dec 2012 12:53:33 -0500 Original-Received: from debbugs.gnu.org ([140.186.70.43]:52298) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Thl4G-0003c0-Ax for bug-gnu-emacs@gnu.org; Sun, 09 Dec 2012 12:53:32 -0500 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.72) (envelope-from ) id 1Thl4k-0007Yf-Ma for bug-gnu-emacs@gnu.org; Sun, 09 Dec 2012 12:54:02 -0500 X-Loop: help-debbugs@gnu.org Resent-From: martin rudalics Original-Sender: debbugs-submit-bounces@debbugs.gnu.org Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Sun, 09 Dec 2012 17:54:02 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 13041 X-GNU-PR-Package: emacs X-GNU-PR-Keywords: Original-Received: via spool by 13041-submit@debbugs.gnu.org id=B13041.135507558428971 (code B ref 13041); Sun, 09 Dec 2012 17:54:02 +0000 Original-Received: (at 13041) by debbugs.gnu.org; 9 Dec 2012 17:53:04 +0000 Original-Received: from localhost ([127.0.0.1]:34313 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from ) id 1Thl3o-0007XE-7v for submit@debbugs.gnu.org; Sun, 09 Dec 2012 12:53:04 -0500 Original-Received: from mailout-de.gmx.net ([213.165.64.22]:54253) by debbugs.gnu.org with smtp (Exim 4.72) (envelope-from ) id 1Thl3k-0007Wm-MF for 13041@debbugs.gnu.org; Sun, 09 Dec 2012 12:53:01 -0500 Original-Received: (qmail invoked by alias); 09 Dec 2012 17:52:28 -0000 Original-Received: from 62-47-58-158.adsl.highway.telekom.at (EHLO [62.47.58.158]) [62.47.58.158] by mail.gmx.net (mp038) with SMTP; 09 Dec 2012 18:52:28 +0100 X-Authenticated: #14592706 X-Provags-ID: V01U2FsdGVkX188XZs0a648yTFv3wF+/lywp3vgSp7mCrwb+eWYtJ nhru0oNK/0xRlA In-Reply-To: <87ip8cz2zu.fsf@mail.jurta.org> X-Y-GMX-Trusted: 0 X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.13 Precedence: list X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6.x X-Received-From: 140.186.70.43 X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Original-Sender: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.bugs:68242 Archived-At: > OTOH, instead of using an approach of matching only a full match > like in Chromium, we could do like GEdit and OpenOffice that > match the whole ligature character in a partial match > (i.e. to match "=EF=AC=80" when the search string is just "f"). Strictly spoken, they should match the first "f" in "=EF=AC=80". When ma= tching "suf" against "su=EF=AC=80er", the `match-string' would be "suf", with `match-end' after "=EF=AC=80". That is, the match length would not incre= ase when adding an "f" to the search string now. But I don't know what `match-string' should return - "su=EF=AC=80" or "suff". > Though this has a problem of highlighting the whole character for > a partial match that looks wrong, but perhaps no one can do better. We needed a display string "ff" replacing "=EF=AC=80" during highlighting= and highlight only the first "f" in it. > Yes, this is what I meant too. It is surprising but > http://www.unicode.org/Public/UNIDATA/CaseFolding.txt > defines the equivalence of "=C3=9F" and "ss" (lower case "s") > instead of case-folding. The following line in CaseFolding.txt: > > 00DF; F; 0073 0073; # LATIN SMALL LETTER SHARP S > > maps 00DF (LATIN SMALL LETTER SHARP S) to two characters > 0073 0073 (LATIN SMALL LETTER S) keeping the lower case. > Maybe this is a bug in Unicode data? Maybe it's explained here http://www.unicode.org/faq/idn.html in the answer to Q: Why does IDNA2003 map final sigma (=CF=82) to sigma (=CF=83), map e= szett (=C3=9F) to "ss", and delete ZWJ/ZWNJ? One possible interpretation of this is that mapping "=C3=9F" to "SS" woul= d imply that downcasing "SS" should produce "=C3=9F" and this is unwanted. = But I still wonder whether we are supposed to apply mappings recursively. martin