From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Juri Linkov Newsgroups: gmane.emacs.bugs Subject: bug#13084: boyer_moore crashes with certain characters in the case table Date: Wed, 12 Dec 2012 11:27:50 +0200 Organization: JURTA Message-ID: <871uevsknt.fsf@mail.jurta.org> References: <87txs1l4kg.fsf@mail.jurta.org> <831uewa9cq.fsf@gnu.org> <87txrsw55b.fsf@mail.jurta.org> <83lid39b6v.fsf@gnu.org> NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: quoted-printable X-Trace: ger.gmane.org 1355304931 27993 80.91.229.3 (12 Dec 2012 09:35:31 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Wed, 12 Dec 2012 09:35:31 +0000 (UTC) Cc: 13084@debbugs.gnu.org To: Eli Zaretskii Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Wed Dec 12 10:35:44 2012 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1Tiij9-0000mv-AH for geb-bug-gnu-emacs@m.gmane.org; Wed, 12 Dec 2012 10:35:43 +0100 Original-Received: from localhost ([::1]:43839 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Tiiiw-0001nP-Hn for geb-bug-gnu-emacs@m.gmane.org; Wed, 12 Dec 2012 04:35:30 -0500 Original-Received: from eggs.gnu.org ([208.118.235.92]:44685) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Tiiio-0001mf-Si for bug-gnu-emacs@gnu.org; Wed, 12 Dec 2012 04:35:29 -0500 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Tiiii-00075Q-TE for bug-gnu-emacs@gnu.org; Wed, 12 Dec 2012 04:35:22 -0500 Original-Received: from debbugs.gnu.org ([140.186.70.43]:56138) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Tiiii-00075K-QA for bug-gnu-emacs@gnu.org; Wed, 12 Dec 2012 04:35:16 -0500 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.72) (envelope-from ) id 1TiijS-0003Vj-7t for bug-gnu-emacs@gnu.org; Wed, 12 Dec 2012 04:36:02 -0500 X-Loop: help-debbugs@gnu.org Resent-From: Juri Linkov Original-Sender: debbugs-submit-bounces@debbugs.gnu.org Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Wed, 12 Dec 2012 09:36:02 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 13084 X-GNU-PR-Package: emacs X-GNU-PR-Keywords: Original-Received: via spool by 13084-submit@debbugs.gnu.org id=B13084.135530492413433 (code B ref 13084); Wed, 12 Dec 2012 09:36:02 +0000 Original-Received: (at 13084) by debbugs.gnu.org; 12 Dec 2012 09:35:24 +0000 Original-Received: from localhost ([127.0.0.1]:38154 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from ) id 1Tiiip-0003Ub-V0 for submit@debbugs.gnu.org; Wed, 12 Dec 2012 04:35:24 -0500 Original-Received: from ps18281.dreamhost.com ([69.163.218.105]:35209 helo=ps18281.dreamhostps.com) by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from ) id 1Tiiin-0003UO-Ps for 13084@debbugs.gnu.org; Wed, 12 Dec 2012 04:35:22 -0500 Original-Received: from localhost (ps18281.dreamhostps.com [69.163.218.105]) by ps18281.dreamhostps.com (Postfix) with ESMTP id B7419451C15E; Wed, 12 Dec 2012 01:34:33 -0800 (PST) In-Reply-To: <83lid39b6v.fsf@gnu.org> (Eli Zaretskii's message of "Wed, 12 Dec 2012 05:55:04 +0200") User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/24.3.50 (x86_64-pc-linux-gnu) X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.13 Precedence: list X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6.x X-Received-From: 140.186.70.43 X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Original-Sender: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.bugs:68376 Archived-At: >> Does this mean there are no more obstacles to filling a translation ta= ble >> for ignoring equivalence with all character mappings according to the >> `decomposition' property? This would be the first step in this direct= ion. > > I'm not sure I understand what you are asking. Please show more detail= s. There is confusion with the word `equivalence'. Currently there exists the case equivalence table in the case table (`case_eqv_table'). Implementing a diacritic search in bug#13041 requires adding a new similar table. I don't know what would be a good name: `decomposition_eqv_table' or `normalization_eqv_table' or something bette= r. I'm unfamiliar with the details of `search_buffer', but in principle using two tables in the macro `TRANSLATE' could implement a diacritic search where at the first step the character will be translated using `decomposition_eqv_table', and after that the resulting character will be translated using `case_eqv_table'. So the dataflow to get the canonical character will be =C1 -> A -> a. If `case-fold-search' is nil, then =C1 -> A. If a new variable `decomposition-search' (or `normalized-search') is nil then =C1 -> =E1.