From mboxrd@z Thu Jan 1 00:00:00 1970
From: Juri Linkov
Newsgroups: gmane.emacs.bugs
Subject: bug#13084: boyer_moore crashes with certain characters in the case table
Date: Wed, 05 Dec 2012 02:34:39 +0200
Organization: JURTA
Message-ID: <87txs1l4kg.fsf@mail.jurta.org>
To: 13084@debbugs.gnu.org
User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/24.3.50 (x86_64-pc-linux-gnu)

The minimal reproducible recipe for crashes in boyer_moore
noticed in bug#13041:

1. emacs -Q

2. Eval in *scratch*:

(let ((table (standard-case-table))
      canon)
  (setq canon (copy-sequence table))
  (aset canon #xff59 ?y)
  (set-char-table-extra-slot table 1 canon)
  (set-char-table-extra-slot table 2 nil)
  (set-standard-case-table table))

3. Start an activity that includes a search, e.g. `C-x 8 RET TAB'

The crash in boyer_moore is caused by fullwidth characters like #xff59
whose Unicode properties are:

  name: FULLWIDTH LATIN SMALL LETTER Y
  decomposition: (wide 121) (wide 'y')

However, the crash doesn't occur when the same fullwidth characters
are set to their downcase counterparts in
lisp/international/characters.el:

  ;; Fullwidth Latin
  (setq c #xff21)
  (while (<= c #xff3a)
    (set-case-syntax-pair c (+ c #x20) tbl)
    (modify-category-entry c ?l)
    (modify-category-entry (+ c #x20) ?l)
    (setq c (1+ c)))
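
For reference, the Unicode properties quoted above can be checked from Lisp before trying the recipe. This is only a sketch: the helper name `my-char-unicode-summary' is made up for illustration, and it assumes `get-char-code-property' reports the `name' and `decomposition' properties in the form shown above.

```elisp
;; Hypothetical helper, not part of Emacs: collect the Unicode
;; properties of CHAR that are relevant to this report.
(defun my-char-unicode-summary (char)
  "Return a plist with the name, decomposition and base char of CHAR."
  (let ((decomposition (get-char-code-property char 'decomposition)))
    (list :name (get-char-code-property char 'name)
          :decomposition decomposition
          ;; For a compatibility decomposition such as (wide 121), the
          ;; first element is the tag and the second is the base char.
          :base (if (symbolp (car decomposition))
                    (cadr decomposition)
                  (car decomposition)))))

;; Evaluate in *scratch* to inspect the character used in the recipe.
(my-char-unicode-summary #xff59)
```

Comparing the `:base' value with ?y shows that the recipe maps #xff59 to the same ASCII character that its decomposition names.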