From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: don@donarmstrong.com (Emacs bug Tracking System) Newsgroups: gmane.emacs.bugs Subject: bug#540: marked as done (23.0.60; Unicode search bug) Date: Wed, 27 Aug 2008 07:40:04 -0700 Message-ID: References: <87wsi2a5mn.fsf@cyd.mit.edu> <87ej66q2os.fsf@jurta.org> NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="----------=_1219848004-27756-0" X-Trace: ger.gmane.org 1219848541 20736 80.91.229.12 (27 Aug 2008 14:49:01 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Wed, 27 Aug 2008 14:49:01 +0000 (UTC) To: Chong Yidong Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Wed Aug 27 16:49:54 2008 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by lo.gmane.org with esmtp (Exim 4.50) id 1KYMLE-0003eg-5j for geb-bug-gnu-emacs@m.gmane.org; Wed, 27 Aug 2008 16:49:48 +0200 Original-Received: from localhost ([127.0.0.1]:51483 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1KYMKF-0001VC-Uz for geb-bug-gnu-emacs@m.gmane.org; Wed, 27 Aug 2008 10:48:48 -0400 Original-Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1KYMJ3-00015U-A1 for bug-gnu-emacs@gnu.org; Wed, 27 Aug 2008 10:47:33 -0400 Original-Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43) id 1KYMJ2-000156-HQ for bug-gnu-emacs@gnu.org; Wed, 27 Aug 2008 10:47:32 -0400 Original-Received: from [199.232.76.173] (port=41564 helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1KYMJ2-00014q-At for bug-gnu-emacs@gnu.org; Wed, 27 Aug 2008 10:47:32 -0400 Original-Received: from rzlab.ucr.edu ([138.23.92.77]:35031) by monty-python.gnu.org with esmtps (TLS-1.0:DHE_RSA_AES_256_CBC_SHA1:32) (Exim 4.60) (envelope-from ) id 1KYMJ1-0002Nn-HB for bug-gnu-emacs@gnu.org; Wed, 27 Aug 2008 10:47:32 -0400 Original-Received: from rzlab.ucr.edu (rzlab.ucr.edu [127.0.0.1]) by rzlab.ucr.edu (8.13.8/8.13.8/Debian-3) with ESMTP id m7RElSFV030658; Wed, 27 Aug 2008 07:47:29 -0700 Original-Received: (from debbugs@localhost) by rzlab.ucr.edu (8.13.8/8.13.8/Submit) id m7REe4Nx027856; Wed, 27 Aug 2008 07:40:04 -0700 X-Mailer: MIME-tools 5.420 (Entity 5.420) X-Loop: don@donarmstrong.com X-Emacs-PR-Message: closed 540 X-Emacs-PR-Package: emacs X-detected-kernel: by monty-python.gnu.org: Linux 2.6 (newer, 3) X-BeenThere: bug-gnu-emacs@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.bugs:19766 Archived-At: This is a multi-part message in MIME format... ------------=_1219848004-27756-0 Content-Disposition: inline Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Your message dated Wed, 27 Aug 2008 10:34:40 -0400 with message-id <87wsi2a5mn.fsf@cyd.mit.edu> and subject line Re: bug#540: 23.0.60; Unicode search bug has caused the Emacs bug report #540, regarding 23.0.60; Unicode search bug to be marked as done. This means that you claim that the problem has been dealt with. If this is not the case it is now your responsibility to reopen the bug report if necessary, and/or fix the problem forthwith. (NB: If you are a system administrator and have no idea what this message is talking about, this may indicate a serious mail system misconfiguration somewhere. Please contact don@donarmstrong.com immediately.) --=20 540: http://emacsbugs.donarmstrong.com/cgi-bin/bugreport.cgi?bug=3D540 Emacs Bug Tracking System Contact don@donarmstrong.com with problems ------------=_1219848004-27756-0 Content-Type: message/rfc822 Content-Disposition: inline X-Spam-Checker-Version: SpamAssassin 3.2.3-bugs.debian.org_2005_01_02 (2007-08-08) on rzlab.ucr.edu X-Spam-Level: X-Spam-Status: No, score=-2.5 required=4.0 tests=AWL,BAYES_00,FOURLA,KOI8R, RCVD_IN_DNSWL_MED autolearn=ham version=3.2.3-bugs.debian.org_2005_01_02 Received: (at submit) by emacsbugs.donarmstrong.com; 6 Jul 2008 18:45:35 +0000 Received: from fencepost.gnu.org (fencepost.gnu.org [140.186.70.10]) by rzlab.ucr.edu (8.13.8/8.13.8/Debian-3) with ESMTP id m66IjRPG031052 for ; Sun, 6 Jul 2008 11:45:28 -0700 Received: from mx10.gnu.org ([199.232.76.166]:58619) by fencepost.gnu.org with esmtp (Exim 4.67) (envelope-from ) id 1KFZEO-0003Wr-Rs for emacs-pretest-bug@gnu.org; Sun, 06 Jul 2008 14:45:04 -0400 Received: from Debian-exim by monty-python.gnu.org with spam-scanned (Exim 4.60) (envelope-from ) id 1KFZEh-0000sw-4O for emacs-pretest-bug@gnu.org; Sun, 06 Jul 2008 14:45:26 -0400 Received: from relay03.kiev.sovam.com ([62.64.120.201]:63486) by monty-python.gnu.org with esmtps (TLS-1.0:DHE_RSA_AES_256_CBC_SHA1:32) (Exim 4.60) (envelope-from ) id 1KFZEg-0000sY-Pd for emacs-pretest-bug@gnu.org; Sun, 06 Jul 2008 14:45:22 -0400 Received: from [83.170.232.243] (helo=smtp.svitonline.com) by relay03.kiev.sovam.com with esmtp (Exim 4.67) (envelope-from ) id 1KFZEe-0002AR-Gz for emacs-pretest-bug@gnu.org; Sun, 06 Jul 2008 21:45:20 +0300 From: Juri Linkov To: emacs-pretest-bug@gnu.org Subject: 23.0.60; Unicode search bug Organization: JURTA Date: Sun, 06 Jul 2008 21:43:23 +0300 Message-ID: <87ej66q2os.fsf@jurta.org> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/23.0.60 (x86_64-pc-linux-gnu) MIME-Version: 1.0 Content-Type: text/plain; charset=koi8-r X-Scanner-Signature: 4204c266f86c3e9b1e0231505dd73b85 X-DrWeb-checked: yes X-SpamTest-Envelope-From: juri@jurta.org X-SpamTest-Group-ID: 00000000 X-SpamTest-Header: Trusted X-SpamTest-Info: Profiles 4235s [July 6 2008] X-SpamTest-Info: {received from trusted relay: common white list} X-SpamTest-Method: white ip list X-SpamTest-Rate: 0 X-SpamTest-Status: Trusted X-SpamTest-Status-Extended: trusted X-SpamTest-Version: SMTP-Filter Version 3.0.0 [0278], KAS30/Release X-detected-kernel: by monty-python.gnu.org: FreeBSD 6.x (1) Content-Transfer-Encoding: quoted-printable X-MIME-Autoconverted: from 8bit to quoted-printable by rzlab.ucr.edu id m7RElSFV030658 There is a weird bug in searching Unicode text. The search function fails on Cyrillic letters between codepoints #x0400 and #x041f, but successfully finds a Cyrillic letter between #x0420 and #x042f. I tried to debug this and see that in case of failure it calls `boyer_moore', and in case of successful search it calls `simple_search'. I checked the Unicode properties, but everything seems correct. This bug didn't exist before the Unicode merge. The easiest way to reproduce it: run `emacs -Q', put in the *scratch* buffer the following 4 lines (note the leading space): (search-forward " =F0" nil t) (search-forward " =F2" nil t) =F0 =F2 and type `C-x C-e' after each of first two lines. In GNU Emacs 23.0.60 (x86_64-pc-linux-gnu) Important settings: value of $LC_ALL: nil value of $LC_COLLATE: nil value of $LC_CTYPE: nil value of $LC_MESSAGES: nil value of $LC_MONETARY: nil value of $LC_NUMERIC: nil value of $LC_TIME: nil value of $LANG: en_US.UTF-8 value of $XMODIFIERS: nil locale-coding-system: utf-8-unix default-enable-multibyte-characters: t --=20 Juri Linkov http://www.jurta.org/emacs/ ------------=_1219848004-27756-0 Content-Type: message/rfc822 Content-Disposition: inline X-Spam-Checker-Version: SpamAssassin 3.2.3-bugs.debian.org_2005_01_02 (2007-08-08) on rzlab.ucr.edu X-Spam-Level: X-Spam-Status: No, score=-5.1 required=4.0 tests=AWL,BAYES_00,HAS_BUG_NUMBER autolearn=ham version=3.2.3-bugs.debian.org_2005_01_02 Received: (at 540-done) by emacsbugs.donarmstrong.com; 27 Aug 2008 14:32:55 +0000 Received: from cyd.mit.edu (CYD.MIT.EDU [18.115.2.24]) by rzlab.ucr.edu (8.13.8/8.13.8/Debian-3) with ESMTP id m7REWpHS025941 for <540-done@emacsbugs.donarmstrong.com>; Wed, 27 Aug 2008 07:32:52 -0700 Received: by cyd.mit.edu (Postfix, from userid 1000) id 532A857E32E; Wed, 27 Aug 2008 10:34:40 -0400 (EDT) To: Andreas Schwab Cc: 540-done@emacsbugs.donarmstrong.com, Kenichi Handa Subject: Re: bug#540: 23.0.60; Unicode search bug References: <87wsi3qeiq.fsf@cyd.mit.edu> From: Chong Yidong Date: Wed, 27 Aug 2008 10:34:40 -0400 In-Reply-To: (Andreas Schwab's message of "Wed\, 27 Aug 2008 12\:59\:40 +0200") Message-ID: <87wsi2a5mn.fsf@cyd.mit.edu> User-Agent: Gnus/5.11 (Gnus v5.11) Emacs/22.2.91 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Andreas Schwab writes: > Should be fixed now. Thanks! ------------=_1219848004-27756-0--