From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Artur Malabarba Newsgroups: gmane.emacs.bugs Subject: bug#22090: Isearch is sluggish and eventually refuses further service with "[Too many words]". Date: Fri, 4 Dec 2015 20:49:42 +0000 Message-ID: References: <20151204192126.73199.qmail@mail.muc.de> Reply-To: bruce.connor.am@gmail.com NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable X-Trace: ger.gmane.org 1449262278 25571 80.91.229.3 (4 Dec 2015 20:51:18 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Fri, 4 Dec 2015 20:51:18 +0000 (UTC) Cc: 22090@debbugs.gnu.org To: Alan Mackenzie Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Fri Dec 04 21:51:12 2015 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1a4xJt-0007W4-N0 for geb-bug-gnu-emacs@m.gmane.org; Fri, 04 Dec 2015 21:51:09 +0100 Original-Received: from localhost ([::1]:43431 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1a4xJt-0000eM-7F for geb-bug-gnu-emacs@m.gmane.org; Fri, 04 Dec 2015 15:51:09 -0500 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:39161) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1a4xJp-0000e5-T0 for bug-gnu-emacs@gnu.org; Fri, 04 Dec 2015 15:51:06 -0500 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1a4xJm-0005bO-Ic for bug-gnu-emacs@gnu.org; Fri, 04 Dec 2015 15:51:05 -0500 Original-Received: from debbugs.gnu.org ([208.118.235.43]:49730) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1a4xJm-0005bK-F3 for bug-gnu-emacs@gnu.org; Fri, 04 Dec 2015 15:51:02 -0500 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.80) (envelope-from ) id 1a4xJm-0007TN-3W for bug-gnu-emacs@gnu.org; Fri, 04 Dec 2015 15:51:02 -0500 X-Loop: help-debbugs@gnu.org Resent-From: Artur Malabarba Original-Sender: "Debbugs-submit" Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Fri, 04 Dec 2015 20:51:02 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 22090 X-GNU-PR-Package: emacs X-GNU-PR-Keywords: Original-Received: via spool by 22090-submit@debbugs.gnu.org id=B22090.144926220728663 (code B ref 22090); Fri, 04 Dec 2015 20:51:02 +0000 Original-Received: (at 22090) by debbugs.gnu.org; 4 Dec 2015 20:50:07 +0000 Original-Received: from localhost ([127.0.0.1]:39438 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1a4xIr-0007SE-TO for submit@debbugs.gnu.org; Fri, 04 Dec 2015 15:50:06 -0500 Original-Received: from mail-lf0-f44.google.com ([209.85.215.44]:36376) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1a4xIW-0007RN-7L for 22090@debbugs.gnu.org; Fri, 04 Dec 2015 15:50:03 -0500 Original-Received: by lfs39 with SMTP id 39so116585215lfs.3 for <22090@debbugs.gnu.org>; Fri, 04 Dec 2015 12:49:43 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:reply-to:sender:in-reply-to:references:date:message-id :subject:from:to:cc:content-type:content-transfer-encoding; bh=SQ56wkNJsxr0R42Tv4wXbBGANbxnoRqFhm8xORavPOQ=; b=Xu/a83uvCumIcvmGKuP/iSvwIA5Trgnpy0QeSiiiqpdfdQq3vrc9KZx7do++2iv/pQ k60pHotVU+MHMxpUUwY1zHCB4TNw0WuJSifSwsEYkm/Sb+swbQJiIQ29Whrj6+32sT79 3Eef+8S//4tNITZSvuQSMIgbU2OFz13hOQCtyCv4azt4XmJxPzoBxCESAqW/7n30G7P2 tWjNUfh59dNy5FML1lSrcq63y1H/VhuRClyr/8d71etKJBIRspFFgPAghHzXkqa0SHjb 6QCBLjX2HcUIvuObSGfjiot/ZfVKxfNTZyKRFnRHpJHFsQ6YpBvbi3YtI6zGf2WkHfgx bwoQ== X-Received: by 10.25.18.92 with SMTP id h89mr9353472lfi.54.1449262183026; Fri, 04 Dec 2015 12:49:43 -0800 (PST) Original-Received: by 10.112.202.99 with HTTP; Fri, 4 Dec 2015 12:49:42 -0800 (PST) In-Reply-To: <20151204192126.73199.qmail@mail.muc.de> X-Google-Sender-Auth: 23OWBYE1jMcj3USL1ZwaugSARiM X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list X-detected-operating-system: by eggs.gnu.org: GNU/Linux 3.x X-Received-From: 208.118.235.43 X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Original-Sender: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.bugs:109623 Archived-At: 2015-12-04 19:21 GMT+00:00 Alan Mackenzie : > Would you like any help to sort out these regexps? I have some expertise > in doing this, having half-written fix-re.el, a program which analyses > and corrects just the sort of thing you're talking about. Maybe you can help then. The situation is actually quite simple. We have a regexp for matching anything that 'a' should match (for instance, that might look like "\\(a[=C2=B4`]?\\|[=C3=A1=C3=A0=F0=9D=91=8E]= \\)"), and we have another for matching anything that A could match (e.g. "\\(A[`=C2=B4]?\\|[=C3=81=C3=80]\\)"). When case-fold-search is on the previous code would simply join these regexps with "\\(\\(a[=C2=B4`]?\\|[=C3=A1=C3=A0=F0=9D=91=8E]\\)\\|\\(A[`=C2= =B4]?\\|[=C3=81=C3=80]\\)\\)". The problem is that (when case-fold-search is on) this creates a lot of redundancy. There are two paths in that regexp that match "a", there are two paths that match "=C3=A0" and so on (but it's not full redundancy, for instance, only one path matches =F0=9D=91=8E).