From mboxrd@z Thu Jan 1 00:00:00 1970 Path: main.gmane.org!not-for-mail From: storm@cua.dk (Kim F. Storm) Newsgroups: gmane.emacs.devel Subject: Re: regex and case-fold-search problem Date: 29 Aug 2002 14:33:42 +0200 Sender: emacs-devel-admin@gnu.org Message-ID: <5x8z2pj13t.fsf@kfs2.cua.dk> References: <200208230625.PAA23426@etlken.m17n.org> <200208262151.g7QLpfA12782@wijiji.santafe.edu> <200208290853.RAA03185@etlken.m17n.org> NNTP-Posting-Host: localhost.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Trace: main.gmane.org 1030620867 482 127.0.0.1 (29 Aug 2002 11:34:27 GMT) X-Complaints-To: usenet@main.gmane.org NNTP-Posting-Date: Thu, 29 Aug 2002 11:34:27 +0000 (UTC) Cc: rms@gnu.org, emacs-devel@gnu.org Return-path: Original-Received: from quimby.gnus.org ([80.91.224.244]) by main.gmane.org with esmtp (Exim 3.35 #1 (Debian)) id 17kNZ9-00007M-00 for ; Thu, 29 Aug 2002 13:34:23 +0200 Original-Received: from monty-python.gnu.org ([199.232.76.173]) by quimby.gnus.org with esmtp (Exim 3.12 #1 (Debian)) id 17kO5P-00029R-00 for ; Thu, 29 Aug 2002 14:07:43 +0200 Original-Received: from localhost ([127.0.0.1] helo=monty-python.gnu.org) by monty-python.gnu.org with esmtp (Exim 4.10) id 17kNaX-0005cX-00; Thu, 29 Aug 2002 07:35:49 -0400 Original-Received: from list by monty-python.gnu.org with tmda-scanned (Exim 4.10) id 17kNYB-0005Qm-00 for emacs-devel@gnu.org; Thu, 29 Aug 2002 07:33:23 -0400 Original-Received: from mail by monty-python.gnu.org with spam-scanned (Exim 4.10) id 17kNY9-0005PH-00 for emacs-devel@gnu.org; Thu, 29 Aug 2002 07:33:22 -0400 Original-Received: from mail.filanet.dk ([195.215.206.179]) by monty-python.gnu.org with esmtp (Exim 4.10) id 17kNY8-0005P7-00; Thu, 29 Aug 2002 07:33:20 -0400 Original-Received: from kfs2.cua.dk.cua.dk (kfs2.local.filanet.dk [192.168.1.182]) by mail.filanet.dk (Postfix) with SMTP id 6FBD27C017; Thu, 29 Aug 2002 11:33:19 +0000 (GMT) Original-To: Kenichi Handa In-Reply-To: <200208290853.RAA03185@etlken.m17n.org> Original-Lines: 19 User-Agent: Gnus/5.09 (Gnus v5.9.0) Emacs/21.3.50 Errors-To: emacs-devel-admin@gnu.org X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.0.11 Precedence: bulk List-Help: List-Post: List-Subscribe: , List-Id: Emacs development discussions. List-Unsubscribe: , List-Archive: Xref: main.gmane.org gmane.emacs.devel:7103 X-Report-Spam: http://spam.gmane.org/gmane.emacs.devel:7103 > > Consider the situation that one writes this regexp > "[\000-\xffff]" > to search only Unicode BMP chars in emacs-unicode. I > suspect that, if we implent the above method, compiling this > regexp when case-fold-search is non-nil takes longer time > than people usually expect. > > So, I agree with Stephen that his method is good enough. IMO, it is wrong to handle case-fold-search for regexp ranges by trying to modify the interpretation of the regex range. Instead, the regex matcher should try to upcase and lowercase each character in the string and see if either of these caracters are within the given range. -- Kim F. Storm http://www.cua.dk