From mboxrd@z Thu Jan 1 00:00:00 1970 Path: main.gmane.org!not-for-mail From: Kai.Grossjohann@CS.Uni-Dortmund.DE (Kai =?iso-8859-15?q?Gro=DFjohann?=) Newsgroups: gmane.emacs.devel Subject: Re: regex and case-fold-search problem Date: Sat, 24 Aug 2002 12:40:01 +0200 Sender: emacs-devel-admin@gnu.org Message-ID: References: <200208230625.PAA23426@etlken.m17n.org> <200208231736.g7NHafW02174@rum.cs.yale.edu> NNTP-Posting-Host: localhost.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Trace: main.gmane.org 1030185739 13577 127.0.0.1 (24 Aug 2002 10:42:19 GMT) X-Complaints-To: usenet@main.gmane.org NNTP-Posting-Date: Sat, 24 Aug 2002 10:42:19 +0000 (UTC) Cc: Kenichi Handa , emacs-devel@gnu.org Return-path: Original-Received: from quimby.gnus.org ([80.91.224.244]) by main.gmane.org with esmtp (Exim 3.35 #1 (Debian)) id 17iYN0-0003Ws-00 for ; Sat, 24 Aug 2002 12:42:18 +0200 Original-Received: from monty-python.gnu.org ([199.232.76.173]) by quimby.gnus.org with esmtp (Exim 3.12 #1 (Debian)) id 17iYqo-0007Zr-00 for ; Sat, 24 Aug 2002 13:13:06 +0200 Original-Received: from localhost ([127.0.0.1] helo=monty-python.gnu.org) by monty-python.gnu.org with esmtp (Exim 4.10) id 17iYOD-0005G5-00; Sat, 24 Aug 2002 06:43:33 -0400 Original-Received: from list by monty-python.gnu.org with tmda-scanned (Exim 4.10) id 17iYL1-0004f6-00 for emacs-devel@gnu.org; Sat, 24 Aug 2002 06:40:15 -0400 Original-Received: from mail by monty-python.gnu.org with spam-scanned (Exim 4.10) id 17iYKy-0004eT-00 for emacs-devel@gnu.org; Sat, 24 Aug 2002 06:40:14 -0400 Original-Received: from waldorf.cs.uni-dortmund.de ([129.217.4.42]) by monty-python.gnu.org with esmtp (Exim 4.10) id 17iYKw-0004dA-00 for emacs-devel@gnu.org; Sat, 24 Aug 2002 06:40:11 -0400 Original-Received: from lothlorien.cs.uni-dortmund.de (lothlorien [129.217.19.67]) by waldorf.cs.uni-dortmund.de with ESMTP id g7OAe7b05553; Sat, 24 Aug 2002 12:40:07 +0200 (MES) Original-Received: from lucy.cs.uni-dortmund.de (lucy [129.217.19.80]) by lothlorien.cs.uni-dortmund.de id MAA15900; Sat, 24 Aug 2002 12:40:02 +0200 (MET DST) Original-Received: by lucy.cs.uni-dortmund.de (Postfix, from userid 6104) id 80C4E3B203; Sat, 24 Aug 2002 12:40:01 +0200 (CEST) Original-To: "Stefan Monnier" In-Reply-To: <200208231736.g7NHafW02174@rum.cs.yale.edu> ("Stefan Monnier"'s message of "Fri, 23 Aug 2002 13:36:41 -0400") Original-Lines: 23 User-Agent: Gnus/5.090008 (Oort Gnus v0.08) Emacs/21.3.50 (i686-pc-linux-gnu) Errors-To: emacs-devel-admin@gnu.org X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.0.11 Precedence: bulk List-Help: List-Post: List-Subscribe: , List-Id: Emacs development discussions. List-Unsubscribe: , List-Archive: Xref: main.gmane.org gmane.emacs.devel:6838 X-Report-Spam: http://spam.gmane.org/gmane.emacs.devel:6838 "Stefan Monnier" writes: > For ASCII it's pretty easy to fix. But for other charsets, it's > indeed more tricky. Maybe we can simply use the smallest contiguous > range of chars that includes all the chars we should match, > so the behavior is indeed "implementation-defined" (in the sense > that it's not necessarily obvious to the user what happens) but > it's at least less confusing (in the sense that (case-fold-search t) > matches at least as much as (case-fold-search nil)). My first intuition would be to take all the characters in the range [A-_] (preserving case), then to "double" each character that has an uppercase and a lowercase variant. So we are talking about the characters "ABCDEFGHIJKLMNOPQRSTXYZ[\]^_" for the given range, and now we make a case-insensitive variant of this list of characters. Does this make sense? Is it feasible to implement? kai -- A large number of young women don't trust men with beards. (BFBS Radio)