From: Stefan Monnier
To: Juri Linkov
Cc: emacs-devel@gnu.org
Newsgroups: gmane.emacs.devel
Subject: Re: eight-bit char handling in emacs-unicode
Date: 19 Nov 2003 08:48:54 -0500

>> Now to tell you the truth, my change had to accept a few (not so) special
>> cases and it took a bit of fiddling to make the code lenient enough to
>> accept elisp code I didn't feel like "fixing".  I can't remember the details
>> off-hand, but I remember having problems with regexp matching functions
>> where multibyte regexps are used in unibyte buffers.

> Do you mean unibyte regexps in multibyte buffers?  For example,

No: multibyte is a superset of unibyte, so there's no problem searching
for unibyte elements in a multibyte sequence.

> currently gnus/message.el has a wrong regexp that prevents Gnus
> from being used in some language environments.  To reproduce this bug,
> you can eval the following:

> (progn
>   (set-language-environment 'ukrainian)
>   (re-search-forward "[\000-\007\013\015-\032\034-\037\200-\237]" nil t))

In my Emacs this doesn't fail because the unibyte string is turned into
multibyte without looking at the coding-system (i.e. it will only match
ASCII and chars from eight-bit-control or eight-bit-graphic: probably not
what the author intended).


        Stefan
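
To make the conversion described above concrete, here is a minimal sketch.
It assumes that `string-to-multibyte' behaves like the implicit conversion
applied to the regexp; that equivalence is an assumption for illustration,
not something checked against the regexp code.  The variable names are
hypothetical.

```elisp
;; Sketch only: `string-to-multibyte' stands in for the implicit
;; unibyte->multibyte conversion discussed above (an assumption).
(let* ((uni "\200\237")                    ; octal escapes give a unibyte string
       (multi (string-to-multibyte uni)))  ; bytes >= #x80 are kept as raw
                                           ; eight-bit chars, no decoding
  (list (multibyte-string-p uni)           ; is the literal multibyte?
        (multibyte-string-p multi)         ; is the converted string multibyte?
        ;; Charset of each converted char; the language environment's
        ;; coding-system plays no part in this conversion.
        (mapcar #'char-charset multi)))
```

Evaluating this under different language environments should give the same
result each time, which is why the character class cannot match the
language-specific characters one might expect from those byte values.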