From: Eli Zaretskii
Newsgroups: gmane.emacs.bugs
Subject: bug#3687: 23.1.50; inconsistency in multibyte eight-bit regexps [PATCH]
Date: Fri, 28 Jun 2019 19:20:13 +0300
Message-ID: <83pnmxhebm.fsf@gnu.org>
References: <831rzdj1z9.fsf@gnu.org> <6138515E-3202-437D-8341-7A8856AD0AE9@acm.org> <83v9wphixc.fsf@gnu.org> <36CBE596-29AD-4EB8-9CE4-979DA97FFDE3@acm.org>
In-reply-to: <36CBE596-29AD-4EB8-9CE4-979DA97FFDE3@acm.org> (message from Mattias Engdegård on Fri, 28 Jun 2019 17:00:33 +0200)
To: Mattias Engdegård
Cc: monnier@iro.umontreal.ca, 3687@debbugs.gnu.org

> From: Mattias Engdegård
> Date: Fri, 28 Jun 2019 17:00:33 +0200
> Cc: mituharu@math.s.chiba-u.ac.jp, monnier@iro.umontreal.ca,
>  3687@debbugs.gnu.org
>
> On 28 June 2019 at 16:40, Eli Zaretskii wrote:
> >
> > So this means \240 is no longer the same as NBSP and \300 is no longer
> > the same as À?  But \176 is still the same as ~?
>
> This has been the case for quite a while; the patch does not change any of this.
>
> > So you are saying that we will consider the raw bytes as if they
> > followed ASCII characters in the lexicographical order?  But non-ASCII
> > characters whose codepoints start at 0x80? where are they in this
> > order?
>
> Again, this is existing semantics and the patch does not change any of it.
>
> It sounds like you misunderstand the patch, which means that I have been bad at explaining it. It just fixes a few edge cases related to raw bytes in regexp matching. It does not attempt to change existing semantics, other than where they are clearly buggy, such as "\x9f" and "[\x9f]" not being equivalent regexps.

Maybe I did misunderstand: if the patch changes nothing fundamental, then
why did you need to precede it with "principles"?

But since you already pushed the change, I guess there's no reason to
discuss this, and I regret I replied.