From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!.POSTED.blaine.gmane.org!not-for-mail From: Mattias =?UTF-8?Q?Engdeg=C3=A5rd?= Newsgroups: gmane.emacs.bugs Subject: bug#34492: rx: ASCII-raw byte ranges comprise all of Unicode Date: Fri, 15 Feb 2019 19:23:56 +0100 Message-ID: Mime-Version: 1.0 (Mac OS X Mail 12.2 \(3445.102.3\)) Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Injection-Info: blaine.gmane.org; posting-host="blaine.gmane.org:195.159.176.226"; logging-data="176995"; mail-complaints-to="usenet@blaine.gmane.org" To: 34492@debbugs.gnu.org Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Fri Feb 15 19:25:11 2019 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([209.51.188.17]) by blaine.gmane.org with esmtps (TLS1.0:RSA_AES_256_CBC_SHA1:256) (Exim 4.89) (envelope-from ) id 1guiAg-000jsH-BD for geb-bug-gnu-emacs@m.gmane.org; Fri, 15 Feb 2019 19:25:10 +0100 Original-Received: from localhost ([127.0.0.1]:44418 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1guiAf-0008ON-9e for geb-bug-gnu-emacs@m.gmane.org; Fri, 15 Feb 2019 13:25:09 -0500 Original-Received: from eggs.gnu.org ([209.51.188.92]:51440) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1guiAZ-0008OH-CX for bug-gnu-emacs@gnu.org; Fri, 15 Feb 2019 13:25:04 -0500 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1guiAY-0004ad-KQ for bug-gnu-emacs@gnu.org; Fri, 15 Feb 2019 13:25:03 -0500 Original-Received: from debbugs.gnu.org ([209.51.188.43]:50497) by eggs.gnu.org with esmtps (TLS1.0:RSA_AES_128_CBC_SHA1:16) (Exim 4.71) (envelope-from ) id 1guiAY-0004a3-G7 for bug-gnu-emacs@gnu.org; Fri, 15 Feb 2019 13:25:02 -0500 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84_2) (envelope-from ) id 1guiAY-0000xA-8c for bug-gnu-emacs@gnu.org; Fri, 15 Feb 2019 13:25:02 -0500 X-Loop: help-debbugs@gnu.org Resent-From: Mattias =?UTF-8?Q?Engdeg=C3=A5rd?= Original-Sender: "Debbugs-submit" Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Fri, 15 Feb 2019 18:25:02 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: report 34492 X-GNU-PR-Package: emacs X-Debbugs-Original-To: bug-gnu-emacs@gnu.org Original-Received: via spool by submit@debbugs.gnu.org id=B.15502550523602 (code B ref -1); Fri, 15 Feb 2019 18:25:02 +0000 Original-Received: (at submit) by debbugs.gnu.org; 15 Feb 2019 18:24:12 +0000 Original-Received: from localhost ([127.0.0.1]:49778 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1gui9k-0000w2-E0 for submit@debbugs.gnu.org; Fri, 15 Feb 2019 13:24:12 -0500 Original-Received: from eggs.gnu.org ([209.51.188.92]:36948) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1gui9h-0000vp-VX for submit@debbugs.gnu.org; Fri, 15 Feb 2019 13:24:10 -0500 Original-Received: from lists.gnu.org ([209.51.188.17]:52862) by eggs.gnu.org with esmtps (TLS1.0:RSA_AES_256_CBC_SHA1:32) (Exim 4.71) (envelope-from ) id 1gui9c-0003VR-0H for submit@debbugs.gnu.org; Fri, 15 Feb 2019 13:24:04 -0500 Original-Received: from eggs.gnu.org ([209.51.188.92]:51330) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1gui9b-0008Hr-9H for bug-gnu-emacs@gnu.org; Fri, 15 Feb 2019 13:24:03 -0500 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1gui9a-0003Tu-IT for bug-gnu-emacs@gnu.org; Fri, 15 Feb 2019 13:24:03 -0500 Original-Received: from mail231c50.megamailservers.eu ([91.136.10.241]:53996 helo=mail37c50.megamailservers.eu) by eggs.gnu.org with esmtps (TLS1.0:DHE_RSA_AES_256_CBC_SHA1:32) (Exim 4.71) (envelope-from ) id 1gui9Z-0003Qb-Vo for bug-gnu-emacs@gnu.org; Fri, 15 Feb 2019 13:24:02 -0500 X-Authenticated-User: mattiase@bredband.net DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=megamailservers.eu; s=maildub; t=1550255039; bh=7UmGRP0xspsdwVJlB168r9FO+5mkgWyXVqpbeYkOgmc=; h=From:Subject:Date:To:From; b=TmySxn0l9z8uKky7VcLUNmNYBIlgh8Ej1kf1PjwhFrYTZXLVk1nZDOLvH5TOKKRoa 0U56sHrBsq3nsz7L2fgvwcKqusWAPBRPy6HiLnbctDKta5orSSIRjNoxWeZGRNmY8Q oY2KCr9GYrzdCj265f9Koj9ZvRqkJCT8abAzCUm0= Feedback-ID: mattiase@acm.or Original-Received: from [192.168.0.4] (c83-251-8-17.bredband.comhem.se [83.251.8.17]) (authenticated bits=0) by mail37c50.megamailservers.eu (8.14.9/8.13.1) with ESMTP id x1FINuFU000977 for ; Fri, 15 Feb 2019 18:23:59 +0000 X-Mailer: Apple Mail (2.3445.102.3) X-CTCH-RefID: str=0001.0A0B0204.5C6703BF.0030, ss=1, re=0.000, recu=0.000, reip=0.000, cl=1, cld=1, fgs=0 X-CTCH-VOD: Unknown X-CTCH-Spam: Unknown X-CTCH-Score: 0.000 X-CTCH-Flags: 0 X-CTCH-ScoreCust: 0.000 X-CSC: 0 X-CHA: v=2.3 cv=J+uEEjvS c=1 sm=1 tr=0 a=NAHmi3I8mP0S/Y8gRKeQyA==:117 a=NAHmi3I8mP0S/Y8gRKeQyA==:17 a=IkcTkHD0fZMA:10 a=aFYkK34zXO-9mYGBXWIA:9 a=QEXdDO2ut3YA:10 X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x (no timestamps) [generic] X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6.x X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-Received-From: 209.51.188.43 X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Original-Sender: "bug-gnu-emacs" Xref: news.gmane.org gmane.emacs.bugs:155440 Archived-At: `rx' incorrectly considers character ranges between ASCII and raw bytes = to cover all codes in-between, which includes all non-ASCII Unicode = chars. This causes (any "\000-\377" ?=C3=85) to be simplified to (any = "\000-\377"), which is not at all the same thing: [\000-\377] really = means [\000-\177\200-\377] -- the transformation is normally made by the = Emacs regexp engine. The two ranges are not contiguous on the character = code level. It's a sleeper bug that was awakened by my fixing bug#33205, so I'm to = blame for not checking this.