From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Paul Eggert Newsgroups: gmane.emacs.bugs Subject: bug#64128: regexp parser zero-width assertion bugs Date: Sat, 17 Jun 2023 15:18:00 -0700 Organization: UCLA Computer Science Department Message-ID: References: <4A303177-384E-4FEF-98F2-FAB89A12ACC9@gmail.com> Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="------------1RRgvsv6eebocvFVK1FMlHHO" Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="34822"; mail-complaints-to="usenet@ciao.gmane.io" User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:102.0) Gecko/20100101 Thunderbird/102.11.0 Cc: 64128@debbugs.gnu.org To: Mattias =?UTF-8?Q?Engdeg=C3=A5rd?= , Stefan Monnier Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane-mx.org@gnu.org Sun Jun 18 00:19:13 2023 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane-mx.org Original-Received: from lists.gnu.org ([209.51.188.17]) by ciao.gmane.io with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1qAeGG-0008oy-Hc for geb-bug-gnu-emacs@m.gmane-mx.org; Sun, 18 Jun 2023 00:19:12 +0200 Original-Received: from localhost ([::1] helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1qAeG8-0002n0-Rk; Sat, 17 Jun 2023 18:19:04 -0400 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1qAeG6-0002mn-Sl for bug-gnu-emacs@gnu.org; Sat, 17 Jun 2023 18:19:03 -0400 Original-Received: from debbugs.gnu.org ([209.51.188.43]) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.90_1) (envelope-from ) id 1qAeG6-0007ss-K4 for bug-gnu-emacs@gnu.org; Sat, 17 Jun 2023 18:19:02 -0400 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84_2) (envelope-from ) id 1qAeG6-0005CB-FR for bug-gnu-emacs@gnu.org; Sat, 17 Jun 2023 18:19:02 -0400 X-Loop: help-debbugs@gnu.org Resent-From: Paul Eggert Original-Sender: "Debbugs-submit" Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Sat, 17 Jun 2023 22:19:02 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 64128 X-GNU-PR-Package: emacs Original-Received: via spool by 64128-submit@debbugs.gnu.org id=B64128.168704029419906 (code B ref 64128); Sat, 17 Jun 2023 22:19:02 +0000 Original-Received: (at 64128) by debbugs.gnu.org; 17 Jun 2023 22:18:14 +0000 Original-Received: from localhost ([127.0.0.1]:52768 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1qAeFJ-0005Az-RX for submit@debbugs.gnu.org; Sat, 17 Jun 2023 18:18:14 -0400 Original-Received: from mail.cs.ucla.edu ([131.179.128.66]:42434) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1qAeFE-0005AX-7K for 64128@debbugs.gnu.org; Sat, 17 Jun 2023 18:18:11 -0400 Original-Received: from localhost (localhost [127.0.0.1]) by mail.cs.ucla.edu (Postfix) with ESMTP id D88A63C020F7C; Sat, 17 Jun 2023 15:18:01 -0700 (PDT) Original-Received: from mail.cs.ucla.edu ([127.0.0.1]) by localhost (mail.cs.ucla.edu [127.0.0.1]) (amavisd-new, port 10032) with ESMTP id MWbmBjryrD6R; Sat, 17 Jun 2023 15:18:01 -0700 (PDT) Original-Received: from localhost (localhost [127.0.0.1]) by mail.cs.ucla.edu (Postfix) with ESMTP id 5A4EC3C09FB41; Sat, 17 Jun 2023 15:18:01 -0700 (PDT) DKIM-Filter: OpenDKIM Filter v2.10.3 mail.cs.ucla.edu 5A4EC3C09FB41 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=cs.ucla.edu; s=9D0B346E-2AEB-11ED-9476-E14B719DCE6C; t=1687040281; bh=llkGHMH5dhtUMlpsC3ITK8uJutGfIZ+L7wv80PaLO6U=; h=Message-ID:Date:MIME-Version:To:From; b=DQDwkBAMdO1A/5L8WFJ1gqDN1uVzajzq8JXi7h6LcB+dbYTs8//rwgzZ4WhXFj1G2 zmeXBIWXVN1BQsh/rPcqZs1JdtqCuEwHIP4s8JD0EPb6Z9Sl+1bK5qiWpKl5UfHZwU By+9dsxNuflrDvWsaXweRz5H11fpLkHeIRXl39OHs/7zZ8jchUcAGA5kpuS0CMfRTR Dv+36yMtNyU4lo9V/8WprxrpoCLu11tO5SIuOikg9MqfzgotZM1fMGDfV0l+d96r5p IeWdBzSZS3Fy6JtN19chDkqHSAN//6uo7UTnsc/WsE3CXa5rzEdesMRBYnrG1wJrsQ R7rhUvFX8Gd+g== X-Virus-Scanned: amavisd-new at mail.cs.ucla.edu Original-Received: from mail.cs.ucla.edu ([127.0.0.1]) by localhost (mail.cs.ucla.edu [127.0.0.1]) (amavisd-new, port 10026) with ESMTP id XRBrUbu5fpBk; Sat, 17 Jun 2023 15:18:01 -0700 (PDT) Original-Received: from [192.168.1.9] (cpe-172-91-119-151.socal.res.rr.com [172.91.119.151]) by mail.cs.ucla.edu (Postfix) with ESMTPSA id 2B87C3C020F7C; Sat, 17 Jun 2023 15:18:01 -0700 (PDT) Content-Language: en-US In-Reply-To: <4A303177-384E-4FEF-98F2-FAB89A12ACC9@gmail.com> X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane-mx.org@gnu.org Original-Sender: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane-mx.org@gnu.org Xref: news.gmane.io gmane.emacs.bugs:263582 Archived-At: This is a multi-part message in MIME format. --------------1RRgvsv6eebocvFVK1FMlHHO Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: quoted-printable On 2023-06-17 13:07, Mattias Engdeg=C3=A5rd wrote: > 17 juni 2023 kl. 20.44 skrev Stefan Monnier : >=20 >> I think the behavior that makes most sense is to signal an error when >> compiling the regexp. >=20 > Clearly, but some behaviour needs to be preserved for compatibility. > Regexps like "^*" aren't uncommon. Can it be generalised in a useful wa= y? >=20 doc/lispref/searching.texi says that "*" is treated as an ordinary=20 character if it is in a context where its special meaning makes no=20 sense, giving "*foo" as an example. If we break with this tradition by=20 making "\b*" an error instead of being equivalent to "\b\*", we should=20 update that part of the manual. One possible way forward is to update doc/lispref/searching.texi to=20 specify what we want. Then we can modify the code to match the updated=20 documentation. In my experience, modifying the doc is often the hard part, so I took a=20 crack at that in the draft proposed patch, which I have not installed. Comments? --------------1RRgvsv6eebocvFVK1FMlHHO Content-Type: text/x-patch; charset=UTF-8; name="0001-Document-that-b-etc-are-now-invalid-regexps.patch" Content-Disposition: attachment; filename="0001-Document-that-b-etc-are-now-invalid-regexps.patch" Content-Transfer-Encoding: base64 RnJvbSBlNGZjMzY5YTYyNGQ4NTAyN2QzOWE0MjRhNTA3NTA3ZGEwMGYyNmFhIE1vbiBTZXAg MTcgMDA6MDA6MDAgMjAwMQpGcm9tOiBQYXVsIEVnZ2VydCA8ZWdnZXJ0QGNzLnVjbGEuZWR1 PgpEYXRlOiBTYXQsIDE3IEp1biAyMDIzIDE1OjA1OjQyIC0wNzAwClN1YmplY3Q6IFtQUk9Q T1NFRF0gRG9jdW1lbnQgdGhhdCBcYiogZXRjIGFyZSBub3cgaW52YWxpZCByZWdleHBzCgot LS0KIGRvYy9saXNwcmVmL3NlYXJjaGluZy50ZXhpIHwgMjQgKysrKysrKysrKysrKysrKy0t LS0tLS0tCiBldGMvTkVXUyAgICAgICAgICAgICAgICAgICB8ICA2ICsrKysrKwogMiBmaWxl cyBjaGFuZ2VkLCAyMiBpbnNlcnRpb25zKCspLCA4IGRlbGV0aW9ucygtKQoKZGlmZiAtLWdp dCBhL2RvYy9saXNwcmVmL3NlYXJjaGluZy50ZXhpIGIvZG9jL2xpc3ByZWYvc2VhcmNoaW5n LnRleGkKaW5kZXggYjhkOTA5NGIyOC4uZmQ0ZGZjYmQ3MSAxMDA2NDQKLS0tIGEvZG9jL2xp c3ByZWYvc2VhcmNoaW5nLnRleGkKKysrIGIvZG9jL2xpc3ByZWYvc2VhcmNoaW5nLnRleGkK QEAgLTMzMiw2ICszMzIsMTAgQEAgUmVnZXhwIFNwZWNpYWwKIGV4cHJlc3Npb24uICBUaHVz LCBAc2FtcHtmbyp9IGhhcyBhIHJlcGVhdGluZyBAc2FtcHtvfSwgbm90IGEgcmVwZWF0aW5n CiBAc2FtcHtmb30uICBJdCBtYXRjaGVzIEBzYW1we2Z9LCBAc2FtcHtmb30sIEBzYW1we2Zv b30sIGFuZCBzbyBvbi4KIAorQHNhbXB7Kn0gY2Fubm90IGltbWVkaWF0ZWx5IGZvbGxvdyBh IGJhY2tzbGFzaCBlc2NhcGUgdGhhdCBtYXRjaGVzCitvbmx5IGVtcHR5IHN0cmluZ3MsIGFz IHRoaXMgaXMgdG9vIGxpa2VseSB0byBiZSBhIHR5cG8uICBGb3IgZXhhbXBsZSwKK0BzYW1w e1w8Kn0gaXMgaW52YWxpZC4KKwogQGNpbmRleCBiYWNrdHJhY2tpbmcgYW5kIHJlZ3VsYXIg ZXhwcmVzc2lvbnMKIFRoZSBtYXRjaGVyIHByb2Nlc3NlcyBhIEBzYW1weyp9IGNvbnN0cnVj dCBieSBtYXRjaGluZywgaW1tZWRpYXRlbHksIGFzCiBtYW55IHJlcGV0aXRpb25zIGFzIGNh biBiZSBmb3VuZC4gIFRoZW4gaXQgY29udGludWVzIHdpdGggdGhlIHJlc3Qgb2YKQEAgLTUw NSw5ICs1MDksMTAgQEAgUmVnZXhwIFNwZWNpYWwKIFdoZW4gbWF0Y2hpbmcgYSBzdHJpbmcg aW5zdGVhZCBvZiBhIGJ1ZmZlciwgQHNhbXB7Xn0gbWF0Y2hlcyBhdCB0aGUKIGJlZ2lubmlu ZyBvZiB0aGUgc3RyaW5nIG9yIGFmdGVyIGEgbmV3bGluZSBjaGFyYWN0ZXIuCiAKLUZvciBo aXN0b3JpY2FsIGNvbXBhdGliaWxpdHkgcmVhc29ucywgQHNhbXB7Xn0gY2FuIGJlIHVzZWQg b25seSBhdCB0aGUKK0ZvciBoaXN0b3JpY2FsIGNvbXBhdGliaWxpdHksIEBzYW1we159IGlz IHNwZWNpYWwgb25seSBhdCB0aGUKIGJlZ2lubmluZyBvZiB0aGUgcmVndWxhciBleHByZXNz aW9uLCBvciBhZnRlciBAc2FtcHtcKH0sIEBzYW1we1woPzp9Ci1vciBAc2FtcHtcfH0uCitv ciBAc2FtcHtcfH0uICBJbiBvdGhlciBjb250ZXh0cyBpdCBpcyBhbiBvcmRpbmFyeSBjaGFy YWN0ZXIsIGV4Y2VwdAorZm9yIGl0cyBzcGVjaWFsIG1lYW5pbmcgYXQgdGhlIHN0YXJ0IG9m IGEgY2hhcmFjdGVyIGFsdGVybmF0aXZlLgogCiBAaXRlbSBAc2FtcHskfQogQGNpbmRleCBA c2FtcHskfSBpbiByZWdleHAKQEAgLTUxOSw4ICs1MjQsOSBAQCBSZWdleHAgU3BlY2lhbAog V2hlbiBtYXRjaGluZyBhIHN0cmluZyBpbnN0ZWFkIG9mIGEgYnVmZmVyLCBAc2FtcHskfSBt YXRjaGVzIGF0IHRoZSBlbmQKIG9mIHRoZSBzdHJpbmcgb3IgYmVmb3JlIGEgbmV3bGluZSBj aGFyYWN0ZXIuCiAKLUZvciBoaXN0b3JpY2FsIGNvbXBhdGliaWxpdHkgcmVhc29ucywgQHNh bXB7JH0gY2FuIGJlIHVzZWQgb25seSBhdCB0aGUKK0ZvciBoaXN0b3JpY2FsIGNvbXBhdGli aWxpdHksIEBzYW1weyR9IGlzIHNwZWNpYWwgb25seSBhdCB0aGUKIGVuZCBvZiB0aGUgcmVn dWxhciBleHByZXNzaW9uLCBvciBiZWZvcmUgQHNhbXB7XCl9IG9yIEBzYW1we1x8fS4KK0lu IG90aGVyIGNvbnRleHRzIGl0IGlzIGFuIG9yZGluYXJ5IGNoYXJhY3Rlci4KIAogQGl0ZW0g QHNhbXB7XH0KIEBjaW5kZXggQHNhbXB7XH0gaW4gcmVnZXhwCkBAIC01NDEsMTEgKzU0Nywx MyBAQCBSZWdleHAgU3BlY2lhbAogQGVuZCB0YWJsZQogCiBAc3Ryb25ne1BsZWFzZSBub3Rl On0gRm9yIGhpc3RvcmljYWwgY29tcGF0aWJpbGl0eSwgc3BlY2lhbCBjaGFyYWN0ZXJzCi1h cmUgdHJlYXRlZCBhcyBvcmRpbmFyeSBvbmVzIGlmIHRoZXkgYXJlIGluIGNvbnRleHRzIHdo ZXJlIHRoZWlyIHNwZWNpYWwKLW1lYW5pbmdzIG1ha2Ugbm8gc2Vuc2UuICBGb3IgZXhhbXBs ZSwgQHNhbXB7KmZvb30gdHJlYXRzIEBzYW1weyp9IGFzCi1vcmRpbmFyeSBzaW5jZSB0aGVy ZSBpcyBubyBwcmVjZWRpbmcgZXhwcmVzc2lvbiBvbiB3aGljaCB0aGUgQHNhbXB7Kn0KLWNh biBhY3QuICBJdCBpcyBwb29yIHByYWN0aWNlIHRvIGRlcGVuZCBvbiB0aGlzIGJlaGF2aW9y OyBxdW90ZSB0aGUKLXNwZWNpYWwgY2hhcmFjdGVyIGFueXdheSwgcmVnYXJkbGVzcyBvZiB3 aGVyZSBpdCBhcHBlYXJzLgorYXJlIHRyZWF0ZWQgYXMgb3JkaW5hcnkgb25lcyBpZiB0aGV5 IHdvdWxkIG90aGVyd2lzZSBzdGFydCByZXBldGl0aW9uCitvcGVyYXRvcnMgZWl0aGVyIGF0 IHRoZSBzdGFydCBvZiBhIHJlZ3VsYXIgZXhwcmVzc2lvbiwgb3IgYWZ0ZXIKK0BzYW1we159 LCBAc2FtcHtcKH0sIEBzYW1we1woPzp9IG9yIEBzYW1we1x8fS4gIEZvciBleGFtcGxlLAor QHNhbXB7KmZvb30gaXMgdHJlYXRlZCBhcyBAc2FtcHtcKmZvb30sIGFuZCBAc2FtcHt0d29c fF5cQHsyXEB9fSBpcwordHJlYXRlZCBhcyBAc2FtcHt0d29cfF5AezJAfX0uICBJdCBpcyBw b29yIHByYWN0aWNlIHRvIGRlcGVuZCBvbiB0aGlzCitiZWhhdmlvcjsgdXNlIHByb3BlciBi YWNrc2xhc2ggZXNjYXBpbmcgYW55d2F5LCByZWdhcmRsZXNzIG9mIHdoZXJlCit0aGUgc3Bl Y2lhbCBjaGFyYWN0ZXIgYXBwZWFycy4KIAogQXMgYSBAc2FtcHtcfSBpcyBub3Qgc3BlY2lh bCBpbnNpZGUgYSBjaGFyYWN0ZXIgYWx0ZXJuYXRpdmUsIGl0IGNhbgogbmV2ZXIgcmVtb3Zl IHRoZSBzcGVjaWFsIG1lYW5pbmcgb2YgQHNhbXB7LX0sIEBzYW1we159IG9yIEBzYW1we119 LgpkaWZmIC0tZ2l0IGEvZXRjL05FV1MgYi9ldGMvTkVXUwppbmRleCA2MWU2ZTE2MTY2Li4w YzQ4ODlmOWE2IDEwMDY0NAotLS0gYS9ldGMvTkVXUworKysgYi9ldGMvTkVXUwpAQCAtNDM2 LDYgKzQzNiwxMiBAQCBQcmV2aW91c2x5LCAnXHgnIHdpdGhvdXQgYXQgbGVhc3Qgb25lIGhl eCBkaWdpdCBkZW5vdGVkIGNoYXJhY3RlciBjb2RlCiB6ZXJvIChOVUwpIGJ1dCBhcyB0aGlz IHdhcyBuZWl0aGVyIGludGVuZGVkIG5vciBkb2N1bWVudGVkIG9yIGV2ZW4KIGtub3duIGJ5 IGFueW9uZSwgaXQgaXMgbm93IHRyZWF0ZWQgYXMgYW4gZXJyb3IgYnkgdGhlIExpc3AgcmVh ZGVyLgogCis9PT0KKyoqIEluIHJlZ3VsYXIgZXhwcmVzc2lvbnMsIHplcm8td2lkdGggYmFj a3NsYXNoIGVzY2FwZXMgY2FuIG5vIGxvbmdlcgorYmUgZm9sbG93ZWQgYnkgcmVwZXRpdGlv biBvcGVyYXRvcnMuICBGb3IgZXhhbXBsZSwgJ1xiKicgaXMgbm8gbG9uZ2VyCithIHZhbGlk IHJlZ3VsYXIgZXhwcmVzc2lvbi4gIFByZXZpb3VzbHkgdGhlIGJlaGF2aW9yIHdhcyBlcnJh dGljIGZvcgordGhlc2UgY29uc3RydWN0cywgYW5kIHRoZXkgd2VyZSB0eXBpY2FsbHkgdHlw b3MgYW55d2F5LgorCiAtLS0KICoqIENvbm5lY3Rpb24tbG9jYWwgdmFyaWFibGVzIGFyZSBh cHBsaWVkIGluIGJ1ZmZlcnMgdmlzaXRpbmcgYSByZW1vdGUgZmlsZS4KIFRoaXMgb3ZlcnJp ZGVzIHBvc3NpYmxlIGRpcmVjdG9yeS1sb2NhbCBvciBmaWxlLWxvY2FsIHZhcmlhYmxlcyB3 aXRoCi0tIAoyLjM5LjIKCg== --------------1RRgvsv6eebocvFVK1FMlHHO--