From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Cameron Desautels Newsgroups: gmane.emacs.bugs Subject: bug#16046: Bug with Regexp Containing only a Character Class with a Caret Date: Tue, 3 Dec 2013 22:57:56 -0600 Message-ID: NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1 X-Trace: ger.gmane.org 1386151578 15449 80.91.229.3 (4 Dec 2013 10:06:18 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Wed, 4 Dec 2013 10:06:18 +0000 (UTC) To: 16046@debbugs.gnu.org Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Wed Dec 04 11:06:23 2013 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1Vo9LX-0003qe-3E for geb-bug-gnu-emacs@m.gmane.org; Wed, 04 Dec 2013 11:06:19 +0100 Original-Received: from localhost ([::1]:47362 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Vo9LW-0007Pe-Ij for geb-bug-gnu-emacs@m.gmane.org; Wed, 04 Dec 2013 05:06:18 -0500 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:46997) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Vo9LK-0007Mg-Sn for bug-gnu-emacs@gnu.org; Wed, 04 Dec 2013 05:06:11 -0500 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Vo9LI-0001t7-2U for bug-gnu-emacs@gnu.org; Wed, 04 Dec 2013 05:06:06 -0500 Original-Received: from debbugs.gnu.org ([140.186.70.43]:43293) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Vo9LH-0001t3-VP for bug-gnu-emacs@gnu.org; Wed, 04 Dec 2013 05:06:04 -0500 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.80) (envelope-from ) id 1Vo9LH-0006xP-OO for bug-gnu-emacs@gnu.org; Wed, 04 Dec 2013 05:06:03 -0500 X-Loop: help-debbugs@gnu.org Resent-From: Cameron Desautels Original-Sender: "Debbugs-submit" Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Wed, 04 Dec 2013 10:06:03 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: report 16046 X-GNU-PR-Package: emacs X-GNU-PR-Keywords: X-Debbugs-Original-To: bug-gnu-emacs@gnu.org Original-Received: via spool by submit@debbugs.gnu.org id=B.138615154326699 (code B ref -1); Wed, 04 Dec 2013 10:06:03 +0000 Original-Received: (at submit) by debbugs.gnu.org; 4 Dec 2013 10:05:43 +0000 Original-Received: from localhost ([127.0.0.1]:57311 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1Vo9Kw-0006wT-OC for submit@debbugs.gnu.org; Wed, 04 Dec 2013 05:05:43 -0500 Original-Received: from eggs.gnu.org ([208.118.235.92]:49675) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1Vo4XC-0007dR-6E for submit@debbugs.gnu.org; Tue, 03 Dec 2013 23:58:02 -0500 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Vo4XA-0006bg-ND for submit@debbugs.gnu.org; Tue, 03 Dec 2013 23:58:01 -0500 Original-Received: from lists.gnu.org ([2001:4830:134:3::11]:34077) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Vo4XA-0006bc-Kp for submit@debbugs.gnu.org; Tue, 03 Dec 2013 23:58:00 -0500 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:51391) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Vo4X9-0004aq-K4 for bug-gnu-emacs@gnu.org; Tue, 03 Dec 2013 23:58:00 -0500 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Vo4X8-0006b8-HH for bug-gnu-emacs@gnu.org; Tue, 03 Dec 2013 23:57:59 -0500 Original-Received: from mail-pd0-x22c.google.com ([2607:f8b0:400e:c02::22c]:52991) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Vo4X8-0006as-9s for bug-gnu-emacs@gnu.org; Tue, 03 Dec 2013 23:57:58 -0500 Original-Received: by mail-pd0-f172.google.com with SMTP id g10so21666926pdj.3 for ; Tue, 03 Dec 2013 20:57:56 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:date:message-id:subject:from:to:content-type; bh=hPwxv+mv+IFf49T7fvesLI6o+fnNO1xlOKpifH5APiM=; b=QBepIBdTFCQbedAG352ZzUkZurtr0OaAgrPWthwVxL+qcVlWL08muZw2/DIJitPEvn pYXbmzIgvJ5x4+OozLw/QtHNC6JtiKpHGliQY2E51AMRH/E1aFuIBUbRxxx4mi+kXhIR p8d0amyftHOr7MRAOsIswFHbGVRSTVcf6GsbDBcPMwDJiPr+PoT1Jc8qRJ9q+XFnvOxT 8QGafKYffBZ5cmI2BPcgvDs3d15r5UMmhTdshlASMczMVY9bdzuZscyQBv2Uje/vO/V4 eCSAMeKQyh+Gl3GND5bjWcNdf0sslGElc7t1RuBXNiLFHIbj6EMUyeWJEK06hEKOCWiM G4JQ== X-Received: by 10.68.200.33 with SMTP id jp1mr9095923pbc.21.1386133076456; Tue, 03 Dec 2013 20:57:56 -0800 (PST) Original-Received: by 10.70.50.228 with HTTP; Tue, 3 Dec 2013 20:57:56 -0800 (PST) X-detected-operating-system: by eggs.gnu.org: Error: Malformed IPv6 address (bad octet value). X-detected-operating-system: by eggs.gnu.org: Error: Malformed IPv6 address (bad octet value). X-Mailman-Approved-At: Wed, 04 Dec 2013 05:05:39 -0500 X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list X-detected-operating-system: by eggs.gnu.org: GNU/Linux 3.x X-Received-From: 140.186.70.43 X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Original-Sender: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.bugs:81351 Archived-At: Hi all, I've run across a dilemma, in the most literal sense: either there's a problem in Emacs's regexp engine or there's an issue with `regexp-opt-charset`---I'm not sure which. The issue has to do with regular expressions containing character classes with only a caret character. I know this seems like a rather silly case (why not just use "\\^"?) but it came up in the context of trying to track down a bug in ruby-mode, so it does occur in real (and particularly *programmatic*) settings. The simplest case to reproduce is the following: (re-search-forward "[^]") ; => Debugger entered--Lisp error: (invalid-regexp "Unmatched [ or [^") ; re-search-forward("[^]") ; eval((re-search-forward "[^]") nil) ; eval-last-sexp-1(t) ; eval-last-sexp(t) ; eval-print-last-sexp() ; call-interactively(eval-print-last-sexp record nil) ; command-execute(eval-print-last-sexp record) ; execute-extended-command(nil "eval-print-last-sexp") ; call-interactively(execute-extended-command nil nil) Now, you can make a compelling case that that's not a valid regexp (and the Emacs Lisp Reference Manual doesn't seem to *directly* contradict this argument), but that presents a problem when paired with `regexp-opt-charset`: (regexp-opt-charset '(?^)) => "[^]" Note that that produces the problem regexp; which is to say that the following code is bound to fail when it should succeed: (re-search-forward (regexp-opt-charset '(?^))) What's the correct behavior? I'd be happy to offer a patch for either side of the equation but I'm not sure which one to target. All the best. -- Cameron In GNU Emacs 24.3.1 (x86_64-apple-darwin11.4.2, Carbon Version 1.6.0 AppKit 1138.51) of 2013-05-13 on atago Windowing system distributor `Apple Inc.', version 10.9.0 Configured using: `configure '--with-mac' '--enable-mac-app=/Users/xin/Documents/emacs-mac-port/build' '--prefix=/Users/xin/Documents/emacs-mac-port/build'' Important settings: value of $LANG: en_US.UTF-8 locale-coding-system: utf-8-unix default enable-multibyte-characters: t Major mode: Lisp Interaction Minor modes in effect: tooltip-mode: t mouse-wheel-mode: t tool-bar-mode: t menu-bar-mode: t file-name-shadow-mode: t global-font-lock-mode: t font-lock-mode: t auto-composition-mode: t auto-encryption-mode: t auto-compression-mode: t line-number-mode: t transient-mark-mode: t Load-path shadows: /Applications/Emacs.app/Contents/Resources/lisp/.dir-locals hides /Applications/Emacs.app/Contents/Resources/lisp/gnus/.dir-locals Features: (shadow sort gnus-util mail-extr emacsbug message format-spec rfc822 mml mml-sec mm-decode mm-bodies mm-encode mail-parse rfc2231 mailabbrev gmm-utils mailheader sendmail rfc2047 rfc2045 ietf-drums mm-util mail-prsvr mail-utils help-mode easymenu debug time-date tooltip ediff-hook vc-hooks lisp-float-type mwheel mac-win tool-bar dnd fontset image regexp-opt fringe tabulated-list newcomment lisp-mode register page menu-bar rfn-eshadow timer select scroll-bar mouse jit-lock font-lock syntax facemenu font-core frame cham georgian utf-8-lang misc-lang vietnamese tibetan thai tai-viet lao korean japanese hebrew greek romanian slovak czech european ethiopic indian cyrillic chinese case-table epa-hook jka-cmpr-hook help simple abbrev minibuffer loaddefs button faces cus-face macroexp files text-properties overlay sha1 md5 base64 format env code-pages mule custom widget hashtable-print-readable backquote mac multi-tty make-network-process emacs)