From: Paul Eggert
Newsgroups: gmane.emacs.devel
Subject: Re: Fixing ill-conditioned regular expressions. Proof of concept.
Date: Mon, 23 Feb 2015 14:19:03 -0800
Organization: UCLA Computer Science Department
Message-ID: <54EBA757.5030901@cs.ucla.edu>
References: <20150223181205.GA2861@acm.fritz.box> <54EB85AC.1030800@cs.ucla.edu> <20150223202114.GB2861@acm.fritz.box>
In-Reply-To: <20150223202114.GB2861@acm.fritz.box>
To: Alan Mackenzie
Cc: emacs-devel@gnu.org

On 02/23/2015 12:21 PM, Alan Mackenzie wrote:
> basically, I've got little idea about regexp engines.

That's OK, if you prefer a source-to-source transformation then you can use that instead, but the point is that this should be done for all uses of the regexp code, not just for some of them.

The Emacs regexp code isn't Perl-inspired, as far as I know. It's an old copy of the glibc code, with a lot of hacks. The glibc version mutated quite a bit when it added i18n support, and Emacs's version has mutated in different ways.