From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: =?utf-8?B?xaB0xJtww6FuIE7Em21lYw==?= Newsgroups: gmane.emacs.devel Subject: Re: Patch for lookaround assertion in regexp Date: Mon, 23 Jan 2012 19:45:00 +0100 Message-ID: <874nvm1cnn.fsf@gmail.com> References: <009001ccd9c0$9bde09f0$d39a1dd0$@cfraizer.com> NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Trace: dough.gmane.org 1327344324 28112 80.91.229.12 (23 Jan 2012 18:45:24 GMT) X-Complaints-To: usenet@dough.gmane.org NNTP-Posting-Date: Mon, 23 Jan 2012 18:45:24 +0000 (UTC) Cc: Andreas Schwab , Tom , emacs-devel@gnu.org To: Stefan Monnier Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Mon Jan 23 19:45:20 2012 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([140.186.70.17]) by lo.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1RpOtL-0001XX-PG for ged-emacs-devel@m.gmane.org; Mon, 23 Jan 2012 19:45:19 +0100 Original-Received: from localhost ([::1]:36361 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1RpOtK-0002TV-Ul for ged-emacs-devel@m.gmane.org; Mon, 23 Jan 2012 13:45:18 -0500 Original-Received: from eggs.gnu.org ([140.186.70.92]:43726) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1RpOtC-0002T3-SP for emacs-devel@gnu.org; Mon, 23 Jan 2012 13:45:16 -0500 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1RpOtB-0005nt-2x for emacs-devel@gnu.org; Mon, 23 Jan 2012 13:45:10 -0500 Original-Received: from mail-ee0-f41.google.com ([74.125.83.41]:54739) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1RpOtA-0005nI-OC for emacs-devel@gnu.org; Mon, 23 Jan 2012 13:45:09 -0500 Original-Received: by eekc13 with SMTP id c13so1271034eek.0 for ; Mon, 23 Jan 2012 10:45:07 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=from:to:cc:subject:in-reply-to:references:user-agent:date :message-id:mime-version:content-type:content-transfer-encoding; bh=bTdKjjicYnhLQ/t3+s3LRs+wBZeUDvYPHpeRz+oYgw8=; b=DJQYFc3/Mt+ovFpE3IQQKey9vfo34BrIS8mKm06O8nqNF26glkKJ4ldouYD/NLrvEB n2AW4lBNgjEVHJhlueeHkixAsRWtiAbpVLHytWuUpO6BxCPtczuex9WnhQyq4KMlNMSU B+o5Q6joSu3nWgGW/QDxKdAbjWW1JXUJvW8Tg= Original-Received: by 10.14.9.150 with SMTP id 22mr3309043eet.105.1327344307238; Mon, 23 Jan 2012 10:45:07 -0800 (PST) Original-Received: from localhost (176.119.broadband10.iol.cz. [90.177.119.176]) by mx.google.com with ESMTPS id y12sm57000542eeb.11.2012.01.23.10.45.03 (version=TLSv1/SSLv3 cipher=OTHER); Mon, 23 Jan 2012 10:45:04 -0800 (PST) In-Reply-To: (Stefan Monnier's message of "Mon, 23 Jan 2012 12:11:26 -0500") User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/24.0.50 (gnu/linux) X-detected-operating-system: by eggs.gnu.org: Genre and OS details not recognized. X-Received-From: 74.125.83.41 X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:147857 Archived-At: On Mon, 23 Jan 2012 18:11:26 +0100 Stefan Monnier wrote: >>> If it doesn't then it's a job for the translation layer. Char syntaxes >>> and categories could be converted into the standard [...] format. >> Enumerating the syntax/category members is not an option. > > Indeed. > >> There is no easy way to do that. > > For `categories', there is a way, but the result is a *very* large [...] > chunk, so it's impractical. For `syntax' there is indeed no way, since > the syntax of a char doesn't only depend on the char itself but also of > the `char-table' text-property that might be applied to that particular > character position (and of course, if we ignore this problem, we're > still back to the same problem of enormous [...] expressions, as is the > case for categories). > These entities really need to be implemented inside the regexp-engine > (but they're usually pretty easy to implement there). OTOH using something like PCRE would finally fix the currently erroneous implementation of classes like [:space:], which now is the same as \s-. (And personally I would gladly forgo the syntax categories for standard [:classes:], although I imagine the former might be used by the font-locking or somewhere... I never felt the need for them.) --=20 =C5=A0t=C4=9Bp=C3=A1n