all messages for Emacs-related lists mirrored at yhetil.org
 help / color / mirror / code / Atom feed
From: "Clément Pit--Claudel" <clement.pit@gmail.com>
To: Lars Ingebrigtsen <larsi@gnus.org>
Cc: emacs-devel@gnu.org
Subject: Re: Regular expression libraries
Date: Fri, 16 Dec 2016 15:06:13 -0500	[thread overview]
Message-ID: <3b626acd-cb21-9208-a31d-1ee2a2a0fcba@gmail.com> (raw)
In-Reply-To: <m3inqjyj7b.fsf@gnus.org>


[-- Attachment #1.1: Type: text/plain, Size: 1373 bytes --]

On 2016-12-16 10:42, Lars Ingebrigtsen wrote:
> Clément Pit--Claudel <clement.pit@gmail.com> writes:
> 
>> Wait, now I'm confused.  The master branch of glibc *does* include the following in regex.h:
>>
>> extern int __re_search_2
>>   (struct re_pattern_buffer *buffer, const char *string1,
>>    int length1, const char *string2, int length2,
>>    int start, int range, struct re_registers *regs, int stop);
>>
>> … but the implementation just allocates a large buffer and concatenates both strings?
> 
> Yes.  If I remember correctly, a glibc developer popped up here a couple
> of years ago announcing that they were going to do that change (thereby
> making __re_search_2 incredibly slow, but retained for compatibility).

I see, thanks.  This code seems a bit beyond my abilities to easily extend :/
Eli started listing things that we would need from an external library before we could move to it.  AFAICT, there is:

* Support for searching a gap buffer (essentially two strings).
* Support for Emacs' internal utf-8 based encoding
* Support for Emacs' regexp syntax (though we could imagine writing a translator)
* Support for Emacs' syntax properties, and Emacs-specific extensions like matching the position of the point, etc.

An I missing anything else? Which ones of these things does gnulib already have?

Thanks!
Clément.


[-- Attachment #2: OpenPGP digital signature --]
[-- Type: application/pgp-signature, Size: 819 bytes --]

  reply	other threads:[~2016-12-16 20:06 UTC|newest]

Thread overview: 16+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2016-12-15 19:00 Regular expression libraries Clément Pit--Claudel
2016-12-15 20:10 ` Eli Zaretskii
2016-12-15 20:30 ` Paul Eggert
2016-12-15 22:00   ` Andreas Schwab
2016-12-16  7:20     ` Paul Eggert
2016-12-16 14:31       ` Clément Pit--Claudel
2016-12-16 14:54       ` Clément Pit--Claudel
2016-12-16 15:42         ` Lars Ingebrigtsen
2016-12-16 20:06           ` Clément Pit--Claudel [this message]
2016-12-16 21:25             ` Eli Zaretskii
2016-12-16 17:43         ` Paul Eggert
2016-12-15 22:16   ` Clément Pit--Claudel
2016-12-16  5:15 ` Stefan Monnier
2016-12-16 14:41   ` Clément Pit--Claudel
2016-12-16 17:45     ` Paul Eggert
2016-12-16 20:07       ` Clément Pit--Claudel

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=3b626acd-cb21-9208-a31d-1ee2a2a0fcba@gmail.com \
    --to=clement.pit@gmail.com \
    --cc=emacs-devel@gnu.org \
    --cc=larsi@gnus.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
Code repositories for project(s) associated with this external index

	https://git.savannah.gnu.org/cgit/emacs.git
	https://git.savannah.gnu.org/cgit/emacs/org-mode.git

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.