From mboxrd@z Thu Jan 1 00:00:00 1970 Path: main.gmane.org!not-for-mail From: Tom Lord Newsgroups: gmane.lisp.guile.user Subject: Re: Stupid module and pregexp questions Date: Mon, 5 May 2003 13:19:55 -0700 (PDT) Sender: guile-user-bounces+guile-user=m.gmane.org@gnu.org Message-ID: <200305052019.NAA12930@morrowfield.regexps.com> References: <877k9eobcv.fsf@raven.i.defaultvalue.org> <200304292321.QAA04172@morrowfield.regexps.com> <877k96htat.fsf@raven.i.defaultvalue.org> <200305050618.XAA10052@morrowfield.regexps.com> <87of2h25tj.fsf@raven.i.defaultvalue.org> <87r87dyykc.fsf@raven.i.defaultvalue.org> NNTP-Posting-Host: main.gmane.org X-Trace: main.gmane.org 1052167514 16714 80.91.224.249 (5 May 2003 20:45:14 GMT) X-Complaints-To: usenet@main.gmane.org NNTP-Posting-Date: Mon, 5 May 2003 20:45:14 +0000 (UTC) Cc: guile-user@gnu.org Original-X-From: guile-user-bounces+guile-user=m.gmane.org@gnu.org Mon May 05 22:45:13 2003 Return-path: Original-Received: from mail-relay.eunet.no ([193.71.71.242]) by main.gmane.org with esmtp (Exim 3.35 #1 (Debian)) id 19Cmoa-0004GZ-00 for ; Mon, 05 May 2003 22:44:08 +0200 Original-Received: from monty-python.gnu.org (monty-python.gnu.org [199.232.76.173]) by mail-relay.eunet.no (8.12.2/8.12.2/GN) with ESMTP id h45KjLxP065992 for ; Mon, 5 May 2003 22:45:22 +0200 (CEST) (envelope-from guile-user-bounces+guile-user=m.gmane.org@gnu.org) Original-Received: from localhost ([127.0.0.1] helo=monty-python.gnu.org) by monty-python.gnu.org with esmtp (Exim 4.10.13) id 19Cmmj-0004fT-01 for guile-user@m.gmane.org; Mon, 05 May 2003 16:42:05 -0400 Original-Received: from list by monty-python.gnu.org with tmda-scanned (Exim 4.10.13) id 19Cmg5-0002UA-00 for guile-user@gnu.org; Mon, 05 May 2003 16:35:13 -0400 Original-Received: from mail by monty-python.gnu.org with spam-scanned (Exim 4.10.13) id 19CmXI-0008W2-00 for guile-user@gnu.org; Mon, 05 May 2003 16:26:11 -0400 Original-Received: from 1cust86.tnt13.sfo8.da.uu.net ([65.234.195.86] helo=morrowfield.regexps.com) by monty-python.gnu.org with esmtp (Exim 4.10.13) id 19CmE8-0004gN-00 for guile-user@gnu.org; Mon, 05 May 2003 16:06:20 -0400 Original-Received: (from lord@localhost) by morrowfield.regexps.com (8.9.1/8.9.1) id NAA12930; Mon, 5 May 2003 13:19:55 -0700 (PDT) (envelope-from lord@morrowfield.regexps.com) Original-To: rlb@defaultvalue.org In-reply-to: <87r87dyykc.fsf@raven.i.defaultvalue.org> (message from Rob Browning on Mon, 05 May 2003 14:37:39 -0500) X-BeenThere: guile-user@gnu.org X-Mailman-Version: 2.1b5 Precedence: list List-Id: General Guile related discussions List-Help: List-Post: List-Subscribe: , List-Archive: List-Unsubscribe: , Errors-To: guile-user-bounces+guile-user=m.gmane.org@gnu.org Xref: main.gmane.org gmane.lisp.guile.user:1911 X-Report-Spam: http://spam.gmane.org/gmane.lisp.guile.user:1911 > Thanks again for the very interesting comments. Hope they're useful. Along those lines: > With respect (ice-9 regex), I'm inclined to agree with you. > If we can include a good POSIX implementation, then that > should fix the problems I've been asking about. State of the world, as I know it: * GNU regex Most versions are buggy. RMS did some work on one fork (perhaps just the one in emacs) and I'm pretty sure he fixed all the Posix bugs. So, ask in that direction. Advantages: simple, small, LGPL Disadvantages: slow on expressions that cause backtracking Uknowns: Posix compliant? (guess bias towards "yes" for the latest from RMS) * Henry Spencer's I haven't seen any more recent release than the one included in Tcl. Advantages: fast, Berkeley-ish license (GPL compatible), Unicode support Disadvantages: big and complicated. Some Posix bugs (at least ca. mid-2002) Uknowns: still maintained? * Isamu Hasegawa's (Latest glibc?) Advantages: smart implementor, DFAish, in glibc (so perhaps gets beaten upon), LGPL Disadvantages: odd space requirements, big and complicated Unknowns: Posix conformance status (guess bias: "good") and performance (guess bias: "good for short strings") * Tom Lord's (latest arch, src/hackerlab) Advantages: DFAish, fast, good correctness tests, Unicode in low-level engine (but not (yet) via Posix entry points), good growth path basis for "what should a regexp srfi do". Disadvantages: big and complicated, GPL (probably flexible on that), regcomp is mildly slow (compared to GNU regex but regexec fast), can be a fickle beast to tune (but conversely: flexibly tunable). Unknowns: maintained? (c.f., my so-called life :-) * Others Don't bother, imho. It's mostly the "big and complicated" on all but one of those that makes me suggest bundling a good fork of GNU regex, if you can get one. -t _______________________________________________ Guile-user mailing list Guile-user@gnu.org http://mail.gnu.org/mailman/listinfo/guile-user