unofficial mirror of bug-guile@gnu.org 
 help / color / mirror / Atom feed
* regexp_exec thread-unsafe
@ 2013-05-14 21:21 Ludovic Courtès
  2013-05-15  7:34 ` bug#14404: " Ludovic Courtès
  2013-05-19 21:30 ` Paul Eggert
  0 siblings, 2 replies; 8+ messages in thread
From: Ludovic Courtès @ 2013-05-14 21:21 UTC (permalink / raw)
  To: bug-gnulib; +Cc: bug-guile

Hello,

Guile imports Gnulib ‘regexp’ module, and for some reason it gets
compiled even on glibc systems.

The problem is that Gnulib’s ‘regex_internal.h’ reads this:

  #if defined _LIBC
  # include <bits/libc-lock.h>
  #else
  # define __libc_lock_init(NAME) do { } while (0)
  # define __libc_lock_lock(NAME) do { } while (0)
  # define __libc_lock_unlock(NAME) do { } while (0)
  #endif

and ‘regexp_exec’ happily does:

    __libc_lock_lock (dfa->lock);
    if (preg->no_sub)
      err = re_search_internal (preg, string, length, start, length,
                                length, 0, NULL, eflags);
    else
      err = re_search_internal (preg, string, length, start, length,
                                length, nmatch, pmatch, eflags);
    __libc_lock_unlock (dfa->lock);

Obviously that doesn’t work well, and it caused a regression in Guile.
With MALLOC_CHECK_=2 and a Guile program, it aborts like this (sorry, I
failed to come up with a reduced test case):

--8<---------------cut here---------------start------------->8---
(gdb) bt full
#0  0x00007f793608d2b5 in raise () from /home/ludo/.guix-profile/lib/libc.so.6
No symbol table info available.
#1  0x00007f793608e738 in abort () from /home/ludo/.guix-profile/lib/libc.so.6
No symbol table info available.
#2  0x00007f79360d2afe in malloc_printerr () from /home/ludo/.guix-profile/lib/libc.so.6
No symbol table info available.
#3  0x00007f79360d5bf5 in realloc_check () from /home/ludo/.guix-profile/lib/libc.so.6
No symbol table info available.
#4  0x00007f7937a6efed in register_state (dfa=0x10872c0, newstate=newstate@entry=0x1c6dce0, hash=hash@entry=576)
    at regex_internal.c:1590
        new_alloc = 8
        new_array = 0x0
        spot = 0x11981c0
        i = <optimized out>
#5  0x00007f7937a72373 in create_cd_newstate (hash=<optimized out>, context=1, nodes=0x12, dfa=0x1c6dce0) at regex_internal.c:1734
        i = <optimized out>
        nctx_nodes = <optimized out>
        err = <optimized out>
        newstate = 0x1c6dce0
#6  re_acquire_state_context (err=err@entry=0x7fff67bd9e40, dfa=dfa@entry=0x10872c0, nodes=nodes@entry=0x7fff67bd9e60, 
    context=context@entry=1) at regex_internal.c:1554
        hash = 6
        spot = <optimized out>
        i = <optimized out>
#7  0x00007f79379bb54b in build_trtable (dfa=<optimized out>, state=state@entry=0x1c44820) at regexec.c:3474
        next_node = <optimized out>
        err = _REG_NOERROR
        i = 1
        j = 2
        ch = <optimized out>
        need_word_trtable = true
        elem = <optimized out>
        mask = <optimized out>
        dests_node_malloced = true
        dest_states_malloced = true
        trtable = <optimized out>
        dest_states = 0x1c688f0
        dest_states_word = 0x1c68918
        dest_states_nl = 0x1c68940
        follows = {alloc = 74, nelem = 18, elems = 0x1c5d4e0}
        dests_node = 0x1c642b0
        dests_ch = 0x1c65ab0
        acceptable = {288063250384289792, 576460743847706622, 0, 0}
        dests_alloc = 0x1c642b0
#8  0x00007f7937a77714 in transit_state (state=0x1c44820, mctx=0x7fff67bda000, err=0x7fff67bd9fb0) at regexec.c:2339
        trtable = <optimized out>
        ch = 116 't'
#9  check_matching (p_match_first=0x7fff67bd9fb8, fl_longest_match=true, mctx=0x7fff67bda000) at regexec.c:1187
        old_state = 0x1c44820
        next_char_idx = <optimized out>
        dfa = <optimized out>
        err = _REG_NOERROR
        match = 0
        match_last = <optimized out>
        next_start_idx = 0
        cur_str_idx = <optimized out>
        cur_state = 0x1c44820
        at_init_state = <optimized out>
#10 re_search_internal (preg=0x1256700, string=0x1c612d0 "http://hydra.gnu.org/nar/0did57blsaaspj49cl3q8nnydm5pr1qg-cflow-1.4", 
    length=67, start=0, last_start=0, stop=<optimized out>, nmatch=9, pmatch=0x1c63d50, eflags=0) at regexec.c:856
        err = _REG_NOERROR
        dfa = 0x10872c0
        left_lim = 0
        right_lim = 0
        incr = 1
        fl_longest_match = <optimized out>
        match_kind = 8
        match_first = 0
        match_last = <optimized out>
        extra_nmatch = 0
        sb = false
        mctx = {input = {raw_mbs = 0x1c612d0 "http://hydra.gnu.org/nar/0did57blsaaspj49cl3q8nnydm5pr1qg-cflow-1.4", 
            mbs = 0x1c612d0 "http://hydra.gnu.org/nar/0did57blsaaspj49cl3q8nnydm5pr1qg-cflow-1.4", wcs = 0x1c63f50, 
            offsets = 0x0, cur_state = {__count = 0, __value = {__wch = 0, __wchb = "\000\000\000"}}, raw_mbs_idx = 0, 
            valid_len = 67, valid_raw_len = 67, bufs_len = 68, cur_idx = 2, raw_len = 67, len = 67, raw_stop = 67, stop = 67, 
            tip_context = 6, trans = 0x0, word_char = 0x1087378, icase = 0 '\000', is_utf8 = 1 '\001', map_notascii = 0 '\000', 
            mbs_allocated = 0 '\000', offsets_needed = 0 '\000', newline_anchor = 0 '\000', word_ops_used = 0 '\000', 
            mb_cur_max = 6}, dfa = 0x10872c0, eflags = 0, match_last = 18446744073709551615, last_node = 0, 
          state_log = 0x1c64070, state_log_top = 1, nbkref_ents = 0, abkref_ents = 0, bkref_ents = 0x0, max_mb_elem_len = 0, 
          nsub_tops = 0, asub_tops = 0, sub_tops = 0x0}
        fastmap = 0x0
        t = 0x0
#11 0x00007f7937a7b5c0 in rpl_regexec (preg=0x1256700, string=0x3399 <Address 0x3399 out of bounds>, 
    string@entry=0x1c612d0 "http://hydra.gnu.org/nar/0did57blsaaspj49cl3q8nnydm5pr1qg-cflow-1.4", nmatch=6, nmatch@entry=9, 
    pmatch=0xffffffffffffffff, pmatch@entry=0x1c63d50, eflags=0) at regexec.c:254
        err = _REG_NOERROR
        start = 13209
        length = 0
#12 0x00007f7937a68873 in scm_regexp_exec (rx=0x1836090, str=0x1806b00, start=<optimized out>, flags=0x2) at regex-posix.c:288
        status = <optimized out>
        nmatches = 9
        offset = 0
        matches = 0x1c63d50
        c_str = 0x1c612d0 "http://hydra.gnu.org/nar/0did57blsaaspj49cl3q8nnydm5pr1qg-cflow-1.4"
        mvec = 0x4
        substr = <optimized out>
--8<---------------cut here---------------end--------------->8---

How should that be fixed?  Shouldn’t __libc_lock_unlock & co. be rebased
on top of pthread_mutex_t?

Thanks,
Ludo’.



^ permalink raw reply	[flat|nested] 8+ messages in thread

end of thread, other threads:[~2014-03-11 23:15 UTC | newest]

Thread overview: 8+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2013-05-14 21:21 regexp_exec thread-unsafe Ludovic Courtès
2013-05-15  7:34 ` bug#14404: " Ludovic Courtès
2013-05-19 21:30 ` Paul Eggert
2013-05-20 14:20   ` bug#14404: " Ludovic Courtès
     [not found]   ` <87d2slu413.fsf@gnu.org>
2013-05-26 22:11     ` Paolo Bonzini
     [not found]     ` <51A288AD.4070405@gnu.org>
2013-05-30  0:59       ` Paul Eggert
     [not found]       ` <51A6A489.2060101@cs.ucla.edu>
2013-05-30  7:28         ` Paolo Bonzini
2014-03-11 23:15         ` Ludovic Courtès

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).