From mboxrd@z Thu Jan 1 00:00:00 1970 From: Andy Wingo Subject: bug#28211: Grafting code triggers GC/thread-safety issue on Guile 2.2.2 Date: Thu, 10 May 2018 09:53:18 +0200 Message-ID: <87tvrg3q1d.fsf@igalia.com> References: <877exuj58y.fsf@gnu.org> <87d0yo1tie.fsf@gnu.org> <87fu3124nt.fsf@gnu.org> <87d0y5k6sl.fsf@netris.org> <871sel6vnq.fsf@igalia.com> <87fu30dmx3.fsf@netris.org> Mime-Version: 1.0 Content-Type: text/plain Return-path: Received: from eggs.gnu.org ([2001:4830:134:3::10]:54795) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1fGgOr-00029I-3A for bug-guix@gnu.org; Thu, 10 May 2018 03:54:06 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1fGgOn-0004vu-U1 for bug-guix@gnu.org; Thu, 10 May 2018 03:54:05 -0400 Received: from debbugs.gnu.org ([208.118.235.43]:48002) by eggs.gnu.org with esmtps (TLS1.0:RSA_AES_128_CBC_SHA1:16) (Exim 4.71) (envelope-from ) id 1fGgOn-0004vj-QQ for bug-guix@gnu.org; Thu, 10 May 2018 03:54:01 -0400 Sender: "Debbugs-submit" Resent-Message-ID: In-Reply-To: <87fu30dmx3.fsf@netris.org> (Mark H. Weaver's message of "Thu, 10 May 2018 02:50:32 -0400") List-Id: Bug reports for GNU Guix List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-guix-bounces+gcggb-bug-guix=m.gmane.org@gnu.org Sender: "bug-Guix" To: Mark H Weaver Cc: 28211@debbugs.gnu.org On Thu 10 May 2018 08:50, Mark H Weaver writes: > Andy Wingo writes: > >> On Wed 09 May 2018 02:32, Mark H Weaver writes: >> >>> However, I think it's _far_ more likely that the NULL argument on the >>> stack was copied from memory shared by multiple threads without proper >>> thread synchronization. >> >> I think this is unlikely on x86 given its total-store-ordering memory >> model. I agree with you about the value of barriers, but I don't think >> they are part of this bug that Ludo is seeing. > > I think you're forgetting about the C compiler. It's true that x86 > machine code has a TSO memory model, but C does not. In the absence of > barriers, the C compiler may freely reorder stores to non-volatile, > non-atomic objects. In particular, it is free to reorder the > initialization of an object with the write of that object's address. > > I admit that I haven't checked whether GCC 5.5.0 does this in practice. > Do you have reason to believe that it never does so? Oh I agree with you here as well, and compiler reordering could well be happening here. My suspicions are however that it's not happening. In libguile, we rarely mutate shared state, and in that case it's usually within mutexes. The main source of mutation in libguile is initialization -- but there that's on a fresh object local to a thread, and we try to avoid publishing that object to other threads without a barrier (atomic or mutex), and in any case such publishing is usually outside of the region that a compiler can work on. There is the possibility of mutation via e.g. vector-set!, but hopefully we aren't doing that on shared data; likewise in Scheme there are barriers (the atomic box instructions and mutexes, both of which are compiler barriers as well). It's still possible to write Scheme programs with races, of course, but I don't think that's what's happening here. I could be misunderstanding things of course! Andy