From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mp12.migadu.com ([2001:41d0:8:6d80::]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits)) by ms5.migadu.com with LMTPS id ULeiD3FTg2KlbQEAbAwnHQ (envelope-from ) for ; Tue, 17 May 2022 09:49:05 +0200 Received: from aspmx1.migadu.com ([2001:41d0:8:6d80::]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits)) by mp12.migadu.com with LMTPS id wIeZD3FTg2JikAAAauVa8A (envelope-from ) for ; Tue, 17 May 2022 09:49:05 +0200 Received: from lists.gnu.org (lists.gnu.org [209.51.188.17]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by aspmx1.migadu.com (Postfix) with ESMTPS id B7EAD1949A for ; Tue, 17 May 2022 09:49:04 +0200 (CEST) Received: from localhost ([::1]:39306 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1nqrx1-0007aF-PP for larch@yhetil.org; Tue, 17 May 2022 03:49:03 -0400 Received: from eggs.gnu.org ([2001:470:142:3::10]:42816) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1nqrv4-0006Id-HF for bug-guix@gnu.org; Tue, 17 May 2022 03:47:03 -0400 Received: from debbugs.gnu.org ([209.51.188.43]:33568) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.90_1) (envelope-from ) id 1nqrv4-0002wA-8a for bug-guix@gnu.org; Tue, 17 May 2022 03:47:02 -0400 Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84_2) (envelope-from ) id 1nqrv4-0003Rz-78 for bug-guix@gnu.org; Tue, 17 May 2022 03:47:02 -0400 X-Loop: help-debbugs@gnu.org Subject: bug#55441: [cuirass] hang in "In progress..."; runs out of pgsql connections Resent-From: Ludovic =?UTF-8?Q?Court=C3=A8s?= Original-Sender: "Debbugs-submit" Resent-CC: bug-guix@gnu.org Resent-Date: Tue, 17 May 2022 07:47:02 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 55441 X-GNU-PR-Package: guix X-GNU-PR-Keywords: To: Maxim Cournoyer Cc: 55441@debbugs.gnu.org, Mathieu Othacehe Received: via spool by 55441-submit@debbugs.gnu.org id=B55441.165277356513179 (code B ref 55441); Tue, 17 May 2022 07:47:02 +0000 Received: (at 55441) by debbugs.gnu.org; 17 May 2022 07:46:05 +0000 Received: from localhost ([127.0.0.1]:55692 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1nqru8-0003QV-Td for submit@debbugs.gnu.org; Tue, 17 May 2022 03:46:05 -0400 Received: from eggs.gnu.org ([209.51.188.92]:49294) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1nqru6-0003Pt-PP for 55441@debbugs.gnu.org; Tue, 17 May 2022 03:46:03 -0400 Received: from fencepost.gnu.org ([2001:470:142:3::e]:54090) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1nqru1-0002fA-DU; Tue, 17 May 2022 03:45:57 -0400 DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=gnu.org; s=fencepost-gnu-org; h=MIME-Version:In-Reply-To:Date:References:Subject:To: From; bh=JcteS0APi5I+vA9mFPQ5hZ63zVVlxsSFn3uYQeNDeYA=; b=W02ufWeKcK1lelvRHj1p AAoKCEXXgi/FxO6zad7we05bINPJvdvRQZmuyaLY+7aMDmJn9vsofRefpOLXKc2hgVdAAaRGZomVK za5tmOsIrdBJhCIBcWoM+v3jhxrnAQgxTCsHZNTaMqfPeKUlJu5BnFI7fY4suaZ/selv2QlU1ylhp 7F2y7d65EcRM35AIGdiOxRxLpLLDrB9x5mfy/zrRg0HY4yAMyVOycD0vRu21C2iQT09MHZRL3Ch2Z WophmnmxJjrlgw0GT+r5He73jC4scYy+jsQ7pZx0uA5UAY4Ftk5mozP0XHUTJoGrl8StHUlYgYJsG iNN0PRKDmn+cPA==; Received: from [193.50.110.244] (port=52728 helo=ribbon) by fencepost.gnu.org with esmtpsa (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1nqrtx-0001dI-5L; Tue, 17 May 2022 03:45:53 -0400 From: Ludovic =?UTF-8?Q?Court=C3=A8s?= References: <87fslcaznn.fsf@gmail.com> <87mtfj174l.fsf@gnu.org> <878rr2kug6.fsf_-_@gmail.com> <87pmke6ig4.fsf@gnu.org> <87o7zxalhu.fsf@gnu.org> <87k0alahug.fsf@gnu.org> <87fsl9acuw.fsf@gnu.org> <878rr1jsd1.fsf@gmail.com> Date: Tue, 17 May 2022 09:45:50 +0200 In-Reply-To: <878rr1jsd1.fsf@gmail.com> (Maxim Cournoyer's message of "Mon, 16 May 2022 13:32:26 -0400") Message-ID: <87fsl87gb5.fsf@gnu.org> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/27.2 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list X-BeenThere: bug-guix@gnu.org List-Id: Bug reports for GNU Guix List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-guix-bounces+larch=yhetil.org@gnu.org Sender: "bug-Guix" X-Migadu-Flow: FLOW_IN X-Migadu-To: larch@yhetil.org X-Migadu-Country: US ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=yhetil.org; s=key1; t=1652773744; h=from:from:sender:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type: content-transfer-encoding:content-transfer-encoding:resent-cc: resent-from:resent-sender:resent-message-id:in-reply-to:in-reply-to: references:references:list-id:list-help:list-unsubscribe: list-subscribe:list-post:dkim-signature; bh=JcteS0APi5I+vA9mFPQ5hZ63zVVlxsSFn3uYQeNDeYA=; b=kAYYFaH/84/GWjMydYSObW0W4vcnk7gqnPz2qZVLPOnD20mebhhnqcbFsZcceUuQ0ObXbT 2OH5pn2Krfw6YtOXDSNIqBQ40tJXBH88pwKje6vERRZ1qLB/5hQ/wPJuA7Qfo+L324UlsI zATRGZ0LBehFNjFQ4DXtROGRMIV9bmrOiUWIOHKMIe3ehUbWKyfexfrE9P8vq3t6O9hPWX AxbjCUgU1Gofg8h92pvrZfGAKmMFWJpioyXq9gQ3HdQPPg1mAVNi5CwNv1zZpplQ6IoVxf quFbIeQsAmtoP5tsrtssFIGPu4jAHdh5hHnMQ+Dm7nFaHY2pD0IDiVfnhYR4OQ== ARC-Seal: i=1; s=key1; d=yhetil.org; t=1652773744; a=rsa-sha256; cv=none; b=P3EJuuf/WrmK6o2QWt05J1yxfeq1x1wUmKY8Vge1a6u6QsiO+NhBoDX89byfblOAa2EUVV lm2T6wHzBFfxkvI2Xfy+g0L/ZSO23d2pQFopt/doLRRMP8XHxq2GojRYtLSGdE34RQeLgv EqWw+isbgS/9KlHpcFng7P2OGPqZbjZAfsPerZ6vUcwebWl94daal11KzkzDjEHkVINoZT /nro6xZUTuLRJ15oIeEATdodHeMEx9FKYlDAJ+0QpHTIb4poHTSGTIfRx+/NzylrqvL8kD Vgb0xbU2WuzE/ghIKrczBodbr44YSDBw8OXZdSsYtAQP0HB5Y65xOG7AsFQsYA== ARC-Authentication-Results: i=1; aspmx1.migadu.com; dkim=fail ("headers rsa verify failed") header.d=gnu.org header.s=fencepost-gnu-org header.b=W02ufWeK; dmarc=pass (policy=none) header.from=gnu.org; spf=pass (aspmx1.migadu.com: domain of "bug-guix-bounces+larch=yhetil.org@gnu.org" designates 209.51.188.17 as permitted sender) smtp.mailfrom="bug-guix-bounces+larch=yhetil.org@gnu.org" X-Migadu-Spam-Score: -4.53 Authentication-Results: aspmx1.migadu.com; dkim=fail ("headers rsa verify failed") header.d=gnu.org header.s=fencepost-gnu-org header.b=W02ufWeK; dmarc=pass (policy=none) header.from=gnu.org; spf=pass (aspmx1.migadu.com: domain of "bug-guix-bounces+larch=yhetil.org@gnu.org" designates 209.51.188.17 as permitted sender) smtp.mailfrom="bug-guix-bounces+larch=yhetil.org@gnu.org" X-Migadu-Queue-Id: B7EAD1949A X-Spam-Score: -4.53 X-Migadu-Scanner: scn0.migadu.com X-TUID: Ro1SEp3Q3OaO Hi, Maxim Cournoyer skribis: > Berlin was reconfigured with this commit of Cuirass, and is now running > the derivations with it, but so far still "In progress..." after more > than 100 minutes [0] Yeah, I=E2=80=99m not sure about the backtrace you report, however there=E2= =80=99s again a bunch of =E2=80=98cuirass evaluate=E2=80=99 processes hanging, this time = with the main thread stuck on =E2=80=98waitpid=E2=80=99 (presumably from the =E2=80=98clo= se-inferior=E2=80=99 call): --8<---------------cut here---------------start------------->8--- #0 0x00007f0886310f27 in __GI___wait4 (pid=3D86099, stat_loc=3Dstat_loc@en= try=3D0x7ffea0cb849c, options=3Doptions@entry=3D0, usage=3Dusage@entry=3D0x= 0) at ../sysdeps/unix/sysv/linux/wait4.c:30 #1 0x00007f0886310ea7 in __GI___waitpid (pid=3D, stat_loc= =3Dstat_loc@entry=3D0x7ffea0cb849c, options=3Doptions@entry=3D0) at waitpid.c:38 #2 0x00007f08868ae25e in scm_waitpid (pid=3D86099, options=3D) at posix.c:727 #3 0x00007f088689a336 in vm_regular_engine (thread=3D0x7f08861fad80) at vm= -engine.c:972 #4 0x00007f08868a75e9 in scm_call_n (proc=3D, argv=3D, nargs=3D1) at vm.c:1608 #5 0x00007f088680f457 in scm_primitive_eval (exp=3D, exp@entry=3D((@ (ice-9 control) %) (begin ((@@ (ice-9 command-line) loa= d/lang) "/gnu/store/z8haznhwck4bjm4gxqy25wvwv4041wvx-cuirass-1.1.0-12.f087a= af/bin/.cuirass-real") (main (command-line)) (quit)))) at eval.c:671 #6 0x00007f08868154b6 in scm_eval ( exp=3D((@ (ice-9 control) %) (begin ((@@ (ice-9 command-line) load/lang= ) "/gnu/store/z8haznhwck4bjm4gxqy25wvwv4041wvx-cuirass-1.1.0-12.f087aaf/bin= /.cuirass-real") (main (command-line)) (quit))), module_or_state=3D"#" =3D {...}) at eval.c:705 #7 0x00007f08868793b6 in scm_shell (argc=3D9, argv=3D0x7ffea0cb8c98) at sc= ript.c:357 --8<---------------cut here---------------end--------------->8--- Process 86099 (the one it=E2=80=99s waiting for) is indeed =E2=80=98guix re= pl=E2=80=99 and it=E2=80=99s waiting for input in read(0, =E2=80=A6). There=E2=80=99s a second thread stuck in =E2=80=98waitpid=E2=80=99: --8<---------------cut here---------------start------------->8--- (gdb) thread 27 [Switching to thread 27 (LWP 86002)] #0 0x00007f0886310f27 in __GI___wait4 (pid=3D86100, stat_loc=3Dstat_loc@en= try=3D0x7f085393c60c, options=3Doptions@entry=3D0, usage=3Dusage@entry=3D0x= 0) at ../sysdeps/unix/sysv/linux/wait4.c:30 30 ../sysdeps/unix/sysv/linux/wait4.c: No such file or directory. (gdb) bt #0 0x00007f0886310f27 in __GI___wait4 (pid=3D86100, stat_loc=3Dstat_loc@en= try=3D0x7f085393c60c, options=3Doptions@entry=3D0, usage=3Dusage@entry=3D0x= 0) at ../sysdeps/unix/sysv/linux/wait4.c:30 #1 0x00007f0886310ea7 in __GI___waitpid (pid=3D, stat_loc= =3Dstat_loc@entry=3D0x7f085393c60c, options=3Doptions@entry=3D0) at waitpid.c:38 #2 0x00007f08868ae25e in scm_waitpid (pid=3D86100, options=3D) at posix.c:727 #3 0x00007f088689a336 in vm_regular_engine (thread=3D0x7f087d54e240) at vm= -engine.c:972 #4 0x00007f08868a75e9 in scm_call_n (proc=3D, argv=3D, nargs=3D0) at vm.c:1608 #5 0x00007f088680ba0e in scm_call_with_unblocked_asyncs (proc=3D#) at async.c:406 #6 0x00007f088689a336 in vm_regular_engine (thread=3D0x7f087d54e240) at vm= -engine.c:972 #7 0x00007f08868a75e9 in scm_call_n (proc=3D, argv=3D, nargs=3D0) at vm.c:1608 --8<---------------cut here---------------end--------------->8--- and then a couple of threads in =E2=80=98read=E2=80=99: --8<---------------cut here---------------start------------->8--- #0 __libc_read (nbytes=3D1, buf=3D0x7f085a69ab90, fd=3D5) at ../sysdeps/un= ix/sysv/linux/read.c:26 #1 __libc_read (fd=3D5, buf=3Dbuf@entry=3D0x7f085a69ab90, nbytes=3Dnbytes@= entry=3D1) at ../sysdeps/unix/sysv/linux/read.c:24 #2 0x00007f08868252e8 in fport_read (port=3D, dst=3D, start=3D, count=3D1) at fports.c:597 #3 0x00007f0886865d22 in scm_i_read_bytes (port=3Dport@entry=3D# 7f0854af6d20>, dst=3D"#" =3D {...},=20 start=3Dstart@entry=3D0, count=3D1) at ports.c:1566 #4 0x00007f08868681c7 in scm_fill_input (port=3Dport@entry=3D# 7f0854af6d20>,=20 minimum_size=3Dminimum_size@entry=3D1, cur_out=3Dcur_out@entry=3D0x7f08= 5313b5e0, avail_out=3Davail_out@entry=3D0x7f085313b588) at ports.c:2693 #5 0x00007f0886868d5c in peek_iconv_codepoint (port=3D# 7f0854af6d20>, buf=3Dbuf@entry=3D0x7f085313b5e8,=20 cur=3Dcur@entry=3D0x7f085313b5e0, len=3Dlen@entry=3D0x7f085313b5d8) at = ports.c:1944 #6 0x00007f0886868e4a in peek_codepoint (len=3D0x7f085313b5d8, cur=3D0x7f0= 85313b5e0, buf=3D0x7f085313b5e8, port=3D) at ports.c:1988 #7 scm_peek_char (port=3D) at ports.c:2202 #8 0x00007f087e62582b in ?? () #9 0x0000000000857760 in ?? () #10 0x00007f087e6257a0 in ?? () #11 0x00007f087d850c48 in ?? () #12 0x00007f0886844ccc in scm_jit_enter_mcode (thread=3D0x7f087d54e000, mco= de=3D0x857770 "\034\234\003") at jit.c:6038 #13 0x00007f0886899f3c in vm_regular_engine (thread=3D0x7f087d54e000) at vm= -engine.c:360 --8<---------------cut here---------------end--------------->8--- Normally we call =E2=80=98waitpid=E2=80=99 once the pipe has been closed: --8<---------------cut here---------------start------------->8--- (define* (open-inferior directory #:key (command "bin/guix") (error-port (%make-void-port "w"))) "Open the inferior Guix in DIRECTORY, running 'DIRECTORY/COMMAND repl' or equivalent. Return #f if the inferior could not be launched." (let ((pipe pid (inferior-pipe directory command error-port))) (port->inferior pipe (lambda (port) (close-port port) (waitpid pid))))) ;<----- here --8<---------------cut here---------------end--------------->8--- =E2=80=A6 and when the pipe is closed, the child =E2=80=98guix repl=E2=80= =99 process gets EOF and exits. So I=E2=80=99m not sure why the =E2=80=98guix repl=E2=80=99 process would s= tick around. Thoughts? Ludo=E2=80=99.