From mboxrd@z Thu Jan 1 00:00:00 1970 From: ludo@gnu.org (Ludovic =?utf-8?Q?Court=C3=A8s?=) Subject: Re: Hydra status Date: Tue, 22 Apr 2014 11:03:40 +0200 Message-ID: <874n1l20sz.fsf@gnu.org> References: <87wqeza7cn.fsf@gnu.org> <20140409073818.GA16419@debian> <20140409190750.GA25503@debian> <87ioqi1a9b.fsf@gnu.org> <87mwfqyzas.fsf@gnu.org> Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Return-path: Received: from eggs.gnu.org ([2001:4830:134:3::10]:46992) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1WcWcE-000093-VS for guix-devel@gnu.org; Tue, 22 Apr 2014 05:03:51 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1WcWcA-0001Bp-4q for guix-devel@gnu.org; Tue, 22 Apr 2014 05:03:46 -0400 Received: from hera.aquilenet.fr ([2a01:474::1]:38247) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1WcWc9-0001BY-Nu for guix-devel@gnu.org; Tue, 22 Apr 2014 05:03:42 -0400 In-Reply-To: <87mwfqyzas.fsf@gnu.org> ("Ludovic \=\?utf-8\?Q\?Court\=C3\=A8s\=22'\?\= \=\?utf-8\?Q\?s\?\= message of "Sun, 13 Apr 2014 00:10:19 +0200") List-Id: "Development of GNU Guix and the GNU System distribution." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: guix-devel-bounces+gcggd-guix-devel=m.gmane.org@gnu.org Sender: guix-devel-bounces+gcggd-guix-devel=m.gmane.org@gnu.org To: Andreas Enge Cc: guix-devel@gnu.org ludo@gnu.org (Ludovic Court=C3=A8s) skribis: > ludo@gnu.org (Ludovic Court=C3=A8s) skribis: > >> Going to http://hydra.gnu.org/build/49569/ shows that the texlive builds >> were actually aborted. The =E2=80=9CNix error output=E2=80=9D box shows= this: >> >> offloading '/gnu/store/j0b550qfqaffffc1w550v0zq2nlb0sqc-texlive-2013.drv= ' to 'hydra.gnunet.org'... >> @ build-remote /gnu/store/j0b550qfqaffffc1w550v0zq2nlb0sqc-texlive-2013.= drv hydra.gnunet.org >> retrieving 2 files from 'hydra.gnunet.org'... >> Backtrace: >> In ice-9/boot-9.scm: >> 2401: 19 [save-module-excursion #] >> 4050: 18 [#] >> 1724: 17 [%start-stack load-stack ...] >> 1729: 16 [#] >> In unknown file: >> ?: 15 [primitive-load "/usr/local/bin/guix"] >> In guix/ui.scm: >> 597: 14 [run-guix-command offload "x86_64-linux" "3600" "1" "72000"] >> In guix/scripts/offload.scm: >> 686: 13 [guix-offload "x86_64-linux" "3600" "1" "72000"] >> In ice-9/boot-9.scm: >> 157: 12 [catch srfi-34 ...] >> In guix/scripts/offload.scm: >> 413: 11 [transfer-and-offload # # # ...] >> 488: 10 [retrieve-files # #] >> In ice-9/boot-9.scm: >> 157: 9 [catch srfi-34 ...] >> In guix/scripts/offload.scm: >> 497: 8 [#] >> In guix/utils.scm: >> 240: 7 [call-with-decompressed-port xz # ...] >> In guix/nar.scm: >> 426: 6 [loop 1 ()] >> 256: 5 [restore "/gnu/store/guix-8uVCz3"] >> 256: 4 [restore "/gnu/store/guix-8uVCz3/texmf-dist"] >> 256: 3 [restore "/gnu/store/guix-8uVCz3/texmf-dist/doc"] >> 256: 2 [restore "/gnu/store/guix-8uVCz3/texmf-dist/doc/generic"] >> 248: 1 [restore "/gnu/store/guix-8uVCz3/texmf-dist/doc/generic/babel-sw= edish"] >> In unknown file: >> ?: 0 [mkdir "/gnu/store/guix-8uVCz3/texmf-dist/doc/generic/babel-swed= ish" ...] >> >> ERROR: In procedure mkdir: >> ERROR: In procedure mkdir: No such file or directory >> lsh: io.c: write failed (errno =3D 32): Broken pipe >> lsh: channel.c: stdout: I/O error, Broken pipe >> lsh: Ignoring data on channel which is closing >> builder for `/gnu/store/j0b550qfqaffffc1w550v0zq2nlb0sqc-texlive-2013.dr= v' failed with exit code 1 >> @ hook-failed /gnu/store/j0b550qfqaffffc1w550v0zq2nlb0sqc-texlive-2013.d= rv - 256 builder for `/gnu/store/j0b550qfqaffffc1w550v0zq2nlb0sqc-texlive-2= 013.drv' failed with exit code 1 >> error: build of `/gnu/store/j0b550qfqaffffc1w550v0zq2nlb0sqc-texlive-201= 3.drv' failed >> >> So the problem occurred while retrieving and restoring the build result >> (the log shows just these two lines because apparently, the result was >> already available on the build machine hydra.gnunet.org=E2=80=93which is= no >> surprise given that /gnu is 1.5T on that machine, vs. 400G on >> hydra.gnu.org.) >> >> The =E2=80=98mkdir=E2=80=99 error here suggests that the temporary place= were we are >> restoring, /gnu/store/guix-8uVCz3, has been removed right in the middle >> of the process by the GC. >> >> I believe this is fixed by 9132b9b. > > Actually no (I=E2=80=99ve seen other occurrences of that in the meantime.) > > 9132b9b had a thinko, which is (hopefully!) fixed by 6071b55. I=E2=80=99= ve just > installed it on hydra.gnu.org. There was another problem with that =E2=80=9Cfix=E2=80=9D, namely that the = daemon ignores normal GC roots pointing to =E2=80=9Cabnormal=E2=80=9D store files = like /gnu/store/guix-8uVCz3. This led to the (now abandoned) proposal at . I finally came up with a simpler solution in commit 50db7d8: a connection to the daemon is opened for the sake of holding a per-process GC root to that temporary store file. Ludo=E2=80=99.