all messages for Guix-related lists mirrored at yhetil.org
 help / color / mirror / code / Atom feed
From: "Ludovic Courtès" <ludo@gnu.org>
To: "Sergio Pastor Pérez" <sergio.pastorperez@outlook.es>
Cc: guix-devel@gnu.org
Subject: Re: Should we document how to detect if build machines are reachable before trying to offload?
Date: Sun, 21 Jul 2024 14:56:54 +0200	[thread overview]
Message-ID: <87v80yoqcp.fsf@gnu.org> (raw)
In-Reply-To: <PAXP251MB0348D24F884757DC4A004500F3A52@PAXP251MB0348.EURP251.PROD.OUTLOOK.COM> ("Sergio Pastor Pérez"'s message of "Thu, 11 Jul 2024 22:23:42 +0200")

Hi,

Sergio Pastor Pérez <sergio.pastorperez@outlook.es> skribis:

>> Do you remember exactly under what circumstances it hangs?  I think
>> ‘guix offload’ should handle that situation gracefully and we should fix
>> it if it does not.
>
> Yeah. It happens when I have a build machine configured like so and I
> disconnect it from the Ethernet connection:
>
> (build-machines
>  (list
>   #~(build-machine
>      (name "remote")
>      (systems (list "x86_64-linux" "i686-linux"))
>      (host-key %remote-host-key
>      (private-key %local-key))))
>
>
> With this configuration `guix offload test` will timeout after 30
> seconds, as you describe. But this other command will hang indefinitely:
>
> $ timeout 1m guix build imhex -M 0
> The following derivation will be built:
>   /gnu/store/9absqzdd4ak3pms2jw6rkhlmjvm8zzyv-imhex-1.35.1.drv
> process 12199 acquired build slot '/var/guix/offload/bordercollie:22/0'
> guix offload: error: failed to connect to 'bordercollie': No route to host
> waiting for locks or build slots...
> process 12199 acquired build slot '/var/guix/offload/bordercollie:22/0'
> guix offload: error: failed to connect to 'bordercollie': No route to host

I believe the problem here is that offloading always wants to offload.
That is, when all the machines in /etc/guix/machines.scm are
unavailable, ‘guix offload’ says so to guix-daemon, but then guix-daemon
just keeps retrying (if you had more than one machine in
/etc/guix/machines.scm, one of which is unavailable, ‘guix offload’
would just pick another one.)

I guess this is probably what we should permit: building locally when we
cannot offload.

Does that make sense?

Ludo’.


  reply	other threads:[~2024-07-21 12:57 UTC|newest]

Thread overview: 8+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2024-07-04 15:45 Should we document how to detect if build machines are reachable before trying to offload? Sergio Pastor Pérez
2024-07-11  9:48 ` Ludovic Courtès
2024-07-11 20:23   ` Sergio Pastor Pérez
2024-07-21 12:56     ` Ludovic Courtès [this message]
2024-07-21 15:59       ` Sergio Pastor Pérez
2024-07-21 16:26       ` Vincent Legoll
2024-07-21 17:25         ` Tomas Volf
2024-07-22 14:59       ` Simon Tournier

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=87v80yoqcp.fsf@gnu.org \
    --to=ludo@gnu.org \
    --cc=guix-devel@gnu.org \
    --cc=sergio.pastorperez@outlook.es \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
Code repositories for project(s) associated with this external index

	https://git.savannah.gnu.org/cgit/guix.git

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.