* bug#74284: Shepherd does not respect ordering for one-shot? services
@ 2024-11-09 16:53 Tomas Volf
2024-11-22 14:37 ` Ludovic Courtès
From: Tomas Volf @ 2024-11-09 16:53 UTC (permalink / raw)
To: 74284
Hello,
I think I found a bug in the GNU Shepherd. Dependencies between
one-shot? #t services do not seem to be respected.
Documentation for #:requirement says the following (emphasis mine):
--8<---------------cut here---------------start------------->8---
#:requirement is, like provision, a list of symbols that specify
services. In this case, they name what this service depends on: before
the service can be started, services that provide those symbols *must be
started*.
Note that every name listed in #:requirement must be registered so it
can be resolved (see Service Registry).
--8<---------------cut here---------------end--------------->8---
Documentation for #:one-shot? says the following:
--8<---------------cut here---------------start------------->8---
Whether the service is a one-shot service. A one-shot service is a
service that, as soon as it has been successfully started, is marked as
“stopped.” Other services can nonetheless require one-shot
services. One-shot services are useful to trigger an action before other
services are started, such as a cleanup or an initialization action.
As for other services, the start method of a one-shot service must
return a truth value to indicate success, and false to indicate failure.
--8<---------------cut here---------------end--------------->8---
Nothing in there mentions that one-shot? services do not actually wait
on each other. To reproduce the problem, I wrote a simple
configuration file:
--8<---------------cut here---------------start------------->8---
(define %one-shot #f)
(use-modules (srfi srfi-1))
(define (make-waiting-service name wait requirement)
  (service (list name)
           #:requirement requirement
           #:start (λ _
                     (sleep wait)
                     (format #t "~a\n" name)
                     #t)
           #:one-shot? %one-shot))
(let ((svcs (pair-fold (λ (names waits svcs)
                         (cons (make-waiting-service (car names)
                                                     (car waits)
                                                     (cdr names))
                               svcs))
                       '()
                       '(a b c d)
                       '(1 2 3 4))))
  (register-services svcs)
  (start-in-the-background (map service-canonical-name svcs)))
--8<---------------cut here---------------end--------------->8---
Each service sleeps for `wait' seconds to simulate some slow work being
done. In effect, each service takes a different amount of time to
start up.
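For readers less familiar with pair-fold, the requirement chain that the
configuration builds can be inspected in plain Guile, without Shepherd.
This is only a sketch: the name `chain' is made up for illustration, and
each (name wait requirement) triple stands in for the service object
that make-waiting-service would create.

```scheme
(use-modules (srfi srfi-1))

;; Same fold as in the configuration file, but collecting plain
;; (name wait requirement) triples instead of Shepherd services.
(define chain
  (pair-fold (λ (names waits acc)
               (cons (list (car names) (car waits) (cdr names))
                     acc))
             '()
             '(a b c d)
             '(1 2 3 4)))

(write chain)
(newline)
;; prints ((d 4 ()) (c 3 (d)) (b 2 (c d)) (a 1 (b c d)))
```

So `a' requires (b c d), `b' requires (c d), `c' requires (d), and `d'
requires nothing, which is why `d' is expected to start first.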
Now, when we run it as it is, we get the following (correct) output:
--8<---------------cut here---------------start------------->8---
$ shepherd -c conf.scm
Starting service root...
Service root started.
Service root running with value #t.
Service root has been started.
Configuration successfully loaded from 'conf.scm'.
Starting service d...
d
Service d has been started.
Service d started.
Service d running with value #t.
Starting service c...
c
Service c has been started.
Service c started.
Service c running with value #t.
Starting service b...
b
Service b has been started.
Service b started.
Service b running with value #t.
Starting service a...
a
Service a has been started.
Service a started.
Successfully started 4 services in the background.
Service a running with value #t.
--8<---------------cut here---------------end--------------->8---
Notice the start-up order (d c b a). If you run it, you will also
notice that `d' takes 4 seconds to start up, `c' 3 seconds etc.
However, if we change the define at the top of the configuration file
to #t:
--8<---------------cut here---------------start------------->8---
(define %one-shot #t)
--8<---------------cut here---------------end--------------->8---
The behavior changes:
--8<---------------cut here---------------start------------->8---
$ shepherd -c conf.scm
Starting service root...
Service root started.
Service root running with value #t.
Service root has been started.
Configuration successfully loaded from 'conf.scm'.
Starting service d...
Starting service c...
Starting service b...
Starting service a...
a
Service a has been started.
Service a started.
Service a running with value #t.
b
Service b has been started.
Service b started.
Service b running with value #t.
c
Service c has been started.
Service c started.
Service c running with value #t.
d
Service d has been started.
Service d started.
Successfully started 4 services in the background.
Service d running with value #t.
--8<---------------cut here---------------end--------------->8---
Notice that the order changed to (a b c d), which matches the increasing
wait time, and that the initial messages are all grouped together:
--8<---------------cut here---------------start------------->8---
Starting service d...
Starting service c...
Starting service b...
Starting service a...
--8<---------------cut here---------------end--------------->8---
and the whole start-up takes 4 seconds (the wait time of `d'). That
seems to indicate that all 4 services are actually starting at the same
time without waiting as they should per the #:requirement argument.
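As a rough check on the timings: if the #:requirement chain were
honored, the four starts would run back to back, so the total would be
about the sum of the waits rather than the largest one. A minimal sketch
in plain Guile, ignoring any overhead from Shepherd itself (the names
`waits', `sequential-total' and `concurrent-total' are illustrative
only):

```scheme
;; Wait times of services a, b, c and d from the configuration file.
(define waits '(1 2 3 4))

;; Services started one after another: the waits add up.
(define sequential-total (apply + waits))

;; Services started all at once: only the slowest one matters.
(define concurrent-total (apply max waits))

(format #t "sequential: ~a s, concurrent: ~a s\n"
        sequential-total concurrent-total)
;; prints: sequential: 10 s, concurrent: 4 s
```

The observed 4 seconds matches the concurrent case.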
Have a nice day,
Tomas
--
There are only two hard things in Computer Science:
cache invalidation, naming things and off-by-one errors.
* bug#74284: Shepherd does not respect ordering for one-shot? services
2024-11-09 16:53 bug#74284: Shepherd does not respect ordering for one-shot? services Tomas Volf
@ 2024-11-22 14:37 ` Ludovic Courtès
2024-11-22 19:41 ` Tomas Volf
From: Ludovic Courtès @ 2024-11-22 14:37 UTC (permalink / raw)
To: 74284; +Cc: Dariqq
Hi Tomas,
(+ Dariqq since we briefly discussed it on IRC yesterday.)
Tomas Volf <~@wolfsden.cz> skribis:
> Notice that the order changed to (a b c d), which matches the increasing
> wait time, and that the initial messages are all grouped together:
>
> Starting service d...
> Starting service c...
> Starting service b...
> Starting service a...
>
> and the whole start-up takes 4 seconds (the wait time of `d'). That
> seems to indicate that all 4 services are actually starting at the same
> time without waiting as they should per the #:requirement argument.
Indeed. As Dariqq found out, the problem was that we’d mark one-shot
services in ‘%one-shot-services-started’ as soon as we’ve started them,
effectively acting as if “started” were synonymous with “running”.
This is fixed with 550c0370985022c5c90a7b477a5e0b84f6faf5d7.
Let me know if you find anything fishy!
Thanks,
Ludo’.
* bug#74284: Shepherd does not respect ordering for one-shot? services
2024-11-22 14:37 ` Ludovic Courtès
@ 2024-11-22 19:41 ` Tomas Volf
From: Tomas Volf @ 2024-11-22 19:41 UTC (permalink / raw)
To: Ludovic Courtès; +Cc: Dariqq, 74284
Hi Ludo',
Ludovic Courtès <ludo@gnu.org> writes:
> Indeed. As Dariqq found out, the problem was that we’d mark one-shot
> services in ‘%one-shot-services-started’ as soon as we’ve started them,
> effectively acting as if “started” were synonymous with “running”.
>
> This is fixed with 550c0370985022c5c90a7b477a5e0b84f6faf5d7.
I have checked out the commit and verified it with my original
reproducer. Everything seems to work as it should, thank you for fixing
it :)
> Let me know if you find anything fishy!
Did not notice anything, so once 1.0.0 lands in Guix we can just close
this bug.
Have a nice day,
Tomas