all messages for Guix-related lists mirrored at yhetil.org
 help / color / mirror / code / Atom feed
* bug#65306: [shepherd] ntpd throws shepherd out of the loop
@ 2023-08-15  5:18 Liliana Marie Prikler
  2023-08-15 11:13 ` Csepp
  2023-08-15 14:27 ` Timotej Lazar
  0 siblings, 2 replies; 8+ messages in thread
From: Liliana Marie Prikler @ 2023-08-15  5:18 UTC (permalink / raw)
  To: 65306

Hi Guix,

I have a laptop that's a little stuck in the past… more accurately
January of 2020 thanks to what I believe to be an empty CMOS battery. 
As of recently (maybe it dates back longer, but I first experienced it
two weeks ago and just now got to debugging it a little), Shepherd gets
stuck at 100% CPU usage "early" on first boot.  I can prevent this
issue by getting the system time "close enough" to the actual time
before the NTP sync, but see the first sentence.  Not having a network
connection also works, but that's somewhat unpractical.  Also, the high
CPU usage still occurs if a sync is done later.  I have yet to
encounter the bug post hibernation, but I also wish not to.  There
doesn't appear to be anything particular interesting in the logs
either.

Cheers




^ permalink raw reply	[flat|nested] 8+ messages in thread

* bug#65306: [shepherd] ntpd throws shepherd out of the loop
  2023-08-15  5:18 bug#65306: [shepherd] ntpd throws shepherd out of the loop Liliana Marie Prikler
@ 2023-08-15 11:13 ` Csepp
  2023-08-15 14:27 ` Timotej Lazar
  1 sibling, 0 replies; 8+ messages in thread
From: Csepp @ 2023-08-15 11:13 UTC (permalink / raw)
  To: Liliana Marie Prikler; +Cc: 65306


Liliana Marie Prikler <liliana.prikler@gmail.com> writes:

> Hi Guix,
>
> I have a laptop that's a little stuck in the past… more accurately
> January of 2020 thanks to what I believe to be an empty CMOS battery. 
> As of recently (maybe it dates back longer, but I first experienced it
> two weeks ago and just now got to debugging it a little), Shepherd gets
> stuck at 100% CPU usage "early" on first boot.  I can prevent this
> issue by getting the system time "close enough" to the actual time
> before the NTP sync, but see the first sentence.  Not having a network
> connection also works, but that's somewhat unpractical.  Also, the high
> CPU usage still occurs if a sync is done later.  I have yet to
> encounter the bug post hibernation, but I also wish not to.  There
> doesn't appear to be anything particular interesting in the logs
> either.
>
> Cheers

This sounds like an issue with slow incremental system time updates,
although I don't understand why that would cause Shepherd to hang, but
maybe the NTP service is configured to only report itself as initialized
once it has finished synchronizing, which defeats the point of
incremental updating.
There is probably a config setting to tell ntpd to perform the update in
a single step, at least I know chrony has one.

ps.: don't wait until the battery starts leaking to replace it




^ permalink raw reply	[flat|nested] 8+ messages in thread

* bug#65306: [shepherd] ntpd throws shepherd out of the loop
  2023-08-15  5:18 bug#65306: [shepherd] ntpd throws shepherd out of the loop Liliana Marie Prikler
  2023-08-15 11:13 ` Csepp
@ 2023-08-15 14:27 ` Timotej Lazar
  2023-09-02 20:44   ` Ludovic Courtès
  1 sibling, 1 reply; 8+ messages in thread
From: Timotej Lazar @ 2023-08-15 14:27 UTC (permalink / raw)
  To: Liliana Marie Prikler, 65306

Liliana Marie Prikler <liliana.prikler@gmail.com> [2023-08-15 07:18:02+0200]:
> As of recently (maybe it dates back longer, but I first experienced it
> two weeks ago and just now got to debugging it a little), Shepherd gets
> stuck at 100% CPU usage "early" on first boot.

I have this issue on all Guix systems without a (working) RTC. It seems
to be caused by a recentish update to guile-fibers:

https://github.com/wingo/fibers/issues/89

For me this happens regardless of whether the system time is pushed
forward manually or by ntpd. Depending on the time delta and CPU speed,
the usage returns to normal after a couple of days. During that time any
socket-activated services like SSH are also unreachable.




^ permalink raw reply	[flat|nested] 8+ messages in thread

* bug#65306: [shepherd] ntpd throws shepherd out of the loop
  2023-08-15 14:27 ` Timotej Lazar
@ 2023-09-02 20:44   ` Ludovic Courtès
  2023-09-02 21:41     ` Liliana Marie Prikler
  2023-09-04  5:46     ` Timotej Lazar
  0 siblings, 2 replies; 8+ messages in thread
From: Ludovic Courtès @ 2023-09-02 20:44 UTC (permalink / raw)
  To: Timotej Lazar; +Cc: Liliana Marie Prikler, 65306

Hi,

Timotej Lazar <timotej.lazar@araneo.si> skribis:

> Liliana Marie Prikler <liliana.prikler@gmail.com> [2023-08-15 07:18:02+0200]:
>> As of recently (maybe it dates back longer, but I first experienced it
>> two weeks ago and just now got to debugging it a little), Shepherd gets
>> stuck at 100% CPU usage "early" on first boot.
>
> I have this issue on all Guix systems without a (working) RTC. It seems
> to be caused by a recentish update to guile-fibers:
>
> https://github.com/wingo/fibers/issues/89

Yeah, that’s the one.

Liliana, Timotej: could you try the Guix patch I posted at
<https://issues.guix.gnu.org/64966>?

Thanks,
Ludo’.




^ permalink raw reply	[flat|nested] 8+ messages in thread

* bug#65306: [shepherd] ntpd throws shepherd out of the loop
  2023-09-02 20:44   ` Ludovic Courtès
@ 2023-09-02 21:41     ` Liliana Marie Prikler
  2023-09-03 19:58       ` Ludovic Courtès
  2023-09-04  5:46     ` Timotej Lazar
  1 sibling, 1 reply; 8+ messages in thread
From: Liliana Marie Prikler @ 2023-09-02 21:41 UTC (permalink / raw)
  To: Ludovic Courtès, Timotej Lazar; +Cc: 65306

Am Samstag, dem 02.09.2023 um 22:44 +0200 schrieb Ludovic Courtès:
> Hi,
> 
> Timotej Lazar <timotej.lazar@araneo.si> skribis:
> 
> > Liliana Marie Prikler <liliana.prikler@gmail.com> [2023-08-15
> > 07:18:02+0200]:
> > > As of recently (maybe it dates back longer, but I first
> > > experienced it two weeks ago and just now got to debugging it a
> > > little), Shepherd gets stuck at 100% CPU usage "early" on first
> > > boot.
> > 
> > I have this issue on all Guix systems without a (working) RTC. It
> > seems to be caused by a recentish update to guile-fibers:
> > 
> > https://github.com/wingo/fibers/issues/89
> 
> Yeah, that’s the one.
> 
> Liliana, Timotej: could you try the Guix patch I posted at
> <https://issues.guix.gnu.org/64966>?
Do we have a guide on how to swap out shepherd from the config.scm? 
The machine that experiences this fault isn't set up for Guix hacking.

Cheers




^ permalink raw reply	[flat|nested] 8+ messages in thread

* bug#65306: [shepherd] ntpd throws shepherd out of the loop
  2023-09-02 21:41     ` Liliana Marie Prikler
@ 2023-09-03 19:58       ` Ludovic Courtès
  0 siblings, 0 replies; 8+ messages in thread
From: Ludovic Courtès @ 2023-09-03 19:58 UTC (permalink / raw)
  To: Liliana Marie Prikler; +Cc: Timotej Lazar, 65306

Hi,

Liliana Marie Prikler <liliana.prikler@gmail.com> skribis:

> Am Samstag, dem 02.09.2023 um 22:44 +0200 schrieb Ludovic Courtès:

[...]

>> Liliana, Timotej: could you try the Guix patch I posted at
>> <https://issues.guix.gnu.org/64966>?
> Do we have a guide on how to swap out shepherd from the config.scm? 
> The machine that experiences this fault isn't set up for Guix hacking.

You can do something like this in your OS config:

  (essential-services
   (modify-services (operating-system-default-essential-services
                     this-operating-system)
     (shepherd-root-service-type
      config => (shepherd-configuration
                 (shepherd insert-custom-sherpherd-here)))))

(Initially mentioned at
<https://lists.gnu.org/archive/html/guix-devel/2023-04/msg00396.html>.)

HTH!

Ludo’.




^ permalink raw reply	[flat|nested] 8+ messages in thread

* bug#65306: [shepherd] ntpd throws shepherd out of the loop
  2023-09-02 20:44   ` Ludovic Courtès
  2023-09-02 21:41     ` Liliana Marie Prikler
@ 2023-09-04  5:46     ` Timotej Lazar
  2023-09-08 16:50       ` Ludovic Courtès
  1 sibling, 1 reply; 8+ messages in thread
From: Timotej Lazar @ 2023-09-04  5:46 UTC (permalink / raw)
  To: Ludovic Courtès; +Cc: Liliana Marie Prikler, 65306

Ludovic Courtès <ludo@gnu.org> [2023-09-02 22:44:03+0200]:
> Liliana, Timotej: could you try the Guix patch I posted at
> <https://issues.guix.gnu.org/64966>?

That patch works for my aarch64 board. I encounter the same issue on an
x86_64 system without a functional RTC, but at least now I know how to
apply a workaround. Thanks!




^ permalink raw reply	[flat|nested] 8+ messages in thread

* bug#65306: [shepherd] ntpd throws shepherd out of the loop
  2023-09-04  5:46     ` Timotej Lazar
@ 2023-09-08 16:50       ` Ludovic Courtès
  0 siblings, 0 replies; 8+ messages in thread
From: Ludovic Courtès @ 2023-09-08 16:50 UTC (permalink / raw)
  To: Timotej Lazar; +Cc: 65306-done, Liliana Marie Prikler

Timotej Lazar <timotej.lazar@araneo.si> skribis:

> Ludovic Courtès <ludo@gnu.org> [2023-09-02 22:44:03+0200]:
>> Liliana, Timotej: could you try the Guix patch I posted at
>> <https://issues.guix.gnu.org/64966>?
>
> That patch works for my aarch64 board. I encounter the same issue on an
> x86_64 system without a functional RTC, but at least now I know how to
> apply a workaround. Thanks!

Right.  I’ve committed a variant of this patch (will push shortly).

Thanks for testing!

Ludo’.




^ permalink raw reply	[flat|nested] 8+ messages in thread

end of thread, other threads:[~2023-09-08 16:51 UTC | newest]

Thread overview: 8+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2023-08-15  5:18 bug#65306: [shepherd] ntpd throws shepherd out of the loop Liliana Marie Prikler
2023-08-15 11:13 ` Csepp
2023-08-15 14:27 ` Timotej Lazar
2023-09-02 20:44   ` Ludovic Courtès
2023-09-02 21:41     ` Liliana Marie Prikler
2023-09-03 19:58       ` Ludovic Courtès
2023-09-04  5:46     ` Timotej Lazar
2023-09-08 16:50       ` Ludovic Courtès

Code repositories for project(s) associated with this external index

	https://git.savannah.gnu.org/cgit/guix.git

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.