From: Maxim Cournoyer <maxim.cournoyer@gmail.com>
To: "Ludovic Courtès" <ludo@gnu.org>
Cc: guix-devel@gnu.org
Subject: Re: Postmortem of service downtime
Date: Fri, 24 May 2024 21:19:44 -0400
Message-ID: <87ikz2it73.fsf@gmail.com>
In-Reply-To: <877cfk77vk.fsf@gnu.org> ("Ludovic Courtès"'s message of "Thu, 23 May 2024 19:31:11 +0200")

Hi Ludovic,

Ludovic Courtès <ludo@gnu.org> writes:

> From Sunday May 19th to Tuesday May 21st, for about 36h,
> bayfront.guix.gnu.org, the machine behind many services, went down:
>
>   https://lists.gnu.org/archive/html/info-guix/2024-05/msg00000.html
>
> Affected web sites and services included:
>
>   guix.gnu.org
>   bordeaux.guix.gnu.org
>   logs.guix.gnu.org
>   hpc.guix.info
>   foundation.guix.info
>   packages.guix.gnu.org
>   qa.guix.gnu.org
>

[...]

>     A large part of the slowness was due to ‘guix substitute’ reading
>     all the 300K+ entries from /var/guix/substitute/cache and deleting
>     them, one by one (this took several minutes).  Chris had mentioned
>     that performance issue in the past; it’s not much of a problem on
>     one’s laptop with an SSD, but it’s clearly a problem here where
>     there are more entries than usual.  We should at least drastically
>     reduce the TTL of cache entries.

Interesting!
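
To make the TTL point concrete, here is a minimal sketch in Python of
age-based cache expiry.  It is not the code 'guix substitute' runs
(that is written in Guile); it assumes each cache entry is a regular
file and treats the file's modification time as its age.  The function
name, the directory layout and the use of mtime are all assumptions
made for illustration.

```python
import os
import time


def expire_cache_entries(cache_dir, ttl_seconds, now=None):
    """Delete regular files under cache_dir older than ttl_seconds.

    Hypothetical helper: age is taken from the file's mtime, which is
    an assumption, not necessarily how the real cache records expiry.
    Returns the number of files removed.
    """
    now = time.time() if now is None else now
    removed = 0
    # Walk the whole tree; entries are visited and unlinked one by one,
    # so the cost grows with the number of entries in the cache.
    for root, _dirs, files in os.walk(cache_dir):
        for name in files:
            path = os.path.join(root, name)
            try:
                if now - os.stat(path).st_mtime > ttl_seconds:
                    os.unlink(path)
                    removed += 1
            except FileNotFoundError:
                # Another process removed the entry first; skip it.
                pass
    return removed
```

With a shorter TTL, entries expire sooner, so each cleanup pass has
fewer stale files to stat and unlink.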

>   • qa-frontpage failed to build when we first reconfigured the machine,
>     so we commented it out.  This is now fixed:
>
>       https://git.savannah.gnu.org/cgit/guix/maintenance.git/commit/?id=3fecb1e8fdea65a7440fec403c1c52da197b5dfe
>
>   • guix-packages-website (the server behind packages.guix.gnu.org)
>     still refuses to start with an Artanis error:
>
>       https://issues.guix.gnu.org/71138
>
> Ludo’, on behalf of the emergency rescue^W^W sysadmin team.

Phew!  Thanks for the detailed write-up, and for the fixes and the
thankless work of getting the machine back up and running.

-- 
Maxim

