From mboxrd@z Thu Jan  1 00:00:00 1970
Return-Path: <guix-devel-bounces+larch=yhetil.org@gnu.org>
Received: from mp1 ([2001:41d0:2:bcc0::])
	(using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits))
	by ms0.migadu.com with LMTPS
	id SNMYHNvuqWHUSgEAgWs5BA
	(envelope-from <guix-devel-bounces+larch=yhetil.org@gnu.org>)
	for <larch@yhetil.org>; Fri, 03 Dec 2021 11:18:03 +0100
Received: from aspmx1.migadu.com ([2001:41d0:2:bcc0::])
	(using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits))
	by mp1 with LMTPS
	id wIqnF9vuqWHlDAAAbx9fmQ
	(envelope-from <guix-devel-bounces+larch=yhetil.org@gnu.org>)
	for <larch@yhetil.org>; Fri, 03 Dec 2021 10:18:03 +0000
Received: from lists.gnu.org (lists.gnu.org [209.51.188.17])
	(using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits))
	(No client certificate requested)
	by aspmx1.migadu.com (Postfix) with ESMTPS id 2BEB935714
	for <larch@yhetil.org>; Fri,  3 Dec 2021 11:18:03 +0100 (CET)
Received: from localhost ([::1]:54952 helo=lists1p.gnu.org)
	by lists.gnu.org with esmtp (Exim 4.90_1)
	(envelope-from <guix-devel-bounces+larch=yhetil.org@gnu.org>)
	id 1mt5di-0004Sq-8o
	for larch@yhetil.org; Fri, 03 Dec 2021 05:18:02 -0500
Received: from eggs.gnu.org ([209.51.188.92]:45134)
 by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256)
 (Exim 4.90_1) (envelope-from <mail@cbaines.net>) id 1mt5dS-0004SR-4e
 for guix-devel@gnu.org; Fri, 03 Dec 2021 05:17:46 -0500
Received: from mira.cbaines.net ([212.71.252.8]:56380)
 by eggs.gnu.org with esmtp (Exim 4.90_1)
 (envelope-from <mail@cbaines.net>)
 id 1mt5dP-000727-Hw; Fri, 03 Dec 2021 05:17:45 -0500
Received: from localhost (unknown [IPv6:2a02:8010:68c1:0:8ac0:b4c7:f5c8:7caa])
 by mira.cbaines.net (Postfix) with ESMTPSA id B0FE527BBE9;
 Fri,  3 Dec 2021 10:17:38 +0000 (GMT)
Received: from capella (localhost [127.0.0.1])
 by localhost (OpenSMTPD) with ESMTP id 5455f3ed;
 Fri, 3 Dec 2021 10:17:38 +0000 (UTC)
References: <87ee762at1.fsf@cbaines.net> <8735ngb39u.fsf@gnu.org>
User-agent: mu4e 1.6.6; emacs 27.2
From: Christopher Baines <mail@cbaines.net>
To: Ludovic =?utf-8?Q?Court=C3=A8s?= <ludo@gnu.org>
Subject: Re: Update on bordeaux.guix.gnu.org
Date: Fri, 03 Dec 2021 09:39:17 +0000
In-reply-to: <8735ngb39u.fsf@gnu.org>
Message-ID: <87k0gm0z7k.fsf@cbaines.net>
MIME-Version: 1.0
Content-Type: multipart/signed; boundary="=-=-=";
 micalg=pgp-sha512; protocol="application/pgp-signature"
Received-SPF: pass client-ip=212.71.252.8; envelope-from=mail@cbaines.net;
 helo=mira.cbaines.net
X-Spam_score_int: -18
X-Spam_score: -1.9
X-Spam_bar: -
X-Spam_report: (-1.9 / 5.0 requ) BAYES_00=-1.9, SPF_HELO_PASS=-0.001,
 SPF_PASS=-0.001 autolearn=ham autolearn_force=no
X-Spam_action: no action
X-BeenThere: guix-devel@gnu.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: "Development of GNU Guix and the GNU System distribution."
 <guix-devel.gnu.org>
List-Unsubscribe: <https://lists.gnu.org/mailman/options/guix-devel>,
 <mailto:guix-devel-request@gnu.org?subject=unsubscribe>
List-Archive: <https://lists.gnu.org/archive/html/guix-devel>
List-Post: <mailto:guix-devel@gnu.org>
List-Help: <mailto:guix-devel-request@gnu.org?subject=help>
List-Subscribe: <https://lists.gnu.org/mailman/listinfo/guix-devel>,
 <mailto:guix-devel-request@gnu.org?subject=subscribe>
Cc: guix-devel@gnu.org
Errors-To: guix-devel-bounces+larch=yhetil.org@gnu.org
Sender: "Guix-devel" <guix-devel-bounces+larch=yhetil.org@gnu.org>
X-Migadu-Flow: FLOW_IN
X-Migadu-Country: US
ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=yhetil.org;
	s=key1; t=1638526683;
	h=from:from:sender:sender:reply-to:subject:subject:date:date:
	 message-id:message-id:to:to:cc:cc:mime-version:mime-version:
	 content-type:content-type:in-reply-to:in-reply-to:
	 references:references:list-id:list-help:list-unsubscribe:
	 list-subscribe:list-post; bh=u9tGoU8T5HHY+30CArNbGCo1+DFNKUHtHb5gcPjsgNQ=;
	b=QEgbPVZ0gMKDu/0PUDcdlWco10XO/OJiExhJKnBgEEVwcSgGvB3UCrmvKwjPsdE3R86L5J
	0mGu0ul6v+22gGmfElUFCJiAOr3OUY1ZXoRDb0RHzmbF/cZ/n9NoARWDljhjNYWmKgF8/Y
	OPyQVvJLRkEggn19cmgR3tp7XCFZdCpj4mDFfXnS1zZjSnjEWPfKQd/6A3HmZ7P9+ztm8a
	6Na+wsOqA4KGw1j8VcE7G6URqOmFpAkU1adzIHHPRBpSMwBJRQPT2yxkuS97vwCteldicT
	G+Rh5McHRNCSvuRGgWj/5xppxhEiOlLmDWyi0r62Bibr0jafudSPiVrIqUK/eg==
ARC-Seal: i=1; s=key1; d=yhetil.org; t=1638526683; a=rsa-sha256; cv=none;
	b=s896VES68WtmOGpZbg7ZyK6OLHfsl2R4Ki3Fg1PP4VCtZw1afwJ/+6fNvreqe0bEz2KJTY
	CyX0DB3slKGIAD9Us2gCIHVn4s7tPHqoPr2GwXyAPGc/ahkJEYOp2TFeEuaDFcf1y7+03C
	ohZm5zNBxI8Xj/Vlcnfo+rBnt7bWWX/D1rWofH/E3KfURj2DR9eavUF5zTyQIiE0V0H39d
	n2qBpXEanvV+RHyyxW0iBY7ZWKCTHbsHpjncgDFEEEVCnLVjPe2GKuCEMZR/FLsGGp8ixE
	9frAJ/GkHAgoM9osEGa4yOOrA5vatD3Q3P5WNdS2b3MoJYwCtv49Ma7rPOcXkA==
ARC-Authentication-Results: i=1;
	aspmx1.migadu.com;
	dkim=none;
	dmarc=none;
	spf=pass (aspmx1.migadu.com: domain of "guix-devel-bounces+larch=yhetil.org@gnu.org" designates 209.51.188.17 as permitted sender) smtp.mailfrom="guix-devel-bounces+larch=yhetil.org@gnu.org"
X-Migadu-Spam-Score: -5.02
Authentication-Results: aspmx1.migadu.com;
	dkim=none;
	dmarc=none;
	spf=pass (aspmx1.migadu.com: domain of "guix-devel-bounces+larch=yhetil.org@gnu.org" designates 209.51.188.17 as permitted sender) smtp.mailfrom="guix-devel-bounces+larch=yhetil.org@gnu.org"
X-Migadu-Queue-Id: 2BEB935714
X-Spam-Score: -5.02
X-Migadu-Scanner: scn0.migadu.com
X-TUID: LLZUuYumY8U0

--=-=-=
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: quoted-printable


Ludovic Court=C3=A8s <ludo@gnu.org> writes:

> Christopher Baines <mail@cbaines.net> skribis:
>
>> I've been doing some performance tuning, submitting builds is now more
>> parallelised, a source of slowness when fetching builds has been
>> addressed, and one of the long queries involved in allocating builds has
>> been removed, which also improved handling of the WAL (Sqlite write
>> ahead log).
>>
>> There's also a few new features. Agents can be deactivated which means
>> they won't get any builds allocated. The coordinator now checks the
>> hashes of outputs which are submitted, a safeguard which I added because
>> the coordinator now also supports resuming the uploads of outputs. This
>> is particularly important when trying to upload large (> 1GiB) outputs
>> over slow connections.
>>
>> I also added a new x86_64 build machine. It's a 4 core Intel NUC that I
>> had sitting around, but I cleaned it up and got it building things. This
>> was particularly useful as I was able to use it to retry building
>> guile@3.0.7, which is extremely hard to build [2]. This was blocking
>> building the channel instance derivations for x86_64-linux.
>>
>> 2: https://data.guix.gnu.org/gnu/store/7k6s13bzbz5fd72ha1gx9rf6rrywhxzz-=
guile-3.0.7.drv
>
> Neat!  (Though I wouldn=E2=80=99t say building Guile is =E2=80=9Cextremel=
y hard=E2=80=9D,
> especially on x86_64.  :-))  The ability to keep retrying is much
> welcome.

To rephrase, I found it extremely hard to get that particular Guile
derivation to build successfully, it failed to build 12 times, and only
succeeded when I added new hardware to attempt on (I'm guessing the
particular issue I was encountering was exacerbated by more cores).

Unfortunately, I also think that you finding it easy to build actually
contributes to the problem here, since it makes finding and addressing
issues like this harder.

>> Space is running out on bayfront, the machine that runs the coordinator,
>> stores all the nars and build logs, and serves the substitutes. I knew
>> this was probably going to be an issue, bayfront didn't have much space
>> to begin with, but I had hoped I'd be further forward in developing some
>> way to allow moving the nars around between multiple machines, to remove
>> the need to store all of them on bayfront. I have got a plan, there's
>> some ideas I mentioned back in February [4], but I haven't got around to
>> implementing anything yet. The disk space usage trend is pretty much
>> linear, so if things continue without any change, I think it will be
>> necessary to pause the agents within a month, to avoid filling up
>> bayfront entirely.
>
> Ah, bummer.  I hope we can find a solution one way or another.
> Certainly we could replicate nars on another machine with more disk,
> possibly buying the necessary hardware with the project funds.

Since this email got a bit delayed when I sent it, things have moved on
a bit now.

90% disk usage was the threshold I had in mind for bayfront, and that's
now pretty much been reached so I've paused all the agents. My plans for
how to address this have also developed a bit as well, but it's still
going to take a month at least to get things going again.

Chris

--=-=-=
Content-Type: application/pgp-signature; name="signature.asc"

-----BEGIN PGP SIGNATURE-----

iQKlBAEBCgCPFiEEPonu50WOcg2XVOCyXiijOwuE9XcFAmGp7r9fFIAAAAAALgAo
aXNzdWVyLWZwckBub3RhdGlvbnMub3BlbnBncC5maWZ0aGhvcnNlbWFuLm5ldDNF
ODlFRUU3NDU4RTcyMEQ5NzU0RTBCMjVFMjhBMzNCMEI4NEY1NzcRHG1haWxAY2Jh
aW5lcy5uZXQACgkQXiijOwuE9Xdi0g//atBNpSkhwIdd21MAFvTJ3RvhRf5NE23O
sIFc7sqS74ksOXBB64MzPJ3UkN2Ctd+Wekmkumd4zChfWP9gCd/WT1sNZVURTvba
r36Sx8dCoZTHOqFNnfhf83fa140sOq+AFb169VzdASxNTW916IKbTZpB5XsCjVfx
ztUJz5Es0u/UqzeOdgyTxnboRO6sDPQUj9myqOd4LgfFMq37TrGEsmoZRHHl2ddO
60nwtwYEvTG0GAGyHAa8ifOtHfVpG1jiCR4wniXX7/Z8eAK6aCzRxzx5Xim2OaN+
Eu8/1azLh3gPHk6G1SKKrQ2EolF1/qGXsdBT12glvFmoIjwgG4IQ9fKwxkmKhPeT
eWk3ZJcZZOvm/wHcpr7ZXBk+RKhwh1QlnKlOOTnqPHupUz599dH42yKxx443dA0r
RE/8JaL/wuVhcNJUupMKUR9YzRTfxH51pq+YIi/1r+JaRvhluscTwNcRsbGdTP6L
1Zqe12OL/dxWPswE6mB2PP8+h8Dnq7Mhj2o+6LqUGZqa+hEDYhnutnZcl7bOXHXh
C83+5SkmkwLRCsMCAfZdqXw+XskbtzpkOB+i64DpGmRataWuuSASro2db4VqzdGP
ufMKH3osgusb211jpLO2pv+LY4Zy4tkaMOjJUOREGDcPZ2+2mh455rgEtI1lx0WP
fHh0U2vOYDY=
=xi0J
-----END PGP SIGNATURE-----
--=-=-=--