unofficial mirror of guix-devel@gnu.org 
 help / color / mirror / code / Atom feed
From: "Ludovic Courtès" <ludovic.courtes@inria.fr>
To: <guix-devel@gnu.org>
Subject: Disarchive update
Date: Sat, 09 Oct 2021 12:05:50 +0200	[thread overview]
Message-ID: <87r1cu1pj5.fsf@inria.fr> (raw)

Hello Guix!

This job is disassembling all the .tar.gz files packages refer to, using
the recently-added ‘etc/disarchive-manifest.scm’ file:

  https://ci.guix.gnu.org/jobset/disarchive

It has just succeeded for the first time.  :-)

  https://ci.guix.gnu.org/eval/29213?status=succeeded

If you run:

  guix build /gnu/store/nnl67m8c2x9rwqbnych1agc6p7g5473g-disarchive-collection.drv

or:

  guix build -m etc/disarchive-manifest.scm

and if you’re patient :-), you eventually get a 579 MB directory
containing Disarchive metadata for 8,413 tarballs out of 9,113 (the
missing tarballs are those that “disarchive disassemble” fails to
handle, for instance because it couldn’t guess what compression method
is being used.)

Where to go from here?  Timothy Sample had already set up a Disarchive
database at <https://disarchive.ngyro.com>, which (guix download) uses
as a fallback; I’m not sure exactly how it’s populated.  The goal here
would be for the Guix project to set up infrastructure populating a
database automatically and creating backups, possibly via SWH (we’ll
have to discuss it with them).

A plan we can already deploy would be:

  1. Add the disarchive.guix.gnu.org DNS entry, pointing to berlin.

  2. On berlin, add an mcron job that periodically copies the output of
     the latest “disarchive-collection” build to a directory, say
     /srv/disarchive.  Thus, the database would accumulate tarball
     metadata over time.

  3. Add an nginx route so that /srv/disarchive is served at
     https://disarchive.guix.gnu.org.

  4. Add disarchive.guix.gnu.org to (guix download).

How does that sound?  Thoughts?

Ludo’.


             reply	other threads:[~2021-10-09 10:06 UTC|newest]

Thread overview: 15+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2021-10-09 10:05 Ludovic Courtès [this message]
2021-10-09 10:37 ` Disarchive update Mathieu Othacehe
2021-10-10 13:22   ` Ludovic Courtès
2021-10-12  8:41     ` Mathieu Othacehe
2021-10-14 14:06       ` Ludovic Courtès
2021-10-12  9:19 ` zimoun
2021-10-14 14:02   ` Ludovic Courtès
2021-10-14 19:17     ` zimoun
2021-10-21 19:41       ` Ludovic Courtès
2021-10-21 19:57         ` zimoun
2021-10-13 14:54 ` Timothy Sample
2021-10-14 14:04   ` Ludovic Courtès
2021-10-14 14:31 ` Ludovic Courtès
2021-10-14 21:44   ` zimoun
2021-10-21 19:44   ` Ludovic Courtès

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

  List information: https://guix.gnu.org/

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=87r1cu1pj5.fsf@inria.fr \
    --to=ludovic.courtes@inria.fr \
    --cc=guix-devel@gnu.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
Code repositories for project(s) associated with this public inbox

	https://git.savannah.gnu.org/cgit/guix.git

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).