From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mp0 ([2001:41d0:2:4a6f::]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits)) by ms0.migadu.com with LMTPS id uPhvG/qKqGHOtwAAgWs5BA (envelope-from ) for ; Thu, 02 Dec 2021 09:59:38 +0100 Received: from aspmx1.migadu.com ([2001:41d0:2:4a6f::]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits)) by mp0 with LMTPS id oDYrF/qKqGGfdwAA1q6Kng (envelope-from ) for ; Thu, 02 Dec 2021 08:59:38 +0000 Received: from lists.gnu.org (lists.gnu.org [209.51.188.17]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by aspmx1.migadu.com (Postfix) with ESMTPS id 1B88337CC9 for ; Thu, 2 Dec 2021 09:59:38 +0100 (CET) Received: from localhost ([::1]:58234 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1mshwH-0007s2-5T for larch@yhetil.org; Thu, 02 Dec 2021 03:59:37 -0500 Received: from eggs.gnu.org ([209.51.188.92]:39602) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1mshwB-0007py-Ui; Thu, 02 Dec 2021 03:59:31 -0500 Received: from mail2-relais-roc.national.inria.fr ([192.134.164.83]:49195) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1mshw2-0001Ba-Eg; Thu, 02 Dec 2021 03:59:26 -0500 IronPort-Data: =?us-ascii?q?A9a23=3Awpr1Ca/dKKoduVU6N44IDrUDU3yTJUtcMsCJ2f8?= =?us-ascii?q?bfWQNrUoj3mYHzzcWWG+Pb62LY2qmfo1zOoq09UpTvpPVy99lGldlrnsFo1Bi+?= =?us-ascii?q?ZOUX4zBRqvTF3rPdZObFBoPA/3z27AsFehsJpPnjkrrYueJQUVUj/nSH+OmULS?= =?us-ascii?q?cY0ideCc/IMsfoUM68wIGqt4w6TSJK1vlVeLa+6UzCnf9s9JHGj58B5a4lf9al?= =?us-ascii?q?K+aVAX0EbAJTasjUFf2zxH5BX+ETE27ByOQroJ8RoZWSwtfpYxV8F81/z91Yj+?= =?us-ascii?q?kuqz6eEcNRNY+PyDX2yEQBvDk20Ea4HVois7XN9JFAatTozGUk9dvyd4LvputU?= =?us-ascii?q?xskJYXNnv4cWl9WCUmSOIUZp+6feyfv2SCU5wicG5f2+N18DUQxO8sE/ftrBnx?= =?us-ascii?q?I+NQXLTkMalaIgOfe6L2mS/kpnc8iIc/gMasQvGwmyivWZd4pXJHTBqnH+9Jc9?= =?us-ascii?q?Dg2m4ZJB/m2T9EQbCJrYQjoZRJeIFBRA5U79NpELFGXnyZw8QPO4/dvpTGKlEo?= =?us-ascii?q?oiuCFDTYcQfTSLe09o6pSjjiuE7zFPywn?= IronPort-HdrOrdr: =?us-ascii?q?A9a23=3AsbENE6/XwQRr6dsZC2Nuk+Eidb1zdoMgy1kn?= =?us-ascii?q?xilNoENuHPBwxvrAoB1E73PJYW4qKQgdcKO7SdG9qBLnhOpICOwqVtaftWbdyQ?= =?us-ascii?q?yVxe1ZjbcKzgeLJ8SczJ8r6U4DSdkZNDSYNzETsS+Q2njbLz9U+qjizEnev5a6?= =?us-ascii?q?854Cd3AIV0hn1WpEIzfeNnczaBhNBJI/GpbZzNFAvSCcdXMeadn+LmUZXsDYzu?= =?us-ascii?q?e73a7OUFojPVoK+QOOhTSn5PrRCB6DxCoTVDtJ3PML7XXFqQrk/a+u2svLgiM0?= =?us-ascii?q?llWjpKi+quGRh+erN/b8xvT97Q+cxTpAUb4REYFqegpF7t1Hpmxa0eUk6C1QRP?= =?us-ascii?q?ibo0mhBF1d5yGdrTUImQxelkMK82Xo8kfLsIj8XnY3GsBBjYVWfl/Q7Fchpsh1?= =?us-ascii?q?1OZO03iCv5RaABvclGCljuK4Ii1Chw6xuz4vgOQTh3tQXc8Xb6JQt5UW+AdQHI?= =?us-ascii?q?0bFCz35Yg7GK1lDd3a5vxRbVSGBkqpzFVH0ZipRDA+Dx2GSk8Ntoic1CVXhmlw?= =?us-ascii?q?yw8CyMkWjh47heIAoll/lpX525VT5c9zp5UtHN5A7c86MLSKNlA=3D?= X-IronPort-AV: E=Sophos;i="5.87,281,1631570400"; d="scan'208";a="7790278" Received: from unknown (HELO ribbon) ([193.50.110.120]) by mail2-relais-roc.national.inria.fr with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 02 Dec 2021 09:59:19 +0100 From: =?utf-8?Q?Ludovic_Court=C3=A8s?= To: Timothy Sample Cc: guix-devel@gnu.org, guix-science@gnu.org, zimoun Subject: Re: Software Heritage fifth anniversary event References: <87sfvc4q8j.fsf@inria.fr> <87tufsgq1p.fsf@ngyro.com> X-URL: http://www.fdn.fr/~lcourtes/ X-Revolutionary-Date: 12 Frimaire an 230 de la =?utf-8?Q?R=C3=A9volution?= X-PGP-Key-ID: 0x090B11993D9AEBB5 X-PGP-Key: http://www.fdn.fr/~lcourtes/ludovic.asc X-PGP-Fingerprint: 3CE4 6455 8A84 FDC6 9DB4 0CFB 090B 1199 3D9A EBB5 X-OS: x86_64-pc-linux-gnu Date: Thu, 02 Dec 2021 09:59:18 +0100 In-Reply-To: <87tufsgq1p.fsf@ngyro.com> (Timothy Sample's message of "Wed, 01 Dec 2021 13:04:18 -0500") Message-ID: <87k0gnxtzt.fsf@inria.fr> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/27.2 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Received-SPF: pass client-ip=192.134.164.83; envelope-from=ludovic.courtes@inria.fr; helo=mail2-relais-roc.national.inria.fr X-Spam_score_int: -41 X-Spam_score: -4.2 X-Spam_bar: ---- X-Spam_report: (-4.2 / 5.0 requ) BAYES_00=-1.9, RCVD_IN_DNSWL_MED=-2.3, RCVD_IN_MSPIKE_H2=-0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001 autolearn=ham autolearn_force=no X-Spam_action: no action X-BeenThere: guix-science@gnu.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: guix-science-bounces+larch=yhetil.org@gnu.org Sender: "Guix-Science" X-Migadu-Flow: FLOW_IN X-Migadu-Country: US ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=yhetil.org; s=key1; t=1638435578; h=from:from:sender:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references:list-id:list-help: list-unsubscribe:list-subscribe:list-post; bh=i+cap439vhEMtjrgYcXLWVzH9LOBlcZJ94PKhgSnK4E=; b=qkkC4zX1jzZe3N6nfhjOgLoRA5N7CayrmLMiT0SA2nJUyRmOGV21dEQWiF1bcH2+0OPMMy MBKgEc752iSz56d9+Bbssmru730T8Hz4TmRCOwD9WySDir6WAxkGbfYB7agEXt1CmrROjj 7v8hp1Xd76rE8J3dZyWFQh9OVofVaY4VRQwtbehL2CtdUOr4TYu7TJAB7AO2tNOBkSb+1G Fp8XrnWwTH6+NsShDExU3CbCgCXjGwR+BAibwzHd3DB4csmm9ewmTVyXRMrE9SJ0PXvXLL rIx2N3kxYMEd573xS1X8EWeY+nZUrcTJBuGQKPnPYx/XKQtsLP6vNTOk3y0ztQ== ARC-Seal: i=1; s=key1; d=yhetil.org; t=1638435578; a=rsa-sha256; cv=none; b=IcMr+UmACXMl3bwHUakgqr3sfqfKdlTIOsYAOgIZKz88B6PjRGdIQAHh0Ivwf8hAnRNKJs QeDMfZp90IgC04GubAgz22trf9mEzEnQELlEP4+2BWOzWTkGkdXmooqo1dhUKHz3fYE6gZ Aq1qwy+YyuVkTRPmLv7psc7e6jN8hxXUhedTXxUKh+ZFiZW+iaE3zFcfDHSVY4DUcKy2bn 76Tt1IFbn0inV7eZiL7p1pcAW3CanlcEToeCkIR+hHvdht+EhzEHD/xzc/lhl6usdMuLgQ EhUB/IGWWcupzta99gRX/FjPdXiF9X7c7yRsB00DWfG4LW9VMKIwrZz03oAgUw== ARC-Authentication-Results: i=1; aspmx1.migadu.com; dkim=none; dmarc=none; spf=pass (aspmx1.migadu.com: domain of "guix-science-bounces+larch=yhetil.org@gnu.org" designates 209.51.188.17 as permitted sender) smtp.mailfrom="guix-science-bounces+larch=yhetil.org@gnu.org" X-Migadu-Spam-Score: -1.62 Authentication-Results: aspmx1.migadu.com; dkim=none; dmarc=none; spf=pass (aspmx1.migadu.com: domain of "guix-science-bounces+larch=yhetil.org@gnu.org" designates 209.51.188.17 as permitted sender) smtp.mailfrom="guix-science-bounces+larch=yhetil.org@gnu.org" X-Migadu-Queue-Id: 1B88337CC9 X-Spam-Score: -1.62 X-Migadu-Scanner: scn0.migadu.com X-TUID: GPBzLg2FFjmV Hi! Timothy Sample skribis: > Ludovic Court=C3=A8s writes: [...] >> =E2=80=A2 Disarchive: they=E2=80=99d like to better understand the =E2= =80=9Cunknowns=E2=80=9D in the >> PoG plots (I wasn=E2=80=99t sure if it was non-tar.gz tarballs or wh= at) and >> to work on the definitely-missing origins that show up there; > > Many of the unknowns are there for me to track Disarchive progress. > It=E2=80=99s not really the clearest reporting, but it tracks more what G= uix can > handle automatically than what we could theoretically know about. > Basically something is =E2=80=9Cknown=E2=80=9D if it can be downloaded fr= om upstream, > and either: it=E2=80=99s a non-recursive Git reference; or it=E2=80=99s s= omething > Disarchive can handle. Hence, we know nothing about other version > control systems and, say, =E2=80=9C.tar.bz2=E2=80=9D archives. Also, all= these things > are based on heuristics. :) As we get closer to 100% known, we can > start analyzing everything more closely. Right. Perhaps at some point we can give them (say on swh-devel) this explanation so they have a clearer view of how far Disarchive is from being =E2=80=9Cproduction-ready=E2=80=9D from an SWH perspective. Valentin= of the SWH team played a lot with pristine-tar and I=E2=80=99m sure they=E2=80=99d hav= e useful feedback to give. >> they=E2=80=99re not opposed to the idea of eventually hosting or mai= ntaining >> the Disarchive database (in fact one of the developers thought we >> were hosting it in Git and that as such they were already archiving >> it=E2=80=94maybe we could go back to Git?); > > It=E2=80=99s a possibility, but right now I=E2=80=99m hopeful that the da= tabase will be > in the care of SWH directly before too long. I=E2=80=99d rather wait and= see at > this point. I=E2=80=99m sure we could manage it, but the uncompressed si= ze of > the Disarchive specification of a Chromium tarball is 366M. Storing all > the XZ specifications uncompressed is over 20G. It would be a big Git > repo! Indeed! So, in passing, you=E2=80=99re telling us that xz support is kinda ready, r= ight? :-) Thanks! Ludo=E2=80=99.