From: Timothy Sample
To: zimoun
Cc: guix-devel@gnu.org
Subject: Re: Preservation of Guix Report
Date: Thu, 21 Oct 2021 12:26:26 -0400
Message-ID: <87zgr2cpjh.fsf@ngyro.com>
In-Reply-To: <86sfwug72o.fsf@gmail.com> (zimoun's message of "Thu, 21 Oct 2021 09:39:27 +0200")
References: <87o87jjx54.fsf@ngyro.com> <86sfwug72o.fsf@gmail.com>
List-Id: "Development of GNU Guix and the GNU System distribution."

Hi zimoun,

zimoun writes:

> 2. For still unknown reasons, the bridge between SWH and Disarchive has
>    some holes.  For instance,
>
>    $ guix lint -c archive znc
>    gnu/packages/messaging.scm:996:12: znc@1.8.2: Disarchive entry refers to non-existent SWH directory '33a3b509b5ff8e9039626d11b7a800281884cf2a'
>
> [...]
>
> Therefore, something is wrong somewhere.  Because of #1, I detect
> many of such examples.  I do not know if SWH-ID computed by
> Disarchive is incorrect [...].

Bingo!  According to SWH (emphasis mine):

    SWHIDs for contents, directories, revisions, and releases are, *at
    present*, compatible with the Git way of computing identifiers for
    its objects.

This is not true anymore.  As they go on to say:

    Note that Git compatibility is incidental and is not guaranteed to
    be maintained in future versions of this scheme (or Git).

Disarchive does it the Git way, and SWH does something slightly
different.  The SWH hash is 4e58dc09b8362caf1265102130a593b070562a68,
but the Git hash is 33a3b509b5ff8e9039626d11b7a800281884cf2a.  The
difference is that Disarchive, like Git, ignores empty directories.  It
makes sense that an archival project like SWH would not do that, and
they indeed don’t.

Fixing this in Disarchive is going to make a *huge* difference, so that
is now high priority for me (it’s a one line change, but I want to fix
it, release it, update Guix, and recompute the report).

> And answering to your question [3] about “sources.json”, I think the
> ingestion started after this commit
> 35bb77108fc7f2339da0b5be139043a5f3f21493 from guix-artwork.
> Other said,
> SWH started to ingest from “sources.json” after July 2020; probably
> around September 2020.
>
> 3:

Thanks!  While investigating the above problem, I found a page that
lists what SWH is getting from us [1] and another showing when they are
scanning “sources.json” [2].  I don’t know if you’ve seen them before,
but they will be invaluable for figuring this stuff out.

[1] https://archive.softwareheritage.org/browse/origin/branches/?origin_url=https://guix.gnu.org/sources.json
[2] https://archive.softwareheritage.org/browse/origin/visits/?origin_url=https://guix.gnu.org/sources.json

> For the Missing and Unknown fields, could you distinguish the kind of
> origin?  Is it mainly git-fetch or url-fetch or others?

Good idea.  I think I can do this easily enough.  I might shelve it for
a bit, because I’m too excited to update the report with the Disarchive
hash fix.  :)


-- Tim
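
Illustration of the empty-directory difference discussed in this
message.  This is a minimal sketch, not Disarchive's or SWH's actual
code: it hashes an in-memory tree the Git way, with a hypothetical
`keep_empty_dirs` flag.  With the flag off, empty directories are
dropped, as Git does.  With it on, each empty directory is recorded as
an entry pointing at the empty tree, which is assumed here to be how
SWH counts them.  The names `object_id` and `tree_id`, and the
fixed file mode 100644, are illustrative assumptions.

```python
import hashlib


def object_id(kind: str, payload: bytes) -> bytes:
    """SHA-1 of a Git-style object: '<kind> <size>\\0' followed by the payload."""
    header = f"{kind} {len(payload)}".encode("ascii") + b"\0"
    return hashlib.sha1(header + payload).digest()


def tree_id(tree: dict, keep_empty_dirs: bool):
    """Hash a directory given as {name: bytes (file) or dict (subdirectory)}.

    Returns the 20-byte digest, or None when the directory is dropped
    because it holds no files and keep_empty_dirs is False.
    """
    entries = []
    for name, node in tree.items():
        raw = name.encode("utf-8")
        if isinstance(node, dict):
            child = tree_id(node, keep_empty_dirs)
            if child is None:
                continue  # Git-style: a directory with no files leaves no trace
            # Git sorts directory entries as if their name ended with '/'.
            entries.append((raw + b"/", b"40000", raw, child))
        else:
            # Assumption: every file is a regular, non-executable file.
            entries.append((raw, b"100644", raw, object_id("blob", node)))
    if not entries and not keep_empty_dirs:
        return None
    entries.sort(key=lambda entry: entry[0])
    payload = b"".join(
        mode + b" " + raw + b"\0" + oid for _, mode, raw, oid in entries
    )
    return object_id("tree", payload)


if __name__ == "__main__":
    source = {"README": b"hi\n", "empty": {}}
    print("ignoring empty dirs:", tree_id(source, keep_empty_dirs=False).hex())
    print("keeping empty dirs: ", tree_id(source, keep_empty_dirs=True).hex())
```

The two printed identifiers differ for any tree that contains an empty
directory, which is the kind of mismatch `guix lint -c archive` reported
for znc above.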