From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mp2.migadu.com ([2001:41d0:403:58f0::]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits)) by ms8.migadu.com with LMTPS id mCNUBFanqmVKUwAAe85BDQ:P1 (envelope-from ) for ; Fri, 19 Jan 2024 17:46:14 +0100 Received: from aspmx1.migadu.com ([2001:41d0:403:58f0::]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits)) by mp2.migadu.com with LMTPS id mCNUBFanqmVKUwAAe85BDQ (envelope-from ) for ; Fri, 19 Jan 2024 17:46:14 +0100 X-Envelope-To: larch@yhetil.org Authentication-Results: aspmx1.migadu.com; dkim=none; spf=pass (aspmx1.migadu.com: domain of "guix-devel-bounces+larch=yhetil.org@gnu.org" designates 209.51.188.17 as permitted sender) smtp.mailfrom="guix-devel-bounces+larch=yhetil.org@gnu.org"; dmarc=none ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=yhetil.org; s=key1; t=1705682774; h=from:from:sender:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type:in-reply-to:in-reply-to: references:references:list-id:list-help:list-unsubscribe: list-subscribe:list-post; bh=JvHruct8fWzbRXAoq/9Ax2SgWQICHZFrwzt23u+8vAY=; b=qZRXTp2qHc9Txhv8abFWj4f5KlsuhE4uFBi/SxTI1piiJhtPZ+nGOd5Gn5whtmQ5Pn9tfH qQ/1rCReXoK/jk4p0AMffkU0Jm8kuzNE0NDYRN9eXGH/ae2U1IrdMNLdT+tkDF/lTwBEu+ Zh0jLC7JwEgl50BVaRMwGv0QGprCKO2X5x+WdG+lN5GsFq9d2yVuWIjev5EdRrOs7qVpIa BS9Nci7LBnVL1I6u4tmMbZSx9LWTGldYK6x+uch8ooS7RFbgh+TPn16/rifK5u9D26t/qG nb0Vphb+NPUKfAZJ5TUkO1J7KWim7oftOjre3SelZYh20bwCU+usoM9xT8kS9w== ARC-Authentication-Results: i=1; aspmx1.migadu.com; dkim=none; spf=pass (aspmx1.migadu.com: domain of "guix-devel-bounces+larch=yhetil.org@gnu.org" designates 209.51.188.17 as permitted sender) smtp.mailfrom="guix-devel-bounces+larch=yhetil.org@gnu.org"; dmarc=none ARC-Seal: i=1; s=key1; d=yhetil.org; t=1705682774; a=rsa-sha256; cv=none; b=KYtMWdqEse/VybNFy2Y90W6mmnlYp2UtN2bqHPI/cHLtxuARbGV6Wn88uRoAqYe512vqHW 8Vxq8RmKNYZ7S7XLH8IXiA8B0L+XsrOvr8APLYY1jez63ZlPzXNKzOadG5W4IlXNhovhZ4 b0t0oM0rhJTLRDUqOCNHaOZ8No510UDEuZ0Uy0Q6Ya8TLVCVVY7ditkc+6j8L33aAG6TlC Ly2YvBDKNPj5FNV7mcDBVXodnfkNsVj0hW7Hb3cnLYdVCyuqT1s8gZ7LRS3nbQEWsuVooA amjoklqcC2ro9BTMd2RhWeWZ4YWUvnjboJL/x+16ertTD9XpiRlf5dgpCQMSFw== Received: from lists.gnu.org (lists.gnu.org [209.51.188.17]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by aspmx1.migadu.com (Postfix) with ESMTPS id E9BEC43BEC for ; Fri, 19 Jan 2024 17:46:13 +0100 (CET) Received: from localhost ([::1] helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1rQrz1-00012p-9T; Fri, 19 Jan 2024 11:44:43 -0500 Received: from eggs.gnu.org ([2001:470:142:3::10]) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1rQryz-00012e-UU for guix-devel@gnu.org; Fri, 19 Jan 2024 11:44:41 -0500 Received: from mail-wm1-f52.google.com ([209.85.128.52]) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.90_1) (envelope-from ) id 1rQryx-0002Zq-Se for guix-devel@gnu.org; Fri, 19 Jan 2024 11:44:41 -0500 Received: by mail-wm1-f52.google.com with SMTP id 5b1f17b1804b1-40ea084ec14so4150015e9.2 for ; Fri, 19 Jan 2024 08:44:39 -0800 (PST) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1705682678; x=1706287478; h=mime-version:message-id:date:references:in-reply-to:subject:cc:to :from:x-gm-message-state:from:to:cc:subject:date:message-id:reply-to; bh=JvHruct8fWzbRXAoq/9Ax2SgWQICHZFrwzt23u+8vAY=; b=Nj/wU1zwtvuLXeFvMaQdxZl4YNWirRVF7m4pgRu14zHSycgWbdP4mwsgWlzOQ4ldZp iVzaRPe4/o9bjFGQrn71rmdf+xwzDTYoXWiHBbXXAuLcEAC5z0uXNx67XcjZgYLvYJ6+ jjKPgqi2PcjRgrigjRdBTPwlsvphCvpFWlITkmHRGOpbj1CtYkmw9y3R2VFavjb2yEC2 8Ew7NgKnmTZL005A3QVvNXvsmdg5goUcXHXL04+rbB3sN0DbFeH4pAeYtRhj15nfMbaw XR8X1uxQihf1IqFCWLWwk0OrjjhO9NsEkiaT1JyVwpuRtAQWzjWVoyrdwg8uBkOcf+ro edRg== X-Gm-Message-State: AOJu0YzBroaDn2Ni5zgoZJYFaax9WCHWFrErAaTQ8R3CT0VRrk07wAdk IGStHmBjoYyI//pIwjIMesq8O9lOLiH6M6nDoLYi1WWjeJDXdqCg X-Google-Smtp-Source: AGHT+IEz/aZtB/S21MlStWNekPU2tZTPnKeEaV7dBvv8P5WQSzeYbwlRvwTnCmyBI6MvnpHo51CNDw== X-Received: by 2002:a1c:7404:0:b0:40e:6b8e:5ab2 with SMTP id p4-20020a1c7404000000b0040e6b8e5ab2mr19383wmc.106.1705682677655; Fri, 19 Jan 2024 08:44:37 -0800 (PST) Received: from yavin4 ([2a01:e0a:1c8:8a40:ae77:249:1184:9e74]) by smtp.gmail.com with ESMTPSA id n15-20020a05600c3b8f00b0040d5a5c523csm33778385wms.1.2024.01.19.08.44.36 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Fri, 19 Jan 2024 08:44:37 -0800 (PST) From: "Antoine R. Dumont (@ardumont)" To: Timothy Sample Cc: Simon TOURNIER , "swh-devel@inria.fr" , "guix-devel@gnu.org" , "ludovic.courtes" , julien@malka.sh Subject: Re: [swh-devel] Call for public review - SWH Nix/GNU Guix stack In-Reply-To: <871qahc8rr.fsf@ngyro.com> References: <87o7dnq85z.fsf@gmail.com> <871qahc8rr.fsf@ngyro.com> Date: Fri, 19 Jan 2024 17:44:36 +0100 Message-ID: <87zfx1nuh7.fsf@gmail.com> MIME-Version: 1.0 Content-Type: multipart/signed; boundary="=-=-="; micalg=pgp-sha512; protocol="application/pgp-signature" Received-SPF: pass client-ip=209.85.128.52; envelope-from=antoine.romain.dumont@gmail.com; helo=mail-wm1-f52.google.com X-Spam_score_int: -13 X-Spam_score: -1.4 X-Spam_bar: - X-Spam_report: (-1.4 / 5.0 requ) BAYES_00=-1.9, FREEMAIL_FORGED_FROMDOMAIN=0.249, FREEMAIL_FROM=0.001, HEADER_FROM_DIFFERENT_DOMAINS=0.248, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H2=-0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01 autolearn=no autolearn_force=no X-Spam_action: no action X-BeenThere: guix-devel@gnu.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: "Development of GNU Guix and the GNU System distribution." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: guix-devel-bounces+larch=yhetil.org@gnu.org Sender: guix-devel-bounces+larch=yhetil.org@gnu.org X-Migadu-Flow: FLOW_IN X-Migadu-Country: US X-Migadu-Spam-Score: -5.85 X-Migadu-Queue-Id: E9BEC43BEC X-Spam-Score: -5.85 X-Migadu-Scanner: mx11.migadu.com X-TUID: lC+L9FmQRlvU --=-=-= Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Hello, > Is that because the changes you describe were done after the staging > data was loaded or is it a bug? Our staging instance inherits its append-only property from our main archive. In the staging case (for "prototypes", soon-to-be-deployed new feature or so), that makes it hard to see through the "old bug" noise. It's old origins that were ingested initially with a first version of the lister (which got iteratively fixed). =2D--- @anlambert made a pass this week in docker (from scratch) to check (thx ;) > Excellent! I believe this addresses a problem we recently reported > regarding tarballs published with our own content-addressed URLs, which > look like: > > https://bordeaux.guix.gnu.org/file/BiocNeighbors_1.20.0.tar.gz/sha256/0= a5wg099fgwjbzd6r3mr4l02rcmjqlkdcz1w97qzwx1mir41fmas As a result, he actually enhanced the listing so the urls mentioned earlier ^ is treated correctly out of the data in the url. (@me That needs a bump in deployment [for next week]) Early on, I was referring to another heuristic using a HEAD query to parse header informations [if any]. As that specific url does not provide any, so it passed through. =2D--- Note: cc-ed julien@malka.sh instead of community@nixos.org (as asked in the thread) Cheers, =2D- tony / Antoine R. Dumont (@ardumont) =2D---------------------------------------------------------------- gpg fingerprint BF00 203D 741A C9D5 46A8 BE07 52E2 E984 0D10 C3B8 Timothy Sample writes: > Hello, > > This is very exciting work, thanks everyone! > > "Antoine R. Dumont (@ardumont)" writes: > >> FWIW, in the "new" lister [1] implementation, there are a bunch of extra >> computations done [1] to try and resolve those situations. It's trying >> to fetch more information from upstream server (e.g. crates urls which >> ends in /download, ...) now. It's probably not exhaustive though. >> >> [1] https://gitlab.softwareheritage.org/swh/devel/swh-lister/-/blob/mast= er/swh/lister/nixguix/lister.py?ref_type=3Dheads > > I was just looking over some of the new results and noticed that crates > are being treated as =E2=80=98content=E2=80=99 rather than =E2=80=98tarba= ll-directory=E2=80=99. E.g.: > > https://webapp.staging.swh.network/browse/content/sha1_git:e05b33b2d3b402= 54ceaaa5fe4c501d1b15c75ea6/?origin_url=3Dhttps://crates.io/api/v1/crates/di= ff/0.1.12/download > > Is that because the changes you describe were done after the staging > data was loaded or is it a bug? > > > -- Tim --=-=-= Content-Type: application/pgp-signature; name="signature.asc" -----BEGIN PGP SIGNATURE----- iQJSBAEBCgA8FiEEvwAgPXQaydVGqL4HUuLphA0Qw7gFAmWqpvQeHGFyZHVtb250 QHNvZnR3YXJlaGVyaXRhZ2Uub3JnAAoJEFLi6YQNEMO4pZ0QALm9tiMlnrYF3oA2 gsiVY8dTGbqYrPWLiE+vVhk9CnNI1sbxmMGgatA8id7pXhyPyHQ6NxPx9oiX9ZWz HC2Q8aXdZbxDQgb2jfIbVSeckbHKR/XAZMfVSASE88otegfc60qS/gPCxxUOWlEN hUAoRgHfkKyeA1Y5IZZuNFtGtbFP8rXAPyxm3hKSZLWYQnBCEG6VaKwYHwDFu2R0 QxIfqOC/CM/cJU2SqItTMR9yLxQuKkGrOcxjeL0iX5rKulG5WFLRNnss+QmJLJ8g JuApHJLOlZF3Sy4xNHHLzsLwu9P1PmWCXHQmR05fZCOjDRjdKuzzyPQ/EAiWE1dD OjOEFx14l6LbTyjjqCwDNWEGHIkeAIkKWqVLQsf4iwtMeHPMJMhR2I3F23T5V8sY ZmPcV5RM7/JiyPPV9eCqKw1SYk/5CLTIwXIoCwVz15p0+isP0AJHBzEUHnvhsxD2 GvucSS8+NNPnYdNAxC8dVQfz9HAO/0BV0qQI8owRct81IbTYuCvTJ4cSK/xxjslP GKpC9w7IdNoz1edtuLNSf0nRSZlEpM8yOskCGw6c3Z0OwoNxUJi5KGuBYxalmWA+ rhMAjghDE41j9f/mSF3fjoUe3z2xkY/q+nCpNa0AjBruqUWqWzjfz4vMR1pTWrwB p1mZTVyLi8bKiglgrG6+H+9XDIv1 =J78c -----END PGP SIGNATURE----- --=-=-=--