From mboxrd@z Thu Jan 1 00:00:00 1970
From: bill-auger
Subject: Re: Internet Archive APIs useful as fallback?
Date: Wed, 19 Dec 2018 22:00:14 -0500
Message-ID: <20181219220014.6e520026@parabola>
References: <161edf75-57b7-4d5b-8acf-a4c61358add1@riseup.net> <87d0pxwon3.fsf@gnu.org>
In-Reply-To: <87d0pxwon3.fsf@gnu.org>
List-Id: "Development of GNU Guix and the GNU System distribution."
To: guix-devel@gnu.org

On Wed, 19 Dec 2018 15:57:04 +0100 Ludovic Courtès wrote:
> The Internet Archive is not in the business of archiving software, but
> it’d be interesting to see if it archives tarballs that people put on
> “random” web sites

FWIW, The Internet Archive is not *in the business* of *anything* - it
is a charity - but more importantly, this post is referring to the
"Wayback Machine", which is not identical to "The Internet Archive",
but is a very specialized subset of the archive.org service - the
Wayback Machine differs from the main Internet Archive both in function
and scope

namely, the Wayback Machine only caches web pages, it does so in a
semi-automated fashion, carries no metadata, and does not download any
external assets (such as images and tarballs or any external HTML in
frames) unless they are defined explicitly in the HTML of the base web
page (such as data-uri images and other blobs)

the larger Internet Archive is generally useful for anything that is
naturally suitable for archival and it has a specific section/tags for
software; but that is entirely a manual process done by a registered
user; and all items are associated with that registered account
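
for anyone who wants to check whether the Wayback Machine actually holds a copy of a given tarball URL, here is a minimal sketch in Python. it assumes the public "availability" endpoint at https://archive.org/wayback/available and the JSON shape described in its public documentation (an "archived_snapshots" object with an optional "closest" entry); the helper names and the example.org URL are hypothetical, and the sketch only builds the query and parses a response, it does not perform the HTTP request itself

```python
import json
from urllib.parse import urlencode

# Assumed endpoint of the Wayback Machine availability API.
AVAILABILITY_ENDPOINT = "https://archive.org/wayback/available"


def availability_query(url, timestamp=None):
    """Build the query URL asking whether `url` has an archived snapshot.

    `timestamp` is an optional YYYYMMDD[hhmmss] string; when given, the
    service is expected to return the snapshot closest to that date.
    """
    params = {"url": url}
    if timestamp is not None:
        params["timestamp"] = timestamp
    return AVAILABILITY_ENDPOINT + "?" + urlencode(params)


def closest_snapshot(response_text):
    """Return the snapshot URL from an availability response, or None.

    Assumes the response is JSON shaped like:
    {"archived_snapshots": {"closest": {"available": true, "url": "..."}}}
    """
    payload = json.loads(response_text)
    closest = payload.get("archived_snapshots", {}).get("closest")
    if not closest or not closest.get("available"):
        return None
    return closest.get("url")
```

the response text could be fetched with `urllib.request.urlopen(availability_query(...))`; a `None` result would mean the fallback has nothing to offer for that URL, which is the case described above for assets the crawler never fetched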