From mboxrd@z Thu Jan 1 00:00:00 1970 From: swedebugia Subject: Re: Internet Archive APIs useful as fallback? Date: Thu, 20 Dec 2018 05:42:17 +0100 Message-ID: <07da678d-5a4e-1b8b-38c5-b286ed284a8f@riseup.net> References: <161edf75-57b7-4d5b-8acf-a4c61358add1@riseup.net> <87d0pxwon3.fsf@gnu.org> <20181219220014.6e520026@parabola> Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 7bit Return-path: Received: from eggs.gnu.org ([2001:4830:134:3::10]:48290) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1gZq3u-0002ku-Q7 for guix-devel@gnu.org; Wed, 19 Dec 2018 23:35:55 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1gZq3q-0001mW-QZ for guix-devel@gnu.org; Wed, 19 Dec 2018 23:35:54 -0500 Received: from mx1.riseup.net ([198.252.153.129]:37801) by eggs.gnu.org with esmtps (TLS1.0:DHE_RSA_AES_256_CBC_SHA1:32) (Exim 4.71) (envelope-from ) id 1gZq3q-0001lR-JG for guix-devel@gnu.org; Wed, 19 Dec 2018 23:35:50 -0500 In-Reply-To: <20181219220014.6e520026@parabola> Content-Language: en-US List-Id: "Development of GNU Guix and the GNU System distribution." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: guix-devel-bounces+gcggd-guix-devel=m.gmane.org@gnu.org Sender: "Guix-devel" To: bill-auger , guix-devel@gnu.org On 2018-12-20 04:00, bill-auger wrote: snip > > namely, the Wayback Machine only caches web pages, it does so in a > semi-automated fashion, carries no metadata, and does not download any > external assets (such as images and tarballs or any external HTML in > frames) unless they are defined explicitly in the HTML of the base web > page (such as data-uri images and other blobs) This is no longer the case. I found that it most of the times archives blobs too. As an aside, if we need it (and we do/did in the case of bzip2) we could consider helping them raise funds (they have their yearly fundraiser going now) -- Cheers Swedebugia