From mboxrd@z Thu Jan 1 00:00:00 1970 From: Pjotr Prins Subject: Re: GeoIP database redistribution? Date: Tue, 24 Jan 2017 06:06:21 +0000 Message-ID: <20170124060621.GA30899@mail.thebird.nl> References: <8760l7t5ax.fsf@wasp.i-did-not-set--mail-host-address--so-tickle-me> <87k29l63pz.fsf@kirby.i-did-not-set--mail-host-address--so-tickle-me> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Return-path: Received: from eggs.gnu.org ([2001:4830:134:3::10]:51814) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1cVuIG-0004C5-7Y for guix-devel@gnu.org; Tue, 24 Jan 2017 01:09:25 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1cVuIC-00054K-Ak for guix-devel@gnu.org; Tue, 24 Jan 2017 01:09:24 -0500 Received: from mail.thebird.nl ([95.154.246.10]:34396) by eggs.gnu.org with esmtps (TLS1.0:DHE_RSA_AES_256_CBC_SHA1:32) (Exim 4.71) (envelope-from ) id 1cVuIC-000542-4k for guix-devel@gnu.org; Tue, 24 Jan 2017 01:09:20 -0500 Content-Disposition: inline In-Reply-To: <87k29l63pz.fsf@kirby.i-did-not-set--mail-host-address--so-tickle-me> List-Id: "Development of GNU Guix and the GNU System distribution." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: guix-devel-bounces+gcggd-guix-devel=m.gmane.org@gnu.org Sender: "Guix-devel" To: Marius Bakke Cc: guix-devel@gnu.org On Mon, Jan 23, 2017 at 07:17:12PM +0100, Marius Bakke wrote: > ng0 writes: > > > I want to slowly package OONI (https://ooni.torproject.org/). > > One of its dependencies, txtorcon, requires python-geoip which > > depends on geoip-c-api. I've got both covered, but both of them > > want (either to download or to be present) for tests (a/the) > > legacy database file of maxmind. > > > > Question 1: Can we distribute the database in a source? I can't > > access the homepage of maxmind for cloudflare reasons. > > The database is distributed freely under cc-by-sa4.0: > > https://dev.maxmind.com/geoip/legacy/geolite/#License > > So packaging it should be fine. :) This actually raises the issue of packaging large data files (we are getting into TB's). Could there be a way Guix fetches external datasets as part of the distribution? I think that if it is not executable code and SHA values/pfff values match it would be safe to do. Idea? That is a first step towards reproducible analysis. Pj. --