Ludovic Courtès writes: > Pjotr Prins skribis: > >> On Mon, Jan 23, 2017 at 07:17:12PM +0100, Marius Bakke wrote: >>> ng0 writes: >>> >>> > I want to slowly package OONI (https://ooni.torproject.org/). >>> > One of its dependencies, txtorcon, requires python-geoip which >>> > depends on geoip-c-api. I've got both covered, but both of them >>> > want (either to download or to be present) for tests (a/the) >>> > legacy database file of maxmind. >>> > >>> > Question 1: Can we distribute the database in a source? I can't >>> > access the homepage of maxmind for cloudflare reasons. >>> >>> The database is distributed freely under cc-by-sa4.0: >>> >>> https://dev.maxmind.com/geoip/legacy/geolite/#License >>> >>> So packaging it should be fine. :) >> >> This actually raises the issue of packaging large data files (we are >> getting into TB's). Could there be a way Guix fetches external >> datasets as part of the distribution? I think that if it is not >> executable code and SHA values/pfff values match it would be safe to >> do. > > If we do add something this big, we’ll have to make sure it’s not > substitutable and does not ever land on the build machines. Just to note, I don't think Pjotr was referring to this particular package (geolite database). I just checked, and the City DB is 19.8MiB compressed and 104.1 MiB uncompressed. The Country DB is ~2MiB and the IPv6 databases are tiny.