From: Simon Tournier
To: Ryan Prior, Nicolas Graves, "licensing@fsf.org"
Cc: guix-devel@gnu.org
Subject: Re: Guidelines for pre-trained ML model weight binaries (Was re: Where should we put machine learning model parameters?)
Date: Thu, 06 Apr 2023 10:42:00 +0200
Message-ID: <868rf5e71j.fsf@gmail.com>

Hi,

On Mon, 03 Apr 2023 at 18:07, Ryan Prior wrote:

> Hi there FSF Licensing! (CC: Guix devel, Nicholas Graves) This morning
> I read through the FSDG to see if it gives any guidance on when
> machine learning model weights are appropriate for inclusion in a free
> system. It does not seem to offer much.

Years ago, I asked the FSF and Stallman how to deal with that, and I
never got an answer back. Anyway! :-)

Debian folks have discussed this topic [1,2], but I do not know whether
they have an “official” policy.

I remember that we discussed a similar topic on guix-devel or
guix-patches some years ago, but I cannot find the thread.

For what my opinion is worth, I think that machine learning model
weights should be considered as any other data (images, text files,
translated strings, etc.)
and thus they are appropriate for inclusion or not depending on whether
their license is compliant.

Since it is computing, we could ask about the bootstrap of such
generated data. I think it is a slippery slope because re-training is
not affordable in many cases: (1) in practice, we would not have the
hardware resources, and (2) it is almost impossible to tackle the
sources of nondeterminism (the optimization is too entangled with
randomness). From my point of view, pre-trained weights should be
considered the output of a (numerical) experiment, just as we include
other experimental data (from genome to astronomy datasets).

1: https://salsa.debian.org/deeplearning-team/ml-policy
2: https://people.debian.org/~lumin/debian-dl.html

Cheers,
simon