From: Nicolas Graves via "Development of GNU Guix and the GNU System distribution."
Reply-to: Nicolas Graves
To: guix-devel@gnu.org
Subject: Where should we put machine learning model parameters ?
Date: Mon, 03 Apr 2023 18:48:12 +0200
Message-ID: <87jzyshpyr.fsf@ngraves.fr>
List-Id: "Development of GNU Guix and the GNU System distribution."

Hi Guix!

I've recently contributed packages for a few OSS machine learning programs, namely nerd-dictation for dictation and llama-cpp as a conversational bot.

In the first case, I would also like to contribute the parameters of some localized models so that they can be used more easily through Guix. I already raised this subject when submitting those patches, but did not get a clear answer.

In the case of nerd-dictation, the model parameters that can be used are listed here: https://alphacephei.com/vosk/models

One caveat is that packaging all of these models could take up a lot of space on the servers, a cost that buys little because no real build step is needed (only an unzip step). For such packages, we can use the #:substitutable? #f flag.

You can find an example of some of these packages here: https://git.sr.ht/~ngraves/dotfiles/tree/main/item/packages.scm

So my question is: should we add this type of model as packages in Guix? If yes, where should we put them? In machine-learning.scm? In a new file machine-learning-models.scm (such a file would never need new modules, and it might avoid some confusion between the tools and the parameters needed to use the tools)?

-- 
Best regards,
Nicolas Graves
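
For illustration, a model package of the kind discussed above might look roughly like the sketch below. It is not taken from the linked packages.scm. The package name, the choice of model and version, the install directory and the license field are assumptions made for the example, and the hash is a placeholder that has to be replaced with the real one. It also assumes that copy-build-system accepts #:substitutable? in the same way gnu-build-system does.

```scheme
;; Hypothetical sketch of a "parameters only" package: fetch a zip, unpack
;; it, copy the files into the store.  No compilation is involved.
(use-modules (guix packages)
             (guix download)
             (guix build-system copy)
             ((guix licenses) #:prefix license:))

(define vosk-model-small-en-us          ;hypothetical package name
  (package
    (name "vosk-model-small-en-us")
    (version "0.15")                    ;assumed model version
    (source
     (origin
       ;; url-fetch/zipbomb downloads the zip and unpacks it into a
       ;; directory, which covers the "unzip step".
       (method url-fetch/zipbomb)
       (uri (string-append "https://alphacephei.com/vosk/models/"
                           "vosk-model-small-en-us-" version ".zip"))
       (sha256
        ;; PLACEHOLDER hash, not the real one.
        (base32
         "0000000000000000000000000000000000000000000000000000"))))
    (build-system copy-build-system)
    (arguments
     ;; Assumption: copy-build-system honors #:substitutable?.
     ;; The install directory is an arbitrary choice for this example.
     '(#:substitutable? #f
       #:install-plan '(("." "share/vosk/models/"))))
    (home-page "https://alphacephei.com/vosk/models")
    (synopsis "Small English model parameters for Vosk (example)")
    (description
     "Example package shipping pre-trained speech recognition model
parameters for use with Vosk-based tools such as nerd-dictation.")
    ;; Assumed license; check the model listing before relying on it.
    (license license:asl2.0)))

;; Returning the package lets the file be used with 'guix build -f'.
vosk-model-small-en-us
```

Saved as, say, vosk-model.scm, this could be tried with "guix build -f vosk-model.scm". The build is expected to stop with a hash mismatch until the placeholder is replaced, for instance with the value printed by "guix download" on the same URL.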