From: Ricardo Wurmus
To: Liliana Marie Prikler
Cc: gwl-devel@gnu.org
Subject: Re: Processing large amounts of files
Date: Mon, 25 Mar 2024 10:25:22 +0100
Message-ID: <87h6gud5is.fsf@elephly.net>
In-reply-to: <54f697191220794a99dde447f2f2ce56439d8408.camel@ist.tugraz.at>
References: <2010bdb88116d64da3650b06e58979518b2c7277.camel@ist.tugraz.at>
 <87plvjd4el.fsf@elephly.net>
 <54f697191220794a99dde447f2f2ce56439d8408.camel@ist.tugraz.at>

Liliana Marie Prikler writes:

>> When running with "-l all" I see this:
>>
>>   info: .75 Computing workflow `cat'...
>>   debug: 3.13 Computing script for process `meow'
>>   guix: 3.13 Looking up package `bash-minimal'
>>   guix: 3.13 Opening inferior Guix at
>> `/gnu/store/pb1nkrn3sg6a1j6c4r5j2ahygkf4vkv9-profile'
>>   guix: 4.27 Looking up package `guix'
>>   debug: 4.45 Generating all scripts and their dependencies.
>>   debug: 4.89 Generating all scripts and their dependencies.
>>   run: 6.73 Executing: /bin/sh -c
>> /gnu/store/5idhbvhrwj3p53kkz2vikdn1ypncwj84-gwl-meow.scm '((inputs
>> "/tmp/meow/0" ...
>>   process: 8.80 In execvp of /bin/sh: Argument list too long
>>   error: 8.80 Wrong type argument in position 1: #f
>>
>> This at least tells us that the last error here is due to sh refusing
>> to run.
> Good to know, and I thought it'd be just that, but… shouldn't this
> failure to invoke sh be caught through something?

Yes, it really should.  This may be a problem with how we capture stdout
and stderr.  I'll look into it.

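For illustration only, here is a minimal Python sketch of the kind of
check discussed above: telling "the program could not be executed at
all" (for example E2BIG, "Argument list too long") apart from "the
program ran and failed".  This is not GWL code (GWL is written in Guile),
and `run_checked' is a hypothetical helper name.

```python
import errno
import subprocess
import sys


def run_checked(argv):
    """Run argv and return (ok, message) with an explicit failure reason.

    Hypothetical helper for illustration; not part of GWL.
    """
    try:
        result = subprocess.run(argv, capture_output=True, text=True)
    except OSError as err:
        # The exec itself failed, so the program never ran.
        # E2BIG is the "Argument list too long" case from the log.
        if err.errno == errno.E2BIG:
            reason = "argument list too long"
        else:
            reason = err.strerror
        return False, f"could not execute {argv[0]}: {reason}"
    if result.returncode != 0:
        # The program started but reported failure itself.
        return False, (f"{argv[0]} exited with status {result.returncode}: "
                       f"{result.stderr.strip()}")
    return True, result.stdout
```

With a check like this the caller sees the exec failure directly rather
than a later, unrelated-looking error such as a wrong-type complaint
about `#f'.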
>> > For comparison:
>> >   time cat /tmp/meow/{0..7769}
>> >   […]
>> >
>> >   real  0m0,144s
>> >   user  0m0,049s
>> >   sys   0m0,094s
>> >
>> > It takes GWL 6 times longer to compute the workflow than to create
>> > the inputs in Guile, and 600 times longer than to actually execute
>> > the shell command.  I think there is room for improvement :)
>>
>> Yeah, not good.  Do you have any recommendations?
> We already talked about this in response to your second mail, but (LRU)
> Caching of things that can be cached would be an approach to take.
> Perhaps there's also inefficiencies in auto-connecting inputs – not
> exhibited by this example, but thinkable.
>
> Design-wise, we might need a way of splitting large workflows anyhow.
> Files and environment variables work, but feel clunky at the moment,
> and particular files remind me about recursive make… maybe when I get
> the time, I can code something up and then look at ways for
> simplification.

I'd be very happy to see a rough proposal and/or patches.  GWL is
currently unburdened because it has hardly any active/vocal users, so
I'm willing to evolve it in a direction that serves actual users.

-- 
Ricardo
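
On the (LRU) caching suggestion quoted above, a minimal Python sketch of
the general idea, again not GWL code.  It assumes that the same lookup
is repeated many times while a workflow is computed, which has not been
confirmed for GWL; `lookup_package' and `compute_scripts' are
hypothetical stand-ins.

```python
from functools import lru_cache

calls = []  # records every real (uncached) lookup


@lru_cache(maxsize=128)
def lookup_package(name):
    """Stand-in for an expensive package lookup (hypothetical)."""
    calls.append(name)
    return f"package:{name}"


def compute_scripts(process_packages):
    """Resolve the package for each process, reusing cached results."""
    return [lookup_package(name) for name in process_packages]


# 1000 processes that all need the same package, plus one other package.
resolved = compute_scripts(["bash-minimal"] * 1000 + ["guix"])
```

Only two real lookups happen here; the other 999 requests are served
from the cache, and the least recently used entries are evicted once
more than `maxsize' distinct names have been seen.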