all messages for Guix-related lists mirrored at yhetil.org
 help / color / mirror / code / Atom feed
From: "Ludovic Courtès" <ludovic.courtes@inria.fr>
To: Lars-Dominik Braun <ldb@leibniz-psychology.org>
Cc: Simon Tournier <zimon.toutoune@gmail.com>,
	 guix-science@gnu.org,
	 Simon Tournier <simon.tournier@u-paris.fr>,
	 lars@6xq.net
Subject: Re: [PATCH] Add draft post "CRAN, a practical example for being reproducible at large scale using GNU Guix".
Date: Tue, 13 Dec 2022 14:53:34 +0100	[thread overview]
Message-ID: <875yefwgtd.fsf@gnu.org> (raw)
In-Reply-To: <Y5BQjYCg37m83zCx@zpidnb93> (Lars-Dominik Braun's message of "Wed, 7 Dec 2022 09:36:29 +0100")

[-- Attachment #1: Type: text/plain, Size: 718 bytes --]

Hello!

Lars-Dominik Braun <ldb@leibniz-psychology.org> skribis:

>> Applied, thanks.  It is under drafts/ [1].  Last round proofread before
>> publishing.  On Friday?
> Friday sounds good. I’m attching minor changes to the synax highlighting.

We missed one Friday but there are plenty coming up.  :-)

As mentioned on #guix-hpc, I think it’d be interesting to add a
reference to https://www.nature.com/articles/s41597-022-01143-6 to
illustrate the rationale.  I think it’s important because R users are
likely to wonder why they’d bother with Guix in the first place.

Here’s a proposal in that direction; feel free to take it, tear it down,
change it, or whatever.

Thanks,
Ludo’.


[-- Attachment #2: Type: text/x-patch, Size: 2196 bytes --]

diff --git a/drafts/reproducible-cran.md b/drafts/reproducible-cran.md
index c691163..28f6108 100644
--- a/drafts/reproducible-cran.md
+++ b/drafts/reproducible-cran.md
@@ -60,6 +60,42 @@ pre-built substitutes to speed up installation times. Additionally,
 reproducing environments would include fewer steps if the package
 recipes were available to anyone by default.
 
+## Why deploy R software with Guix anyway?
+
+At this point, perhaps you're wondering: R is stable, and tools such as
+[Packrat](https://rstudio.github.io/packrat/) let me save and restore
+the exact R package versions I need.  While this might seem “good
+enough”, we can already tell this approach [has a number of
+shortcomings](https://hpc.guix.info/blog/2022/07/is-reproducibility-practical/),
+one of which being that it cannot handle dependencies not written in
+R—such as R itself.
+
+A [study published in *Nature Scientific Data* in February
+2022](https://doi.org/10.1038/s41597-022-01143-6) gives empirical
+insight into this:
+
+> _[We] retrieve and analyze more than 2000 replication datasets with
+> over 9000 unique R files published from 2010 to 2020. Second, we
+> execute the code in a clean runtime environment to assess its ease of
+> reuse. […] We find that 74% of R files failed to complete without
+> error in the initial execution, while 56% failed when code cleaning
+> was applied, showing that many errors can be prevented with good
+> coding practices._
+
+Three fourth of those R packages fail to run out of the box—this is
+huge.  How did the authors re-execute this code?
+
+> _We re-executed R code from each of the replication packages using
+> three R software versions, R 3.2, R 3.6, and R 4.0, in a clean
+> environment._
+
+Despite this guesswork, coupled with automatic “source cleaning”, the
+authors found that most packages still fail to run.
+
+The motivation to deploy R software with Guix becomes clear: it’s the
+ability to automatically redeploy the same software environment, at
+different points in time, on different machines.
+
 ## Introducing guix-cran
 
 GNU Guix provides a mechanism called “channels”,

  reply	other threads:[~2022-12-13 13:56 UTC|newest]

Thread overview: 14+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2022-12-06  7:53 [PATCH] Add draft post "CRAN, a practical example for being reproducible at large scale using GNU Guix" Lars-Dominik Braun
2022-12-06 12:51 ` Simon Tournier
2022-12-07  7:44   ` Efraim Flashner
2022-12-07  8:39     ` Lars-Dominik Braun
2022-12-07 11:11       ` Simon Tournier
2022-12-07  8:36   ` Lars-Dominik Braun
2022-12-13 13:53     ` Ludovic Courtès [this message]
2022-12-13 16:34       ` zimoun
2022-12-16  8:00       ` Lars-Dominik Braun
2022-12-16  8:58         ` Ludovic Courtès
2022-12-17  9:53           ` Lars-Dominik Braun
2022-12-17 11:43             ` Simon Tournier
2022-12-19 15:06               ` Lars-Dominik Braun
2022-12-21 14:43                 ` Ludovic Courtès

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=875yefwgtd.fsf@gnu.org \
    --to=ludovic.courtes@inria.fr \
    --cc=guix-science@gnu.org \
    --cc=lars@6xq.net \
    --cc=ldb@leibniz-psychology.org \
    --cc=simon.tournier@u-paris.fr \
    --cc=zimon.toutoune@gmail.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
Code repositories for project(s) associated with this external index

	https://git.savannah.gnu.org/cgit/guix.git

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.