From: zimoun <zimon.toutoune@gmail.com>
To: Roel Janssen <roel@gnu.org>
Cc: Guix Devel <guix-devel@gnu.org>
Subject: Re: GWL pipelined process composition ?
Date: Wed, 18 Jul 2018 23:55:25 +0200
Message-ID: <CAJ3okZ39W5vikMP0_zjooO-8VmobaXoLCZN-nCk7ygNe29bSyg@mail.gmail.com>
In-Reply-To: <87601cxxjo.fsf@gnu.org>
Hi Roel,
Thank you for all your comments.
> Maybe we can come up with a convenient way to combine two processes
> using a shell pipe. But this needs more thought!
Yes, from my point of view, the classic shell pipe `|` has two strong
limitations for workflows:
1. it does not compose at the 'process' level but at the 'procedure' level;
2. it cannot deal with two inputs.
To illustrate point 1, it seems to me more in the "functional spirit"
to write one process/task/unit for "samtools view" and another one for
compressing with "gzip -c". Then, if you have a process that filters
some fastq, you can easily reuse the compress process and compose it
with the filter. For more complicated workflows, such as RNAseq, this
kind of composition seems to be an advantage.
To illustrate point 2, I cannot do the following with a shell pipe:
dd if=/dev/urandom of=file1 bs=1024 count=1k
dd if=/dev/urandom of=file2 bs=1024 count=2k
tar -cvf file.tar file1 file2
or any other process in place of `dd`, which is perhaps not the right
example here.
To be clear, I mean:
a process that outputs fileA
a process that outputs fileB
a process that inputs fileA *and* fileB
without writing fileA and fileB to disk.
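For what it is worth, here is a minimal sketch of that shape in plain
POSIX shell, using named pipes (FIFOs) instead of `|`. It is only an
illustration, not something GWL provides: the two `printf` commands and
`paste` are stand-ins I picked for the two producers and the consumer,
and the FIFO names are arbitrary. The FIFOs appear as entries in a
temporary directory, but the data flowing through them is never stored
in regular files.

```shell
#!/bin/sh
# Sketch: two producers feed one consumer through named pipes (FIFOs).
# The producers and the consumer are stand-ins chosen for illustration.
set -eu

workdir=$(mktemp -d)
trap 'rm -rf "$workdir"' EXIT

# The FIFOs play the role of fileA and fileB.
mkfifo "$workdir/fileA" "$workdir/fileB"

# Each producer runs in the background and blocks until the consumer
# opens the corresponding FIFO for reading.
printf 'a1\na2\n' > "$workdir/fileA" &
printf 'b1\nb2\n' > "$workdir/fileB" &

# The consumer takes fileA *and* fileB as inputs.
result=$(paste -d, "$workdir/fileA" "$workdir/fileB")
wait

printf '%s\n' "$result"
```

This only shows that the two-input case can be expressed by hand; the
bookkeeping (creating the FIFOs, backgrounding, cleanup) is exactly the
kind of thing a workflow language could hide.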
> If you have an idea to improve on this, please do share. :-)
I do not know where to look. :-)
Any ideas?
All the best,
simon