01 - Introduction¶
Benefits of workflow languages¶
- Reproducibility
- Automation
- Portability
- Scalability
- Interpretability
A good paper on Scalable Workflows and Reproducible Data Analysis for Genomics - although genomics focussed, it covers a lot of the concepts we will touch on in this workshop
Benefits of Snakemake¶
- Based in the popular (and widely used) programming language, Python
- Great documentation, actively maintained (but so are the other workflow languages mentioned below)
- Easier to learn (particularly if you're familiar with python)
- Flexible
See what other people think:
Other workflow languages¶
Choose your favourite flavour of workflow language!
- Common Workflow Language (CWL)
- Nextflow
- Workflow Description Language (WDL)
- Guix Workflow Language (GWL)
The real point is to use a workflow language (where applicable) and just use the flavour you like!
This workshop¶
This workshop is designed with someone who had some familiarity with the command line. However, I've tried to make it as accessible as possible to anyone who wants to learn Snakemake.
Throughout this workshop, I'll be indicating the code to remove and the code to insert (relative to the previous step) with the following:
However, the actual +
and -
symbols should not be included in your own code
At each section of the workshop you can find a drop down box under "Current snakefile:" that will contain the main Snakefile that comprises the pipeline as a plain text file to copy and paste from if you need to catch up.