Skip to content

01 - Introduction

Benefits of workflow languages

  • Reproducibility
  • Automation
  • Portability
  • Scalability
  • Interpretability

A good paper on Scalable Workflows and Reproducible Data Analysis for Genomics - although genomics focussed, it covers a lot of the concepts we will touch on in this workshop

Benefits of Snakemake

Snakemake

  • Based in the popular (and widely used) programming language, Python
  • Great documentation, actively maintained (but so are the other workflow languages mentioned below)
  • Easier to learn (particularly if you're familiar with python)
  • Flexible

See what other people think:

Other workflow languages

Choose your favourite flavour of workflow language!

The real point is to use a workflow language (where applicable) and just use the flavour you like!

This workshop

This workshop is designed with someone who had some familiarity with the command line. However, I've tried to make it as accessible as possible to anyone who wants to learn Snakemake.

Throughout this workshop, I'll be indicating the code to remove and the code to insert (relative to the previous step) with the following:

- Remove this line of code
+ Add this line of code

However, the actual + and - symbols should not be included in your own code

At each section of the workshop you can find a drop down box under "Current snakefile:" that will contain the main Snakefile that comprises the pipeline as a plain text file to copy and paste from if you need to catch up.


Back to homepage