Skip to content

Latest commit

 

History

History
152 lines (128 loc) · 7.93 KB

README.md

File metadata and controls

152 lines (128 loc) · 7.93 KB

GGR-cwl

Join the chat at https://gitter.im/alexbarrera/GGR-cwl

CWL tools and workflows associated with the Genomics of Gene Regulation (GGR) project

GGR pipelines created using the Common Workflow Language v1.0 specification. The workflows are parametrized with values that best suit the GGR samples, but they can be easily tailored for specific needs.

For a detail User Guide to the CWL workflows, please see the wiki.

If you need help navigating the files generated by the pipelines, have a look at the processed files cheatsheet.

Pipelines

Steps

Pipelines

Steps

Pipelines

Steps

Pipelines

Steps

Pipelines

Steps


Workflow differences legend

Depending on the experiments, there might be small differences in the workflows which will be determined by:

  • All
    • Type of read:
      • SE: Single End reads
      • PE: Paired-End reads
  • ChIP-seq only
    • With or without control. If a control sample is available -with-control or not.
  • RNA-seq only
    • Strand specificity:
      • Unstranded: reads are not strand-specific, is not possible to know from which DNA strand they come.
      • Stranded: reads are strand-specific and can be map to the Watson and Crick strands.
      • Reverse Stranded: reads come from cDNA, which switches the mapping of the forward and reverse strand.
    • Custom SJDB: By default the STAR 2-pass mapping strategy is implemented in which a first pass of STAR is run to generate a large pool of novel splice junctions (referred as SJDB). These junctions are used to generate a genome index which is employed in the mapping step. However, this 2-pass strategy can be skipped, using a custom genome index Because typically this genome would be created with a precomputed SJDB, this option is denoted with -with-sjdb.
  • ATAC-seq only
    • Blacklist removal: whether or not to mask out blacklisted regions.