An R
pipeline for automatic processing of low-pass whole genome sequencing data to detect copy number variation using the QDNASeq
package.
- Raw read QC (
FastQC
) - Adapter/Quality trimming (
Trimgalore
) - Post trimming QC (
FastQC
) - Alignment (
bwa v0.7.15-r1140
) - Sorting and indexing (
Samtools
) - Copy number calling (
QDNASeq
) - Summary report generation (
R
)
After low-pass whole genome sequencing of a number of samples, a typical workflow will involve
- Creating a design file associating each set of FastQ files with the appropriate sample, genome and annotation information.
- Passing this design file to the main LowPassKaryo_Wrapper.R script which will sanity check the parameters and then handle submission of procesing jobs to your HPC cluster/farm.
- On sucessful completion, the pipeline will produce one pdf file containing QDNASeq copy number profiles for each species included in the processing run and an html report containing primary alignment QC metrics and recording the software versions used.
Details of the local configuration required to set up the pipeline and also instructions on how to subsequently run it may be found in the DOCS/
directory