Skip to content

Commit

Permalink
Merge pull request #183 from planemo-autoupdate/workflows/epigenetics…
Browse files Browse the repository at this point in the history
…/atacseq

Updating workflows/epigenetics/atacseq from 0.4 to 0.5
  • Loading branch information
lldelisle authored Aug 31, 2023
2 parents a2bf374 + cde406f commit 9c47555
Show file tree
Hide file tree
Showing 5 changed files with 494 additions and 113 deletions.
6 changes: 5 additions & 1 deletion workflows/epigenetics/atacseq/.dockstore.yml
Original file line number Diff line number Diff line change
@@ -1,7 +1,11 @@
version: 1.2
workflows:
- name: main
primaryDescriptorPath: /atacseq.ga
subclass: Galaxy
publish: true
primaryDescriptorPath: /atacseq.ga
testParameterFiles:
- /atacseq-tests.yml
authors:
- name: Lucille Delisle
orcid: 0000-0002-1964-4960
10 changes: 10 additions & 0 deletions workflows/epigenetics/atacseq/CHANGELOG.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,15 @@
# Changelog

## [0.5] 2023-03-17

### Automatic update
- `toolshed.g2.bx.psu.edu/repos/devteam/picard/picard_MarkDuplicates/2.18.2.3` was updated to `toolshed.g2.bx.psu.edu/repos/devteam/picard/picard_MarkDuplicates/2.18.2.4`
- `toolshed.g2.bx.psu.edu/repos/iuc/bedtools/bedtools_coveragebed/2.30.0` was updated to `toolshed.g2.bx.psu.edu/repos/iuc/bedtools/bedtools_coveragebed/2.30.0+galaxy1`
- `toolshed.g2.bx.psu.edu/repos/lparsons/cutadapt/cutadapt/4.0+galaxy1` was updated to `toolshed.g2.bx.psu.edu/repos/lparsons/cutadapt/cutadapt/4.4+galaxy0`

### Manual update
- add normalization steps for coverage

## [0.4] 2023-01-16

### Automatic update
Expand Down
7 changes: 5 additions & 2 deletions workflows/epigenetics/atacseq/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -11,6 +11,7 @@ You can have more information about ATAC-seq analysis in the [slides](https://tr

- reference_genome: this field will be adapted to the genomes available for bowtie2 and the genomes available for bedtools slopbed (dbkeys table)
- effective_genome_size: this is used by macs2 and may be entered manually (indications are provided for heavily used genomes)
- bin_size: this is used when normalization of coverage is performed. Large values will allow to have smaller output files but with less resolution while small values will increase computation time and size of output files to produce more resolutive bigwigs.

## Processing

Expand All @@ -21,13 +22,15 @@ You can have more information about ATAC-seq analysis in the [slides](https://tr
- The BAM is converted to BED to enable macs2 to take both pairs into account.
- The peaks are called with macs2 which at the same time generates a coverage file.
- The coverage file is converted to bigwig
- The amount of reads 500bp from summits and the total number of reads are computed if further normalization is wanted.
- The amount of reads 500bp from summits and the total number of reads are computed.
- Two normalizations are computed:
- By million reads
- By million reads in peaks (500bp from summits)
- Other QC are performed:
- A histogram with fragment length is computed.
- The evaluation of percentage of reads to chrM or MT is computed.
- A multiQC is run to have an overview of the QC.

### Warning

- The coverage output is not normalized.
- The `reference_genome` parameter value is used to select references in bowtie2 and bedtools slopbed. Only references that are present in bowtie2 **and** bedtools slopbed are selectable. If your favorite reference genome is not available ask your administrator to make sure that each bowtie2 reference has a corresponding len file for use in bedtools slopbed.
19 changes: 17 additions & 2 deletions workflows/epigenetics/atacseq/atacseq-tests.yml
Original file line number Diff line number Diff line change
Expand Up @@ -17,7 +17,8 @@
location: https://zenodo.org/record/3862793/files/SRR891268_chr22_enriched_R2.fastq.gz
filetype: fastqsanger
reference_genome: hg19
effective_genome_size: '2700000000'
effective_genome_size: 2700000000
bin_size: 1000
outputs:
mapping stats:
element_tests:
Expand Down Expand Up @@ -82,6 +83,20 @@
asserts:
has_line:
line: "10167"
bigwig_norm:
element_tests:
SRR891268_chr22_enriched:
asserts:
has_size:
value: 1130014
delta: 100000
bigwig_norm2:
element_tests:
SRR891268_chr22_enriched:
asserts:
has_size:
value: 1133795
delta: 100000
'MultiQC on input dataset(s): Stats':
element_tests:
bowtie2:
Expand All @@ -91,7 +106,7 @@
cutadapt:
asserts:
has_line:
line: "SRR891268_chr22_enriched_2\t4.0\t285247\t41011\t40415\t4283\t280964\t28524700\t480633\t27163516\t4.771948521807416"
line: "SRR891268_chr22_enriched_2\t4.4\t285247\t41011\t40415\t4283\t280964\t28524700\t480633\t27163516\t4.771948521807416"
general_stats:
asserts:
has_text:
Expand Down
Loading

0 comments on commit 9c47555

Please sign in to comment.