Skip to content

Commit

Permalink
📚 more doc details
Browse files Browse the repository at this point in the history
  • Loading branch information
dmiller15 committed Jun 11, 2024
1 parent 726080d commit 5371753
Show file tree
Hide file tree
Showing 3 changed files with 25 additions and 7 deletions.
15 changes: 12 additions & 3 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -4,12 +4,21 @@ This repository contains tools and workflows for processing of tumor-only
samples. The Kids First DRC recommends running the tumor only pipeline ONLY
when no matched normal sample is available. If your data has matched normals
we recommend running the [Kids First DRC Somatic Variant
Workflow](https://github.com/kids-first/kf-somatic-workflow) instead.
Workflow](https://github.com/kids-first/kf-somatic-workflow) instead. This
workflow is not a traditional production pipeline run on all data, but rather
is run at the user's request.

When comparing the SNV outputs of this workflow to those of the somatic workflow,
we have found the outputs to be considerably more noisy. To cut down on this
noise, we have included a set of recommended filters for Mutect2 [in our
docs](./docs/MUTECT2_TUMOR_ONLY_FILTERING.md).
noise, we have included some recommended inputs, parameters, and filters for
Mutect2 [in our docs](./docs/MUTECT2_TUMOR_ONLY_FILTERING.md). In short we recommend:
- Restrict the callable regions with a blacklist and Panel of Normals (PON)
- Remove low support reads:
- Allele Depth (AD) == 0: WGS uninformative reads
- Variant Allele Frequency (VAF) < 1%: WXS noise
- Remove potential germline variants: gnomAD AF > 0.00003
- Only keep variants that are PASS
- Rescue any variants that fall in hotspot regions/genes

It can also be used to process PDX data by first pre-processing reads using the
Xenome tool, explained more here in documentation.
Expand Down
2 changes: 1 addition & 1 deletion tools/manta.cwl
Original file line number Diff line number Diff line change
Expand Up @@ -17,7 +17,7 @@ arguments:
- position: 0
shellQuote: false
valueFrom: >-
/manta-1.4.0.centos6_x86_64/bin/configManta.py
/manta-1.4.0.centos6_x86_64/bin/configManta.py --runDir=./
- position: 9
shellQuote: false
valueFrom: >-
Expand Down
15 changes: 12 additions & 3 deletions workflows/kfdrc_tumor_only_dna_wf.cwl
Original file line number Diff line number Diff line change
Expand Up @@ -9,12 +9,21 @@ doc: |
samples. The Kids First DRC recommends running the tumor only pipeline ONLY
when no matched normal sample is available. If your data has matched normals
we recommend running the [Kids First DRC Somatic Variant
Workflow](https://github.com/kids-first/kf-somatic-workflow) instead.
Workflow](https://github.com/kids-first/kf-somatic-workflow) instead. This
workflow is not a traditional production pipeline run on all data, but rather
is run at the user's request.
When comparing the SNV outputs of this workflow to those of the somatic workflow,
we have found the outputs to be considerably more noisy. To cut down on this
noise, we have included a set of recommended filters for Mutect2 [in our
docs](./docs/MUTECT2_TUMOR_ONLY_FILTERING.md).
noise, we have included some recommended inputs, parameters, and filters for
Mutect2 [in our docs](./docs/MUTECT2_TUMOR_ONLY_FILTERING.md). In short we recommend:
- Restrict the callable regions with a blacklist and Panel of Normals (PON)
- Remove low support reads:
- Allele Depth (AD) == 0: WGS uninformative reads
- Variant Allele Frequency (VAF) < 1%: WXS noise
- Remove potential germline variants: gnomAD AF > 0.00003
- Only keep variants that are PASS
- Rescue any variants that fall in hotspot regions/genes
It can also be used to process PDX data by first pre-processing reads using the
Xenome tool, explained more here in documentation.
Expand Down

0 comments on commit 5371753

Please sign in to comment.