Here are scripts initially written for Postzygotic Amplicon Sequencing for Mosaicism (PASM), Targeted Amplicon Sequencing (TAS/TASeq), and some codes for the method we now define as Massive Parallel Amplicon Sequencing (MPAS). We provided a perl+R version, two standalone python versions, and a Snakemake pipeline. The scripts and pipelines are useful for the calculation of variant allelic fraction (AF) based on not only amplicon based deep sequencing data, but also the AF estimation as well as annotations for postzygotic mosaic variant studies from all kinds of Next Generation Sequencing (NGS) data.
For the calculation of confidence intervals, you can choose exact binomial confidence interval in R (Clopper-Pearson interval by default), or the iterative methods which considers the base qualities from each base, described in Xu , Yang, and Wu et al. Wei and Zhang. 2015 and Yang and Liu et al. Wu, Wei, and Zhang. 2017, different versions of scripts are available.
A Snakemake pipeline with exact binomial CIs and detailed annotations was implemented by Xin Xu and Xiaoxu Yang, with great input form Martin Breuss, the pipeline is based on the Python scripts written by Xin Xu and a previous Snakemake pipeline written by Martin Breuss and Renee D. George. (2019-08-12)
A new python version with exact binomial CIs was implemented by Xin Xu supervised by Xiaoxu Yang (2019-07-24) and fixed by Jiawei Shen (2022-04-22).
A python version was implemented by Xianing Zheng supervised by Xiaoxu Yang. (2016-04-17)
Note that the Perl package Statistics::R is used to call the yyxMosaicHunter package in R written by Adam Yongxin Ye.
Dependencies of yyxMosaicHunter 0.1.4 are: Rcpp
pryr
. (2014-11-11)
The first part of the perl version is a pileup filter, it takes in SAMTools mpileup results and calculate different characters to count the bases, written by Jiarui Li, modified by Xiaoxu Yang and Xianing Zheng. (2015-03-24)
You can also only output the base qualtiy and deal with the base qualities in R.
If you want to calculate the CIs with PASM Bayesian model, you can use this perl script, or a older version perl script.
For the Perl version: samtools mpileup -r ${chr}:${pos}-${pos} -f <reference_file> -Q0 -q0 -AB -d3000 <input_bam> | ./get_ref_alt_baseQ_corrected_calculate_only_2016_12_03.pl
Primers for the amplicons are designed based on the Primer3 Command Line version.
- Cell-type-resolved mosaicism reveals clonal dynamics of the human forebrain 2024. (Nature)
- Control-independent mosaic single nucleotide variant detection with DeepMosaic. 2023. (Nature Biotechnology)
- Comprehensive multi-omic profiling of somatic mutations in malformations of cortical development. 2023. (Nature Genetics)
- Somatic mosaicism reveals clonal distributions of neocortical development. 2022. (Nature)
- Unbiased mosaic variant assessment in sperm: a cohort study to test predictability of transmission. 2022. (eLife)
- Developmental and temporal characteristics of clonal sperm mosaicism. 2021. (Cell)
- Comprehensive identification of somatic nucleotide variants in human brain tissue. 2021. (Genome Biology)
- Autism risk in offspring can be assessed through quantification of male sperm mosaicism. 2020. (Nature Medicine)
- mTOR pathway somatic variants and the molecular pathogenesis of hemimegalencephaly. 2020. (Epilepsia Open)
- Genomic mosaicism in the pathogenesis and inheritance of a Rett syndrome cohort. 2019. (Genetics in Medicine)
- Mosaicism and incomplete penetrance of PCDH19 mutations. 2019. (Journal of Medical Genetics)
- Somatic double-hit in MTOR and RPS6 in hemimegalencephaly with intractable epilepsy. 2019. (Human Molecular Genetics)
- ATP1A3 mosaicism in families with alternating hemiplegia of childhood. 2019. (Clinical Genetics)
- Distinctive types of postzygotic single-nucleotide mosaicisms in healthy individuals revealed by genome-wide profiling of multiple organs. 2018. (PLoS Genetics)
- MosaicHunter: accurate detection of postzygotic single-nucleotide mosaicism through next-generation sequencing of unpaired, trio, and paired samples. 2017. (Nucleic Acids Research)
- Ultrasensitive and high-efficiency screen of de novo low-frequency mutations by o2n-seq. 2017. (Nature Communications)
- Postzygotic single‐nucleotide mosaicisms contribute to the etiology of autism spectrum disorder and autistic traits and the origin of mutations. 2017. (Human Mutation)
- Genomic mosaicism in paternal sperm and multiple parental tissues in a Dravet syndrome cohort. 2017. (Scientific Reports)
- Amplicon resequencing identified parental mosaicism for approximately 10% of “de novo” SCN1A mutations in children with Dravet syndrome. 2015. (Human Mutation)
📧 Xiaoxu Yang: u6055394@utah.edu, xiaoxuyanglab@gmail.com
-
Cite the Python version or the snakemake wrapper and the binomial model:
Yang X and Breuss MW, et al. Gleeson JG. Developmental and temporal characteristics of clonal sperm mosaicism. 2021. Cell
Breuss MW, Yang X, Schlachetzki JCM, and Antaki D, et al. Gleeson JG. Somatic mosaicism reveals clonal distributions of neocortical development. 2022. Nature
-
Cite the Perl+R version and the Bayesian model:
Yang X and Liu A, et al. Wei L and Zhang Y. Genomic mosaicism in paternal sperm and multiple parental tissues in a Dravet syndrome cohort. 2017. Scientific Reports
Xu X, Yang X, and Wu Q, et al. Wei L and Zhang Y. Amplicon resequencing identified parental mosaicism for approximately 10% of “de novo” SCN1A mutations in children with Dravet syndrome. 2015. Human Mutation