PopGen-Scripts

Miscellaneous scripts I have used in ancient DNA and popgen. Take with a grain of salt.

ADMIXTOOLS

https://github.com/DReichLab/AdmixTools \

F Statistics

https://bodkan.net/admixr/articles/tutorial.html

qpWave and qpAdm

https://github.com/DReichLab/AdmixTools/blob/master/README.QpWave
Both take the same input parameter file:

genotypename:   input genotype file (in eigenstrat or packedancestrymap r format)
snpname:       input snp file      (in eigenstrat format)
indivname:     input indiv file    (in eigenstrat format)
popleft:       left population list (1 per line) 
popright:      right population list (1 per line) 
details:       YES

qpWave finds out how many admixture events are betwee the left and right populations. usually this should be run before qpAdm
qpAdm then finds the weight of admixture from the rightpop to leftppop (target).

Pairwise individual comparisons with qpWave to find populations

To test if each pair of individuals forms a clade relative to outgroups. Outgroup matters - if you use africa then everything will be a clade relatively, if you use something too similar then nothing will be a clade.

Use the qpWave.sh script to run qpwave for every pair ina target population. Requires eigenstrat dataset containing the target population and outgroup population.
Use qpWaveLoop.sh if you want to run pairwise qpwave for multiple populations in your dataset, it submits batch jobs for each specified population in a loop.
Download the data and plot in R with qpWave_Pairwise.R

Outgroup F3 tests

Tests of this tree:

F3 is a measure of the branch length of the blue, therefore higher F3 means closer related X and Test.

Use qp3Pop.sh to run outgroup F3 on a dataset, specifying the Test populaiton, and the script will rotate X as all other populations in the dataset.
Use qp3Pop_Loop.sh if you want to submit a job of qp3Pop.sh for each test population. i.e, to run every combination in your dataset.
Use F3_plot.R to plot in R

Kinship Analyses

READ

The kinship analysis in READ is one-liners in plink to prune the data then one-liner for READ … “python ./READ.py dataprefix”. READ: https://bitbucket.org/tguenther/read/src/master/

Prepare dataset:

plink --bfile ${DATA} --keep-allele-order --maf 0.01 --geno 0.999999 --mind 1.0 --allow-no-sex --recode transpose --out ${DATA}

READ requires R so I've been downloading and running locally: Run READ:

python READ.py <dataprefix>

TKGWV2

Paper:https://www.biorxiv.org/content/10.1101/2021.06.22.449449v1
GitHub: https://github.com/danimfernandes/tkgwv2 \

Phoenix Wiki for using Python virtual envs inside slurm script: https://wiki.adelaide.edu.au/hpc/Python_virtual_environment

hapROH

EIGENSTRAT file input
can test close-kin unions & background relatedness
primed for pseudohaploid 1240k data
https://github.com/hringbauer/hapROH
https://pypi.org/project/hapROH/

prepare the meta information file, usually called <prefix>_meta_blank.csv that looks like:

iid,age,clst,lat,lon
Sample1,10000,Population1,12.03,37.89

the age and lat/long are only used in the mapping hapROH step which I never got to run, so you can put dummy values if you only want to obtain barplots. the population controls how the individuals are clustered.

run hapROH.py on your data, you'll need to edit the script to give the appropriate paths to input files.
run plot_hapROH.py to generate the classic hapROH barplot.

Name		Name	Last commit message	Last commit date
Latest commit History 68 Commits
MDS_plot.R		MDS_plot.R
NJ_tree.R		NJ_tree.R
READ.sh		READ.sh
README.md		README.md
TKGWV2.sh		TKGWV2.sh
Yfitter.sh		Yfitter.sh
clustered_pheatmap_plot.R		clustered_pheatmap_plot.R
conditional_heterozygosity.sh		conditional_heterozygosity.sh
fEEMS.md		fEEMS.md
hapROH.py		hapROH.py
mergeBED.sh		mergeBED.sh
mergeit.sh		mergeit.sh
plot_hapROH.py		plot_hapROH.py
population_F3_barplot.R		population_F3_barplot.R
qp3Pop.sh		qp3Pop.sh
qp3Pop_Loop.sh		qp3Pop_Loop.sh
qpDstat_f4mode.sh		qpDstat_f4mode.sh
qpGraph_findGraphs.R		qpGraph_findGraphs.R
qpWave.sh		qpWave.sh
qpWaveLoop.sh		qpWaveLoop.sh
rename_SNPname_bim.sh		rename_SNPname_bim.sh
vcf_to_pos_BED_snp.sh		vcf_to_pos_BED_snp.sh
yHaplo.sh		yHaplo.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PopGen-Scripts

ADMIXTOOLS

F Statistics

qpWave and qpAdm

Pairwise individual comparisons with qpWave to find populations

Outgroup F3 tests

Kinship Analyses

READ

TKGWV2

hapROH

About

Releases

Packages

Languages

roberta-davidson/PopGen-Scripts

Folders and files

Latest commit

History

Repository files navigation

PopGen-Scripts

ADMIXTOOLS

F Statistics

qpWave and qpAdm

Pairwise individual comparisons with qpWave to find populations

Outgroup F3 tests

Kinship Analyses

READ

TKGWV2

hapROH

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages