Calling SNPs using Stacks pipeline for ponderosa pine using the reference genome of loblolly pine

shumengjun/Stacks_ref

Stacks_ref

Uses the Stacks pipeline to call SNPs from raw GBS data of 94 ponderosa pine (Pinus ponderosa) samples, aligned to the reference genome of loblolly pine (Pinus taeda).

Software:

Input File

  • Raw FASTQ file produced by GBS (genotyping-by-sequencing) with the restriction enzyme ApeKI
  • Barcode information

Output File

VCF file

Step1: clean the data

  • Code: S1_radtags_05.sh, S1_radtags_06.sh
  • Input: two raw FASTQ files, barcode file
  • sliding window: 0.2
  • score limit: 20
  • maximum read length: 64
  • Output: 96 .fq.gz files
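
The settings above map onto a `process_radtags` call roughly as follows. This is a minimal sketch, not the contents of S1_radtags_05.sh or S1_radtags_06.sh: all file and directory names are hypothetical, `-i gzfastq` assumes gzipped input, and `-t 64` assumes that "maximum read length" means truncating reads to 64 bp. Commands are only printed unless `DRY_RUN=0` is set.

```shell
#!/bin/sh
# Sketch of Step 1 with process_radtags; every path below is a hypothetical placeholder.
set -eu

# Print each command by default; set DRY_RUN=0 to execute it instead.
run() {
    if [ "${DRY_RUN:-1}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Demultiplex and quality-filter one raw file.
clean_lane() {
    raw_fastq=$1
    barcodes=$2
    out_dir=$3
    # -e apeKI : restriction enzyme used for the GBS library
    # -q       : enable quality filtering, so that -w and -s take effect
    # -w 0.2   : sliding window as a fraction of the read length
    # -s 20    : score limit within the window
    # -t 64    : truncate reads to 64 bp (assumed meaning of "maximum read length")
    run process_radtags -f "$raw_fastq" -i gzfastq -b "$barcodes" \
        -o "$out_dir" -e apeKI -q -w 0.2 -s 20 -t 64
}

clean_lane raw/lane_05.fastq.gz barcodes_05.txt cleaned
clean_lane raw/lane_06.fastq.gz barcodes_06.txt cleaned
```

Running it with `DRY_RUN=0` would execute the two calls, one per raw file.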

Step8: bwa

  • Code: S8_bwa.sh
  • Input: reference genome of loblolly pine, and output file of Step 1
  • Output: 96 .sam files
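
A possible shape for the alignment loop is sketched below. The README does not say which bwa algorithm S8_bwa.sh uses, so `bwa mem` is an assumption here, as are the thread count, the reference file name, and the directory names. The `-o` option is available in recent bwa releases; with older ones, redirect standard output instead.

```shell
#!/bin/sh
# Sketch of Step 8: align each cleaned read file to the reference (hypothetical paths).
set -eu

# Print each command by default; set DRY_RUN=0 to execute it instead.
run() {
    if [ "${DRY_RUN:-1}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Align one cleaned read file; the sample name is taken from the file name.
align_sample() {
    ref=$1
    reads=$2
    sam_dir=$3
    sample=$(basename "$reads" .fq.gz)
    # bwa mem is an assumption; -t 4 is an arbitrary thread count.
    run bwa mem -t 4 -o "$sam_dir/$sample.sam" "$ref" "$reads"
}

REF=reference/loblolly_pine.fa   # hypothetical file name
run bwa index "$REF"             # the index only needs to be built once
for reads in cleaned/*.fq.gz; do
    [ -e "$reads" ] || continue  # skip when the glob matches nothing
    align_sample "$REF" "$reads" aligned
done
```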

Step9: sam to bam

  • Code: S9_samtobam_05.sh, S9_samtobam_06.sh
  • Input: output file of Step 8
  • Output: 96 .bam files
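
The conversion can be sketched with `samtools view`, assuming samtools is the tool behind S9_samtobam_05.sh and S9_samtobam_06.sh (the README does not name it). The directory names are hypothetical.

```shell
#!/bin/sh
# Sketch of Step 9: convert each SAM file to BAM (hypothetical paths).
set -eu

# Print each command by default; set DRY_RUN=0 to execute it instead.
run() {
    if [ "${DRY_RUN:-1}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Convert one SAM file, keeping the sample name.
sam_to_bam() {
    sam=$1
    bam_dir=$2
    sample=$(basename "$sam" .sam)
    # -b writes BAM output, -o names the output file
    run samtools view -b -o "$bam_dir/$sample.bam" "$sam"
}

for sam in aligned/*.sam; do
    [ -e "$sam" ] || continue  # skip when the glob matches nothing
    sam_to_bam "$sam" bam
done
```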

Step10: sort bam files

  • Code: S10_bamsorted_05.sh, S10_bamsorted_06.sh
  • Input: output file of step 9
  • Output: 96 sorted .bam files
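
Sorting can be sketched with `samtools sort`, again assuming samtools is what the S10 scripts call. Writing the sorted files to a separate, hypothetical `sorted` directory keeps each file named `<sample>.bam`, which is the naming that gstacks expects by default when it is given an input directory and a population map.

```shell
#!/bin/sh
# Sketch of Step 10: coordinate-sort each BAM file (hypothetical paths).
set -eu

# Print each command by default; set DRY_RUN=0 to execute it instead.
run() {
    if [ "${DRY_RUN:-1}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Sort one BAM file by coordinate (the samtools sort default).
sort_bam() {
    bam=$1
    sorted_dir=$2
    run samtools sort -o "$sorted_dir/$(basename "$bam")" "$bam"
}

for bam in bam/*.bam; do
    [ -e "$bam" ] || continue  # skip when the glob matches nothing
    sort_bam "$bam" sorted
done
```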

Step11: gstacks_ref

  • Code: S11_gstacks_ref.sh
  • Input: output file of Step 10
  • Output: one catalog .alleles.tsv.gz file, one catalog .tags.tsv.gz file, one catalog .snp.tsv.gz file
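
A minimal sketch of this step follows. gstacks needs a population map, which the README does not describe, so the sketch builds one from the BAM file names and puts every sample in a single population called `ponderosa`. That label, the thread count, and all paths are assumptions, not taken from S11_gstacks_ref.sh.

```shell
#!/bin/sh
# Sketch of Step 11: build a population map, then run gstacks (hypothetical paths).
set -eu

# Print each command by default; set DRY_RUN=0 to execute it instead.
run() {
    if [ "${DRY_RUN:-1}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Write one "<sample><TAB><population>" line per BAM file in a directory.
make_popmap() {
    bam_dir=$1
    population=$2
    for bam in "$bam_dir"/*.bam; do
        [ -e "$bam" ] || continue  # skip when the glob matches nothing
        printf '%s\t%s\n' "$(basename "$bam" .bam)" "$population"
    done
}

# -I: directory of sorted BAM files, -M: population map, -O: output directory
call_catalog() {
    run gstacks -I "$1" -M "$2" -O "$3" -t 8
}

make_popmap sorted ponderosa > popmap.tsv   # single population is an assumption
call_catalog sorted popmap.tsv stacks_ref
```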

Step12: populations_ref

  • Code: S12_populations_ref.sh
  • Input: output file of Step 11
  • Output: VCF file
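
The VCF export can be sketched with the `populations` program and its `--vcf` flag. The README lists no filtering options, so none are shown here. The directory names, the population map file, and the thread count are hypothetical and do not come from S12_populations_ref.sh.

```shell
#!/bin/sh
# Sketch of Step 12: export SNP calls as VCF with populations (hypothetical paths).
set -eu

# Print each command by default; set DRY_RUN=0 to execute it instead.
run() {
    if [ "${DRY_RUN:-1}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# -P: directory holding the gstacks output, -M: population map,
# -O: output directory, --vcf: write SNPs in VCF format
export_vcf() {
    run populations -P "$1" -M "$2" -O "$3" --vcf -t 8
}

export_vcf stacks_ref popmap.tsv populations_out
```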
