Scripts to execute various steps in the PDC Hi-C data analysis.
Scripts were run using PBS job scheduler or in command line.
Each script contains comments about its functionality.
0a.refgenie.sh
- download genomic sequences and assets using refgenie0b.make_annotation_folder.sh
- softlink refgeine assets into one folder0c.restriction_sites.sh
- create restriction site BED files
1.fetchdata.sh
- download data from SRA usingsratoolkit
2a.cutadapt.sh
- trim reads to certain length usingcutadapt
2b.gzipFiles.sh
- compress files usinggzip
3.makePDXs.sh
- combine human and mouse reads4a.splitPDXs-xenome.sh
- split human and mouse reads usingxenome
4b.splitPDXs-getHg38Reads.sh
- extract human reads using the combined alignment strategy
-
5a.runHiC-Pro
- scripts to run HiC-Pro pipelineconfig-system.txt
- an example of configuration file to install HiC-Proconfig_hicpro_hg38.txt
- an example of sample-specific configuration file for HiC-Pro1.run_HiCPro_hg38.sh
- a script creating the following files using settings fromconfig_hicpro_hg38.txt
2.HiCPro_step1_PDX.qsub
- First step of HiC-Pro pipeline3.HiCPro_step2_PDX.qsub
- Second step of HiC-Pro pipelineinputfiles_PDX.txt
- List of samples to process. Only "R1" file is necessary
-
5b.runHiCExplorer
- scripts to run HiCExplorer pipelinerun_bwamem_R1.sh
- aligning R1 readsrun_bwamem_R2.sh
- aligning R2 readshicBuildMatrix.sh
- run HiCExplorer on the aligned R1 and R2 BAM files
-
5c.runJuicer
- scripts to run Juicer pipelinesetup_folders.sh
- setup folders and softlink raw data for Juicer processingrun_juicer.sh
- run Juicer on the linked data
-
5d.collectResults
- scripts to collect and merge QC metrics from different pipelines11.hicpro_collectStats.sh
- HiC-Pro stats12.hicpro_mergeStats.sh
21.hicexplorer_collectStats.sh
- HiCExplorer stats22.hicexplorer_mergeStats.sh
31.juicer_collectStats.sh
- Juicer stats32.juicer_mergeStats.sh
-
6.convertFileFormats
- scripts to convert file formats to.h5
hicpro_to_h5.sh
- convert HiC-Pro results to.h5
formatjuicer_to_h5
- scripts for step-wise conversion of Juicer results to.h5
format1.juicer_to_txt.sh
- convert.hic
format to text format2.juicer2hicpro.R
- R script to convert text format to matrix. Called by3.convert_to_hicpro.qsub
3.convert_to_hicpro.sh
- convert text format to matrix4.hicpro_to_h5.sh
- convert and KR normalize matrix to.h5
format
-
7.hicPlotDistVsCounts.sh
- estimate distance-dependent decay -
8.TADs.sh
- TAD calling usinghicFindTADs