Skip to content

Easy and reproducible end-to-end pipeline for TCGA transcriptome data

Notifications You must be signed in to change notification settings

DabinJeong/TCGA_transcriptome_preprocessing

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

TCGA_preprocessing

Pipeline to download and preprocess TCGA transcriptome data.

nextflow run TCGA_pipeline.nf -c TCGA_pipeline.config

Modify "cancer_type" field of TCGA_pipeline.config file if you want to get data of other cancer type. Please refer to TCGA Study abbreviation (https://gdc.cancer.gov/resources-tcga-users/tcga-code-tables/tcga-study-abbreviations) for available cancer types.

Gene length computed with GTF tools (https://www.genemine.org/gtftools.php) with the following code, which calculated gene length as a length of merged exons of isoforms (non-overlapping exonic length).

gtftools -l gencode.v36.geneLength.exon_length gencode.v36.annotation.gtf

About

Easy and reproducible end-to-end pipeline for TCGA transcriptome data

Resources

Stars

Watchers

Forks

Packages

No packages published