Skip to content

Latest commit

 

History

History
67 lines (50 loc) · 2.83 KB

README.md

File metadata and controls

67 lines (50 loc) · 2.83 KB

DOI

tabula-sapiens

Welcome to Tabula Sapiens!

Where is the data

Raw data

Since April 2021, Tabula Sapiens data have been made available to all users free of charge. AWS has made the data freely available on Amazon S3 so that anyone can download the resource to perform analysis and advance medical discovery without needing to worry about the cost of storing Tabula Sapiens data or the time required to download it. The data can be browsed but before you can download we require users to complete a data sharing agreement.

The s3 public bucket follows this example structure:

aws s3 ls czb-tabula-sapiens
├── Donor1/
├── Donor10/
├── Donor11/
├── Donor12/
├── Donor13/
├── Donor14/
├── Donor15/
├── Donor2/
│   ├── alignment-gencode/
│   │   ├── 10X/
│   │   ├── smartseq2/
│   ├── fastqs/
│   │   ├── 10X/
│   │   ├── smartseq2/
│   ├── gene-count-tables/
│   ├── immune-repertoire-analysis/
│   │   ├── bracer/
│   │   ├── tracer/
├── Donor3/
├── Donor4/
├── Donor5/
├── Donor6/
├── Donor7/
├── Donor8/
├── Donor9/
├── reference/
│   ├── cellranger/
│   │   ├── homo.gencode.v30.annotation.ERCC92.tgz
│   ├── STAR/
│   │   ├── homo.gencode.v30.annotation.ERCC92.tgz

Each DonorN folder contains all the raw data for TSPN. Each fastqs folder is split between 10x and smartseq with respective files inside, an identical strucutre followed by the alignment-gencode folder. The gene-count-tables correspond the smartseq2 only. For the donors for which we generated smartseq2 we also provide the raw outputs for bracer and tracer inside the respective immune-repertoire-analysis folder. The reference files to use to reprocess the entire dataset from fastqs onwards are provide in reference.

For instructions on how to create an AWS account (free of charge) please refer to AWS documentation

Processed data

Our ready-to-use data is available from figshare: https://figshare.com/projects/Tabula_Sapiens/100973

If you would like to use cellxgene to explore the data on your local machine consider using exploratory-cellxgene

How to use cellxgene

Checkout detailed instructions here

Data Portal

To interact with the data checkout our portal: http://tabula-sapiens-portal.ds.czbiohub.org/