Mouse UTRome Atlas

This pipeline merges multiple datasets to define an atlas of 3' UTRs. At its core is a tabulation of how many distinct cell types use a particular tandem (non-intronic) isoform at either 5% or 10% frequency. Genes that have more that two isoforms of this type are classified as multi-UTR genes.

Usage

The pipeline relies on Snakemake and Conda/Mamba. If Conda is not installed, we recommend a Miniforge variant, specifically Mambaforge.

To run with the same Snakemake version, please recreate the environment using:

# replace 'mamba' with 'conda'
mamba env create -f envs/snakemake_5_31.min.yaml

After activating the above environment (conda activate snakemake_5_31), the pipeline can be run with:

snakemake

where the Snakefile is in the working directory.

One will need to update the config.yaml file to provide the file locations.

Adding Datasets

The datasets are assumed to result from the scUTRquant pipeline. They must be added to config.yaml, under the sce object, and added to the merge_sces rule in Snakefile. The colData columns retained in the merged dataset are:

cell_id
tissue
cell_type
cluster
sample
age

Conforming data is done in scripts/merge_sces.R.

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
envs		envs
metadata		metadata
scripts		scripts
README.md		README.md
Snakefile		Snakefile
config.yaml		config.yaml
lsf.yaml		lsf.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Mouse UTRome Atlas

Usage

Adding Datasets

About

Releases 2

Packages

Languages

Mayrlab/atlas-mm

Folders and files

Latest commit

History

Repository files navigation

Mouse UTRome Atlas

Usage

Adding Datasets

About

Resources

Stars

Watchers

Forks

Releases 2

Packages 0

Languages

Packages