Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

mm10 vs Grcm38(Gencode) #143

Open
pdellorusso opened this issue Jan 4, 2019 · 2 comments
Open

mm10 vs Grcm38(Gencode) #143

pdellorusso opened this issue Jan 4, 2019 · 2 comments

Comments

@pdellorusso
Copy link

This is a question, not an issue, but I am curious about whether there is a specific reason to use mm10 over the GRCm38 (https://www.gencodegenes.org/mouse/) primary assembly available from Gencode?

Is this the standard mouse genome assembly to use for all Encode standardized pipelines?

@strattan
Copy link

strattan commented Jan 7, 2019

@pdellorusso Thanks for your question. The GRCm38 build ENCODE uses is based on what GRC calls the "latest major release", which is at the "GRCm38" tab here: https://www.ncbi.nlm.nih.gov/grc/mouse

We do not apply the periodic patches GRC applies, which is up to p6 at this time.

The mm10 ENCODE uses for mapping has chromosome names in "UCSC format" (like "chr1"), and includes autosomes, both sex chromosomes, M, and the unplaced and unlocalized scaffolds. Downstream analysis may choose to use any subset of those mappings but the mapping is always to the same reference.

For transcript annotations, we have used GENCODE M4 https://www.gencodegenes.org/mouse/release_M4.html. We anticipate upgrading to a more recent GENCODE build this year, but the ENCODE RNA working group have not decided on exactly which build or what that timeline is. When we do decide, we will make an announcement on https://www.encodeproject.org/

I hope that's helpful!

@XiaoYan000
Copy link

Hi, I am struggling with annotating by Gencode M25. I have used annotate_variation function and set parameters according to "Create your own gene definition databases for non-human species". After I acquired variant_function, and exonic_variant_function files, I am wondering how to make an output like that table_annotate give, so that I can input them into the maftools for downstream analysis. I am looking forward to your reply. Thank you very much!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants