Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Examples of "bad" plots #98

Open
hyanwong opened this issue Oct 5, 2023 · 3 comments
Open

Examples of "bad" plots #98

hyanwong opened this issue Oct 5, 2023 · 3 comments

Comments

@hyanwong
Copy link
Member

hyanwong commented Oct 5, 2023

There are some cases where we think that tsqc indicates "bad stuff" (e.g. where there are long edges with no sites, etc). But a new user might not know what to look for. We should collect a gallery of these examples, with a bit of a discussion on each as to why we think this is a patter to be wary about. This could, I suppose, be in some sort of tutorial.

@barneyhill
Copy link

barneyhill commented Oct 9, 2023

UKB 450K Exomes + Array (MAF>0.001). TTN (chr2) gene (tsinfer+tdate):

Overview
image
Mutations
image
Edges
image
Trees
image
Nodes
image

@barneyhill
Copy link

UKB 450K Exomes + Array (MAF>0.001). Entire chr21 (tsinfer+tdate):

Overview
image
Mutations
image
image
Edges
image
Trees
image
Nodes
image

Notes

Jerome recommended not to continue with this tree sequence due to these QC plots. Loading times for these plots are substantial (~mins per plot) but I think that's expected given the size of the ts.

@jeromekelleher
Copy link
Member

Very interesting, thanks @barneyhill.

The TTN plots don't look terrible to me. I think it would be worth looking at the actual tree for a few random subsets (of say 10 samples) at a particular location, to get a feel for what the deep structure looks like.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants