-
Notifications
You must be signed in to change notification settings - Fork 55
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Web VEP: fix incorrect variant end #1059
base: postreleasefix/112
Are you sure you want to change the base?
Web VEP: fix incorrect variant end #1059
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hi @nuno-agostinho ,
Thanks for the changes. I can see it has updated the END
value of the variant to be same as what we get in the CLI. I tested with two variants, and from CLI I get -
rs72549353: 22:42128251-42128254
rs5030656: 22:42128176-42128178
In the sandbox they gave the same END
value - http://wp-np2-11.ebi.ac.uk:7070/Homo_sapiens/Tools/VEP/Results?tl=smKQK4ocAyXPdUac-1
But you will notice that the start value does not match. Is that something you think we should consider? Notice that in the CLI we minimise the allele and gets the position.
The first variant got position - The second variant got - |
Hey @nakib, do you think we should output a
|
Hey there, @nuno-agostinho! Nakib here, albeit a different one. :) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hi @nuno-agostinho,
Yes it makes sense to use $vf->{original_start}
. We have UPLOADED_ALLELE
and REF_ALLELE
in the web VEP output. Than it would at least be matched with the UPLOADED_ALLELE
. Furthermore this is also what is reported by the Variant Table page for location.
Currently it is not matching either of the allele from those two column values.
User-requested bug fix: This PR fixes the incorrect
END
location for short variants in web VEP by storing this information into theINFO/END
field by default, as already done for SVs.Motivation
Web VEP is returning incorrect
END
location for multiple variants when using rsIDs. Given that web VEP output is in VCF format, noEND
location is available by default. To prepare the variant location, web VEP first checks if it's stored inINFO/END
field, otherwise it calculates based on the length of the reference allele: https://github.com/Ensembl/public-plugins/blob/main/tools/modules/EnsEMBL/Web/TmpFile/VcfTabix.pm#L254-L264VEP stores the correct
END
position in the VariantFeature object. As such, we simply need to save this into theINFO/END
field of the VCF output, like in SVs. As this is done for SVs by default, this could also be the default for short variants.Testing
Compare results for the following variants between:
END
location that is incorrect when compared with VEP CLI