-
Notifications
You must be signed in to change notification settings - Fork 17
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
cis-eQTL summary statistics application #18
Comments
One way of solving this issue I think is to put the GENE as a prefix of the ID, i.e. GENE:CHR:POS:REF:ALT. bcftools have a option to merge multiple vcf file by their ID. As indicated below,
Putting Gene information in any other field will results in a mismatched in the output of bcftools merge as indicated below:
|
The resulting file would not be a valid VCF file since the same variant would be present in multiple rows. INFO field is for variant-level information that does not change across samples/traits/genes, while sample/trait/gene columns are for information (like genotype or association statistics) that vary by sample/trait/gene. |
I wonder what people would suggest for storing eg cis-eQTL data, when a variant is in cis with multiple genes. I'm thinking of using
INFO
to annotate the cis-gene for a variant so there could be multiple lines having the same chr, pos, ref, alt but differentINFO
on gene, and possibly a different ID as a result of including cis-gene names. Is that the best thing to do?The text was updated successfully, but these errors were encountered: