-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ftp.ebi.ac.uk avaialable, but GTEx_ge_brain_frontal_cortex.all.tsv.gz times out #5
Comments
I think this is a better problem report:
My tabix is |
more info. running on ubuntu, a different tabix, same problem:
any thoughts? It's clear this problem is outside of catalogueR! |
Hi @paul-shannon, glad you're finding this tool useful. Thanks for pointing out this issue. I'll look into this and try to figure out what's going on here. Some potential sources:
Potentially related: eQTL-Catalogue/eQTL-Catalogue-resources#15 |
@kauralasoo is there anything on eQTL Catalogue's end that might be causing unstable connections to the FTP server? I just confirmed that the file paths haven't changed, so they do indeed seem to exist. |
Hi @paul-shannon and @bschilder, We just received a confirmation form the EBI helpdesk that the root cause for this was that Paul's IP address had been blocked by the EBI firewall. Paul's IP has been whitelisted now, but unfortunately there is no good solution prevent it from happening to other users, because tabix requests over FTP (incomplete downloads) look a lot like DDoS attacks to the firewall. The REST API is much more robust, because it is able to rate limit the number of requests by IP address on its own. Best, |
Thanks so much for the response @kauralasoo! This is all really helpful info. I'll make some adjustments to catalogueR and may make it so that the REST API is the default method. Update in dev branch
|
Hi Brian,
One possible caution: Kaur explained to me this about GTEx:
Unfortunately the uniformly processed GTEx summary statistics are currently
not available via the API. We hope to fix this with the next release
planned for January 2022. However, we do have the official GTEx V8 summary
statistics in the API. The study ID for those is GTEx_V8. Thus, this
command works:
https://www.ebi.ac.uk/eqtl/api/chromosomes/8/associations?paginate=False&study=GTEx_V8&qtl_group=Brain_Cortex&quant_method=ge&bp_lower=27603335&bp_upper=27608281
We've found that the official imported GTEx v8 summary statistics have
slightly better power than our re-processed ones, probably due to better
handling of covariates.
So perhaps, in your code, in the construction of the REST url, you could substitute like this, at least until the next release?
study=GTEx_V8 for study=GTEX
As it is now, none of the valuable GTEx eQTLs are available when using the REST interface to catalogueR.
- Paul
… On Nov 30, 2021, at 7:48 AM, Brian M. Schilder ***@***.***> wrote:
Thanks so much for the response @kauralasoo! This is all really helpful info. I'll make some adjustments to catalogueR and may make it so that the REST API is the default method.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub, or unsubscribe.
Triage notifications on the go with GitHub Mobile for iOS or Android.
|
Thanks for the helpful info @paul-shannon, hadn't realized this!
It looks like there is another file called tabix_ftp_paths_imported.tsv: I'll modify |
I've just updated the metadata to include GTEX_V8. I also added a new arg to Currently implemented in the dev branch. |
I'm in the process of overhauling @kauralasoo has anything changed regarding using
|
Thank for this fine package - very useful in our work on Alzheimer's Disease.
I find intermittent - sometimes lasting - problems with the ftp service the package uses.
Here is an example, establishing first that connectivity is good, then showing the error.
The specific file request times out:
The text was updated successfully, but these errors were encountered: