Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ncbi-genome-download works with refseq for me, but not genbank #234

Open
Laura-Alex opened this issue Nov 19, 2024 · 1 comment
Open

ncbi-genome-download works with refseq for me, but not genbank #234

Laura-Alex opened this issue Nov 19, 2024 · 1 comment

Comments

@Laura-Alex
Copy link

This might be the same issue as #224 .

I am trying to download a couple of prokaryotic genomes. Some are in Refseq and Genbank, others only in the latter. When specifying Genbank, the code does not work, even as just a dry run. For example:

This is a RefSeq genome; this code returns the expected result.
ncbi-genome-download --section refseq bacteria -F fasta -r 5 -A GCF_001548395.1 -n

This is a copy of the same genome, stored in GenBank. This code does not work and just...keeps going forever.
ncbi-genome-download --section genbank bacteria -F fasta -r 5 -A GCA_001548395.1 -n

Other things I have tried and did not work:
-using taxid instead
-moving '--section genbank bacteria' to the end of the code
-with and without '-r 5'

When I finally kill the process, this is the error result I get:


Traceback (most recent call last):
  File "/home/user/.conda/envs/environment/bin/ncbi-genome-download", line 10, in <module>
    sys.exit(main())
  File "/home/user/.conda/envs/environment/lib/python3.10/site-packages/ncbi_genome_download/__main__.py", line 26, in main
    ret = args_download(args)
  File "/home/user/.conda/envs/environment/lib/python3.10/site-packages/ncbi_genome_download/core.py", line 184, in args_download
    return config_download(config)
  File "/home/user/.conda/envs/environment/lib/python3.10/site-packages/ncbi_genome_download/core.py", line 203, in config_download
    download_candidates = select_candidates(config)
  File "/home/user/.conda/envs/environment/lib/python3.10/site-packages/ncbi_genome_download/core.py", line 317, in select_candidates
    entries = parse_summary(summary_file)
  File "/home/user/.conda/envs/environment/lib/python3.10/site-packages/ncbi_genome_download/core.py", line 436, in parse_summary
    return SummaryReader(summary_file)
  File "/home/user/.conda/envs/environment/lib/python3.10/site-packages/ncbi_genome_download/summary.py", line 13, in __init__
    while 'assembly_accession' not in line:
KeyboardInterrupt

I would appreciate your help!

@hellopeccat
Copy link

hellopeccat commented Nov 27, 2024

same

I think this might be the same issue as #228 rather than #224

If the task need to be down as soon as possible, you could use the official CLI ncbi_datasets instead.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants