Replies: 6 comments
-
I've just looked at the Parks et al. paper, but for the UBA set there, the authors have created a BioProject at NCBI (https://www.ncbi.nlm.nih.gov/bioproject/?term=PRJNA348753). On that page, there's an "assembly details" table, with a download link that will give you what looks like a tab-separated format containing assembly IDs in the first column. With a bit of scripting magic, you could turn that into an assembly ID file suitable for # use tail to get rid of the two lines of comments on top
# then use cut to only keep the first column
tail -n +3 PRJNA348753_AssemblyDetails.txt | cut -f1 > assembly_ids.txt Now you can use that to run ncbi-genome-download --section genbank --assembly-accessions assembly_ids.txt bacteria,archaea Hope this helps. |
Beta Was this translation helpful? Give feedback.
-
Yes, it works. Thank you |
Beta Was this translation helpful? Give feedback.
-
Let's just keep this open in case anybody else wants to do something similar. |
Beta Was this translation helpful? Give feedback.
-
Sounds good.
Thanks
BK Song
… On Jul 17, 2018, at 10:57 AM, Kai Blin ***@***.***> wrote:
Let's just keep this open in case anybody else wants to do something similar.
—
You are receiving this because you modified the open/close state.
Reply to this email directly, view it on GitHub <#70 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/Alyda1gVe4HvfMXLRkV5GK1CktnGebgHks5uHftGgaJpZM4VP2qS>.
|
Beta Was this translation helpful? Give feedback.
-
Hi Kai, |
Beta Was this translation helpful? Give feedback.
-
Hi Kai, Is there a way to automate the download of PRJNA348753_AssemblyDetails.txt from the NCBI Bioproject website by using only the BioProject ID PRJNA348753 as the input? Thanks. Ching |
Beta Was this translation helpful? Give feedback.
-
Dear kblin,
I like to build a database of MAG/UBA sequences reported in the papers of Parks et al 2017 (https://www.nature.com/articles/s41564-017-0012-7) and Anantharaman et al 2016 (https://www.nature.com/articles/ncomms13219). Is there a way to download the fasta and genbank files of the MAG/UBA sequences?
Thanks
BK
Beta Was this translation helpful? Give feedback.
All reactions