-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Feat: add rank lineage #130
Conversation
Codecov ReportPatch coverage:
❗ Your organization is not using the GitHub App Integration. As a result you may experience degraded service beginning May 15th. Please install the Github App Integration for your organization. Read more. Additional details and impacted files@@ Coverage Diff @@
## dev #130 +/- ##
==========================================
+ Coverage 82.85% 85.93% +3.08%
==========================================
Files 114 114
Lines 1744 1778 +34
Branches 308 316 +8
==========================================
+ Hits 1445 1528 +83
+ Misses 255 202 -53
- Partials 44 48 +4
☔ View full report in Codecov by Sentry. |
[skip ci]
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Missing any docs updates?
Overall looks ok to me, just a few queries (but not blockers IMO)
No tested, will try now but can't guaruntee I'll get to it, I trust you ;)
"Pseudomonadales;Gammaproteobacteria;Proteobacteria;" | ||
"Bacteria;root", | ||
"Saccharomycetes;Ascomycota;Eukaryota;root", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Which order is this meant to be in the actual file/ I typically would view the rank to go broad to specific left to right,
e.g. "Saccharomycetes;Ascomycota;Eukaryota;root",
would be root;Eukaryota;Ascomycota;Saccaromycetes,
Which also makes me wonder: what happens when there are species that are 'missing' an intermediate taxonmic rank (e.g. has a species, genus, family name, but no order name...., but then continues with classes, phyla etc.)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
That's a fair point. I simply used the order returned by taxopy (which makes sense from the perspective of a particular taxon). However, you're right that most tools provide the lineage from higher to more specific rank.
Tested and no bugs: still works for me! I still personally find the reversed the lineages reflexively unintuitive, but not wrong. You're welcome to merge from my POV! (once docs updated, if necessary!) |
We can turn around the order of the lineages. It makes sense to me and hopefully, not too many people have built code based on this yet. I don't actually know how taxopy reports skipped ranks in the lineage. Would need to add test data for that. Do you happen to know an example from the NCBI taxonomy? |
As discussed, will merge this and make the proposed changes in a new PR. |
👍
Not off the top of my head... I remember seeing it in metaphlan2 or malt results... Maybe candidatus phyla etc? |
Possibly something like this: https://www.ncbi.nlm.nih.gov/Taxonomy/Browser/wwwtax.cgi?mode=Info&id=1387480&lvl=3&lin=f&keep=1&srchmode=1&unlock (Can't check properly from my phone though...) |
--add-rank-lineage
totaxpasta merge
andtaxpasta standardise
#126