Collating results is slow for large datasets (>1500 genomes) #14
Labels
enhancement
something we'd like pyani to do that it doesn't already
performance
the issue relates to making pyani more efficient
Currently, the code writes out each result individually and defers processing of that output (calculation of ANI etc.) until the end of the run. This leaves a long, uninformative lag before the results are presented to the user.
It may be possible to collate/summarise intermediate results in a file as we go. The total analysis time would be no shorter, but it might avoid that 'dead time' after the alignments are done.
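
As a rough illustration of the "collate as we go" idea, here is a minimal sketch. It is not pyani's code: the `IncrementalCollator` class, the column names, and the CSV summary file are all assumptions made up for this example. Each finished comparison appends one summary row, so the collated table already exists when the last alignment completes.

```python
import csv
from pathlib import Path


class IncrementalCollator:
    """Hypothetical helper: append one summary row per finished comparison."""

    # Assumed column names; a real summary would use whatever pyani reports.
    FIELDS = ("query", "subject", "aligned_length", "identity")

    def __init__(self, path):
        self.path = Path(path)
        # Create the summary file and write the header once, up front.
        with self.path.open("w", newline="") as handle:
            csv.writer(handle).writerow(self.FIELDS)

    def add(self, query, subject, aligned_length, identity):
        # Append a row as soon as a single pairwise comparison has finished.
        with self.path.open("a", newline="") as handle:
            csv.writer(handle).writerow(
                (query, subject, aligned_length, identity)
            )

    def rows(self):
        # Read back everything collated so far (all values come back as str).
        with self.path.open(newline="") as handle:
            return list(csv.DictReader(handle))
```

Usage would be to call `add(...)` from whatever code handles a completed alignment, rather than parsing every output file in one pass at the end. If comparisons finish in parallel, the appends would need to go through a single writer or be guarded by a lock.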