2.0.0-beta.0

Pre-release
@thatbudakguy thatbudakguy released this 12 Feb 14:47

Major update that adds support for working with a corpus of many files.

  • BLAST-like matching strategy based on chinesetextreuse
  • Fast indexing and lookup powered by spaCy
  • All-against-all matching supported via graph structure
  • Better logging and console output via rich
  • Smith-Waterman alignment of matches via lingpy
  • Structured data input via .jsonl files
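For readers unfamiliar with the alignment step named above, here is a minimal pure-Python sketch of Smith-Waterman local alignment. It is an illustration of the general algorithm only, not lingpy's implementation or API and not this project's code. The function name `smith_waterman` and the scoring values (`match=2`, `mismatch=-1`, `gap=-1`) are assumptions chosen for the example.

```python
def smith_waterman(a, b, match=2, mismatch=-1, gap=-1):
    """Return (best_score, aligned_a, aligned_b) for the best local alignment.

    Illustrative sketch with a linear gap penalty; scoring values are
    hypothetical defaults, not those used by lingpy or this release.
    """
    rows, cols = len(a) + 1, len(b) + 1
    # score[i][j] = best score of a local alignment ending at a[i-1], b[j-1]
    score = [[0] * cols for _ in range(rows)]
    best, best_pos = 0, (0, 0)

    for i in range(1, rows):
        for j in range(1, cols):
            diag = score[i - 1][j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            up = score[i - 1][j] + gap
            left = score[i][j - 1] + gap
            # Clamping at 0 is what makes the alignment local rather than global.
            score[i][j] = max(0, diag, up, left)
            if score[i][j] > best:
                best, best_pos = score[i][j], (i, j)

    # Trace back from the highest-scoring cell until a zero cell is reached.
    i, j = best_pos
    out_a, out_b = [], []
    while i > 0 and j > 0 and score[i][j] > 0:
        diag = score[i - 1][j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
        if score[i][j] == diag:
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i, j = i - 1, j - 1
        elif score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1

    return best, "".join(reversed(out_a)), "".join(reversed(out_b))


if __name__ == "__main__":
    print(smith_waterman("xxhelloyy", "zzhellozz"))
```

Because scores are clamped at zero, the unrelated characters on either side of a shared passage do not drag the alignment down, which is why local alignment suits text-reuse detection.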