2.0.0-beta.0
Pre-release
Major update that adds support for working with a corpus of many files.
- BLAST-like matching strategy based on chinesetextreuse
- Fast indexing and lookup powered by spaCy
- All-against-all matching supported via graph structure
- Better logging and console output via rich
- Smith-Waterman alignment of matches via lingpy
- Structured data input via .jsonl files
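
For readers unfamiliar with Smith-Waterman alignment, here is a minimal pure-Python sketch of the general technique applied to two token sequences. It is illustrative only: it does not use lingpy and is not this project's implementation. The function name `smith_waterman` and the scoring values (`match`, `mismatch`, `gap`) are assumptions chosen for the example.

```python
def smith_waterman(a, b, match=1, mismatch=-1, gap=-1):
    """Return (best_score, aligned_a, aligned_b) for the best local alignment.

    Hypothetical helper for illustration; scoring values are assumptions.
    """
    rows, cols = len(a) + 1, len(b) + 1
    # Scoring matrix with a zero-filled first row and column.
    score = [[0] * cols for _ in range(rows)]
    best, best_pos = 0, (0, 0)

    for i in range(1, rows):
        for j in range(1, cols):
            diag = score[i - 1][j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # Local alignment: a cell never drops below zero.
            score[i][j] = max(0, diag, score[i - 1][j] + gap, score[i][j - 1] + gap)
            if score[i][j] > best:
                best, best_pos = score[i][j], (i, j)

    # Trace back from the highest-scoring cell until a zero cell is reached.
    i, j = best_pos
    out_a, out_b = [], []
    while i > 0 and j > 0 and score[i][j] > 0:
        diag = score[i - 1][j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
        if score[i][j] == diag:
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i, j = i - 1, j - 1
        elif score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")  # gap in b
            i -= 1
        else:
            out_a.append("-")  # gap in a
            out_b.append(b[j - 1])
            j -= 1
    return best, out_a[::-1], out_b[::-1]


score, left, right = smith_waterman(
    "the quick brown fox".split(), "a quick brown dog".split()
)
print(score, left, right)  # 2 ['quick', 'brown'] ['quick', 'brown']
```

Because cell scores are floored at zero, the alignment covers only the best-matching region of the two passages, not their full length.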