0.9.0
Significant update - refactoring of public API (e.g. pt.transformer.TransformerBase
-> pt.Transformer
) and support in the Terrier backend for making indices from pre-tokenised documents. Python 3.10 is now supported.
What's Changed
- fix error in IRDSDataset when a query field is named "query" by @seanmacavaney in #303
- Fix type annotation by @heinrichreimer in #313
- addresses #315 IRDS corpus_iter are not subscriptable by @cmacdonald in #316
- Missing comma in bm25_qe example by @JohnGiorgi in #319
- Argument meta should be supplied as dictionary by @JohnGiorgi in #320
- use Jnius 1.4 by @cmacdonald in #249
- Python 3.10 support by @cmacdonald in #322
- Lz4 support for pt.io.autoopen() by @cmacdonald in #323
- addresses #326 faster version of add_ranks for single queries by @cmacdonald in #327
- addresses #321 pt.apply.doc_score batching by @cmacdonald in #325
- IterDictIndexer can index pre-tokenised documents by @cmacdonald in #328
- Bump logback-core from 1.2.0 to 1.2.9 in /terrier-python-helper by @dependabot in #336
- documenting BM25F controls and tuning by @cmacdonald in #296, addresses #294
- 0.9refactor by @cmacdonald in #314, #339, addresses #271
- pt.Experiment() alters the input measures list to drop "mrt" #301
- Expose Termpipelines in Terrier index backend by @cmacdonald in #338
- pt.rewrite.tokenise() impl by @cmacdonald in #340 addresses #252 #253
- upgraded GitHub actions by @cmacdonald in #341, #342
- fix LTR groupby for xgboost & lightgbm by @cmacdonald in #284
New Contributors
- @heinrichreimer made their first contribution in #313
- @JohnGiorgi made their first contribution in #319
Full Changelog: 0.8.1...0.9.0