Release date: July 31, 2022
- Added HC4 regressions on HC4 corpora and NeuCLIR22 corpora.
- Added
wikipedia-dpr-100w
regression and the ability use eval utils beyondtrec_eval
. - Added various BEIR regressions.
- Added BM25 quantized regressions for MS MARCO V1.
- Added more Rocchio regression conditions for MS MARCO V1 and V2.
- Added support for Ukrainian and Polish analyzers.
- Added Rocchio bindings to
SimpleSearcher
to support Pyserini. - Added ability to automatically download corpora in regression script.
- Fixed bugs in Rocchio implementation and additional refactoring.
Sorted by number of commits:
- Jimmy Lin (lintool)
- Ogundepo Odunayo (ToluClassics)
- Manveer Tamber (manveertamber)
- Yuqi Liu (yuki617)
- Yuqing Xie (amyxie361)
- Alexei Ivanov (aivan6842)
- Andy Wu (Pie31415)
- Xinyu (Crystina) Zhang (crystina-z)
- Jack Lin (jacklin64)
- Volodymyr Ky (proger)
All contributors with five or more commits, sorted by number of commits, according to GitHub:
- Jimmy Lin (lintool)
- Peilin Yang (Peilin-Yang)
- Ahmet Arslan (iorixxx)
- Edwin Zhang (edwinzhng)
- Xueguang Ma (MXueguang)
- Rodrigo Nogueira (rodrigonogueira4)
- Emily Wang (emmileaf)
- Royal Sequiera (rosequ)
- Yuqi Liu (yuki617)
- Chris Kamphuis (Chriskamphuis)
- Victor Yang (Victor0118)
- Boris Lin (borislin)
- Tommaso Teofili (tteofili)
- Ogundepo Odunayo (ToluClassics)
- Nikhil Gupta (nikhilro)
- Matt Yang (justram)
- Stephanie Hu (stephaniewhoo)
- Yuhao Xie (Kytabyte)
- Shane Ding (shaneding)
- Ronak Pradeep (ronakice)
- Kuang Lu (lukuang)
- Adam Yang (adamyy)
- Luchen Tan (LuchenTan)
- Joel Mackenzie (JMMackenzie)
- Xinyu Mavis Liu (x389liu)
- Salman Mohammed (salman1993)
- Hang Cui (HangCui0510)
- Zhiying Jiang (bazingagin)
- Johnson Han (x65han)
- Matt Yang (d1shs0ap)
- Kelvin Jiang (kelvin-jiang)
- Michael Tu (tuzhucheng)
- Xinyu (Crystina) Zhang (crystina-z)
- Dayang Shi (dyshi)
- Manveer Tamber (manveertamber)
- Zeynep Akkalyoncu Yilmaz (zeynepakkalyoncu)
- Peng Shi (Impavidity)
- Ryan Clancy (ryan-clancy)
- Nandan Thakur (thakur-nandan)