Release date: October 23, 2022
- Implemented
HuggingFaceTokenizerAnalyzer
. - Added regressions for MIRACL dev set.
- Added regressions for Mr. TyDi using
HuggingFaceTokenizerAnalyzer
. - Added Telugu analyzer.
- Added ability to extract pseudo-relevance feedback terms.
- Fixed
batch_search
for RM3. - Refactored BM25prf to make thread-safe.
- Minor changes to background linking regressions.
Sorted by number of commits:
- Ogundepo Odunayo (ToluClassics)
- Jimmy Lin (lintool)
- Jason Wu (Jasonwu-0803)
- Melody Li (limelody)
- Xinyu (Crystina) Zhang (crystina-z)
All contributors with five or more commits, sorted by number of commits, according to GitHub:
- Jimmy Lin (lintool)
- Peilin Yang (Peilin-Yang)
- Ogundepo Odunayo (ToluClassics)
- Ahmet Arslan (iorixxx)
- Xueguang Ma (MXueguang)
- Edwin Zhang (edwinzhng)
- Rodrigo Nogueira (rodrigonogueira4)
- Emily Wang (emmileaf)
- Royal Sequiera (rosequ)
- Yuqi Liu (yuki617)
- Tommaso Teofili (tteofili)
- Chris Kamphuis (Chriskamphuis)
- Victor Yang (Victor0118)
- Boris Lin (borislin)
- Matt Yang (justram)
- Nikhil Gupta (nikhilro)
- Stephanie Hu (stephaniewhoo)
- Shane Ding (shaneding)
- Yuhao Xie (Kytabyte)
- Ronak Pradeep (ronakice)
- Kuang Lu (lukuang)
- Adam Yang (adamyy)
- Joel Mackenzie (JMMackenzie)
- Xinyu Mavis Liu (x389liu)
- Salman Mohammed (salman1993)
- Luchen Tan (LuchenTan)
- Xinyu (Crystina) Zhang (crystina-z)
- Manveer Tamber (manveertamber)
- Matt Yang (d1shs0ap)
- Kelvin Jiang (kelvin-jiang)
- Johnson Han (x65han)
- Hang Cui (HangCui0510)
- Zhiying Jiang (bazingagin)
- Michael Tu (tuzhucheng)
- Dayang Shi (dyshi)
- Yuqing Xie (amyxie361)
- Nandan Thakur (thakur-nandan)
- Zeynep Akkalyoncu Yilmaz (zeynepakkalyoncu)
- Peng Shi (Impavidity)
- Ryan Clancy (ryan-clancy)