Release date: December 13, 2019
- Upgraded to Lucene 8.3.0.
- Added basic support for indexing and search in non-English languages.
- Added regressions for NTCIR-8 (Chinese), CLEF 2006 (French), TREC 2002 (Arabic), and FIRE 2012 (Bengali, Hindi, English).
- Added instructions and regressions for docTTTTTquery on MS MARCO Passage Retrieval task.
- Added initial support for indexing CORE open access research papers.
- Added similarity that accurately computes document lengths for BM25.
- Added support for approximate nearest-neighbor search (see arXiv:1910.10208).
- Added ability to read topics directly from the fatjar.
- Improved support for Elasticsearch.
- Improved support for Solr.
- Removed Pyserini from repo, now standalone project.
- Refactored SimpleSearcher in coordination with initial Pyserini release.
- Added IndexReaderUtils, exposing various hooks for Pyserini.
- Added integration with Codecov.
- Added more test cases to increase test coverage.
- Added links to Anserini notebooks.
- Added end-to-end integration tests for Solr and Elasticsearch.
- Moved CACM collection into repo for cleaner, more self-contained testing.
- Refactored logging and counters in indexer, improved documentation of options.
- Cleaned up warnings during build.
- Slimmed down fatjar size, removed unnecessary dependencies.
Sorted by number of commits:
- Jimmy Lin (lintool)
- Tommaso Teofili (tteofili)
- Chris Kamphuis (Chriskamphuis)
- Xinyu Mavis Liu (x389liu)
- Ryan Clancy (r-clancy)
- Zhiying Jiang (bazingagin)
- Peng Shi (Impavidity)
- Zeynep Akkalyoncu Yilmaz (zeynepakkalyoncu)
- Weihua Li (w329li)
- Alireza Mirzaeiyan (amirzaeiyan)
- Kelvin Jiang (kelvin-jiang)
- Leonid Boytsov (searchivarius)
- Maik Fröbe (mam10eks)
- Rodrigo Nogueira (rodrigonogueira4)
- Zhaohao Zeng (matthew-z)
Sorted by number of commits, according to GitHub:
- Jimmy Lin (lintool)
- Peilin Yang (Peilin-Yang)
- Ryan Clancy (r-clancy)
- Ahmet Arslan (iorixxx)
- Royal Sequiera (rosequ)
- Emily Wang (emmileaf)
- Victor Yang (Victor0118)
- Boris Lin (borislin)
- Yuhao Xie (Kytabyte)
- Chris Kamphuis (Chriskamphuis)
- Tommaso Teofili (tteofili)
- Rodrigo Nogueira (rodrigonogueira4)
- Salman Mohammed (salman1993)
- Luchen Tan (LuchenTan)
- Zhiying Jiang (bazingagin)
- Michael Tu (tuzhucheng)
- Dayang Shi (dyshi)
- Xinyu Mavis Liu (x389liu)
- Peng Shi (Impavidity)
- Xin Qian (xeniaqian94)
- Adam Roegiest (aroegies)
- Zeynep Akkalyoncu Yilmaz (zeynepakkalyoncu)
- Weihua Li (w329li)
- Toke Eskildsen (tokee)
- Zhaohao Zeng (matthew-z)
- Kuang Lu (lukuang)
- Xing Niu (xingniu)
- Ronak Pradeep (ronakice)
- Mina Farid (minafarid)
- Mengfei Liu (meng-f)
- Adrien Grand (jpountz)
- Gaurav Baruah (gauravbaruah)
- Edward Lu (edwardhdlu)
- Adrien Pouyet (Ricocotam)
- Ruifan Yu (tiddler)
- Leonid Boytsov (searchivarius)
- Petek Yıldız (ptkyldz)
- Maik Fröbe (mam10eks)
- Matt Yang (justram)
- Kelvin Jiang (kelvin-jiang)
- Charles Wu (charW)
- Matteo Catena (catenamatteo)
- Andrew Yates (andrewyates)
- Antonio Mallia (amallia)
- Alireza Mirzaeiyan (amirzaeiyan)
- Horatiu Lazu (MathBunny)