Release date: April 18, 2020
- Improved support for the COVID-19 Open Research Dataset (CORD-19).
- Added `SimpleNearestNeighborSearcher`, exposing approximate nearest neighbor search capabilities in Pyserini.
- Added support for Trialstreamer, exposed as a new vertical in the Neural Covidex.
- Added `CovidTopicReader` topic reader and topics for TREC-COVID.
- Added `Covid19QueryGenerator`, improved query generation for COVID-19 queries.
- Refactored `SimpleSearcher` and `IndexReaderUtils` for better Pyserini support.
- Refactored indexing pipeline based on revised contract of `contents()` and `raw()` in `SourceDocument`.
- Refactored Core17 and Core18 regressions to reflect revised `SourceDocument` contract (above). Regression values changed slightly, see here and here.
- Improved integration testing harness.
- Added end-to-end integration tests for `AclAnthology` and `CoreCollection`.
- Added initial test cases for CORD-19.
Contributors to this release, sorted by number of commits:
- Jimmy Lin (lintool)
- Edwin Zhang (edwinzhng)
- Tommaso Teofili (tteofili)
- Yuqi Liu (yuki617)
- Kuang Lu (lukuang)
All contributors, sorted by number of commits, according to GitHub:
- Jimmy Lin (lintool)
- Peilin Yang (Peilin-Yang)
- Ryan Clancy (r-clancy)
- Ahmet Arslan (iorixxx)
- Royal Sequiera (rosequ)
- Emily Wang (emmileaf)
- Edwin Zhang (edwinzhng)
- Victor Yang (Victor0118)
- Boris Lin (borislin)
- Tommaso Teofili (tteofili)
- Nikhil Gupta (nikhilro)
- Chris Kamphuis (Chriskamphuis)
- Yuhao Xie (Kytabyte)
- Rodrigo Nogueira (rodrigonogueira4)
- Salman Mohammed (salman1993)
- Luchen Tan (LuchenTan)
- Xinyu Mavis Liu (x389liu)
- Zhiying Jiang (bazingagin)
- Michael Tu (tuzhucheng)
- Dayang Shi (dyshi)
- Zeynep Akkalyoncu Yilmaz (zeynepakkalyoncu)
- Peng Shi (Impavidity)
- Xin Qian (xeniaqian94)
- Kuang Lu (lukuang)
- Adam Roegiest (aroegies)
- Weihua Li (w329li)
- Toke Eskildsen (tokee)
- Zhaohao Zeng (matthew-z)
- Yuqi Liu (yuki617)
- Xing Niu (xingniu)
- Ronak Pradeep (ronakice)
- Mina Farid (minafarid)
- Mengfei Liu (meng-f)
- Maik Fröbe (mam10eks)
- Adrien Grand (jpountz)
- Gaurav Baruah (gauravbaruah)
- Edward Lu (edwardhdlu)
- Adrien Pouyet (Ricocotam)
- Joel Mackenzie (JMMackenzie)
- Ruifan Yu (tiddler)
- Leonid Boytsov (searchivarius)
- Petek Yıldız (ptkyldz)
- Kevin Xu (kevinxyc1)
- Matt Yang (justram)
- Kelvin Jiang (kelvin-jiang)
- Guy Rosin (guyrosin)
- Charles Wu (charW)
- Matteo Catena (catenamatteo)
- Andrew Yates (andrewyates)
- Alireza Mirzaeiyan (amirzaeiyan)
- Antonio Mallia (amallia)
- Horatiu Lazu (MathBunny)
- Edward Li (LuKuuu)