Skip to content

Word2Vec and Pseudo Relevance Feedback for Elasticsearch in IR

Notifications You must be signed in to change notification settings

Sam3448/PseudoFeedback

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

33 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PseudoFeedback

Pseudo Feedback for Elasticsearch in Information Retrieval.

Basic structure

This work focuses on query expansion for first step, and pseudo feedback for second.

For query expansion, I experiments on both locally trained word embedding (MT bi-lingual English source) and FastText pre-trained Wiki word embedding. The results get better when using pre-trained embedding.

For retrieval part, I also indexed and searched MT and GOLD (manually) translated data. Though vocabulary matches for MT data, GOLD works better, with 3% better of miss probabillity.

Further experiments will be conducted for Pseudo Feedback.

About

Word2Vec and Pseudo Relevance Feedback for Elasticsearch in IR

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages