Skip to content
This repository has been archived by the owner on Jan 9, 2020. It is now read-only.
/ scarletmoon Public archive

Inverted Indexer ~ project @ information retrieval university subject

License

Notifications You must be signed in to change notification settings

gph03n1x/scarletmoon

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

28 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

README NEEDS UPDATE

Scarlet Moon

Inverted Indexer project from information retrieval. Implemented with python 3.6.0

Dependencies

pip install -r requirements.txt
pip install spacy  # Installation failed on my windows machine.
python -m spacy download en_core_web_sm

Adding documents

usage: scan.py [-h] -f FILTER -d DIRECTORY -o OUTPUT
scan.py: error: the following arguments are required: -f/--filter, -d/--directory, -o/--output

Filters are grep like filters.

Directory is the directory you want to scan and add to the token tree.

Output is the name of the tokentree , which is going to get stored at the storage folder.

python scan.py -f *.sgm -d reuters -o tokentree

Using scarlet

nameko run scarlet

Interacting with the service.

TODO.

Running the unittests

python -m unittest discover

About

Inverted Indexer ~ project @ information retrieval university subject

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published