How to run the code?
- Download two folders (data.nosync and engine_data) from my Google Drive
- Put both folders in the root directory of the project
- Make sure that your Python version is 3.7+
- Install all required packages by running
  pip3 install -r requirements.txt
  P.S.: it is better to use a virtual environment.
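If you want to double-check the setup before running anything, here is a minimal sketch of a pre-flight check. It is not part of the project: the function name check_setup is hypothetical, and the folder names are taken from the download step above, assuming they sit directly in the project root.

```python
import sys
from pathlib import Path

# Folder names as listed in the download step (assumed to live in the project root).
REQUIRED_DIRS = ("data.nosync", "engine_data")
MIN_PYTHON = (3, 7)


def check_setup(root="."):
    """Return a list of human-readable problems; an empty list means the setup looks fine."""
    problems = []
    if sys.version_info < MIN_PYTHON:
        found = "{}.{}".format(sys.version_info[0], sys.version_info[1])
        problems.append("Python 3.7+ is required, found " + found)
    for name in REQUIRED_DIRS:
        if not (Path(root) / name).is_dir():
            problems.append("missing folder: " + name)
    return problems


if __name__ == "__main__":
    issues = check_setup()
    print("\n".join(issues) if issues else "Setup looks fine.")
```

Run it from the project root; it only reads the file system and does not check the installed packages.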
Now you can run the code by simply typing python3 doc_sum.py for the document summarization task and python3 query_exp.py for the query expansion task.
To use a different query for document summarization, change the line query = "your query here" in the launch() function in doc_sum.py.
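If editing the source every time gets tedious, one option is to read the query from the command line instead. The sketch below is only an illustration: parse_query and the --query flag are hypothetical and do not exist in doc_sum.py, and it assumes you would pass the returned string to the query variable in launch() yourself.

```python
import argparse

# Placeholder default, same as the line shown in this README.
DEFAULT_QUERY = "your query here"


def parse_query(argv=None):
    """Return the query given via --query, or the default when the flag is absent."""
    parser = argparse.ArgumentParser(description="Document summarization (sketch)")
    parser.add_argument("--query", default=DEFAULT_QUERY,
                        help="query text used for summarization")
    return parser.parse_args(argv).query
```

With something like this wired in, the call would look like python3 doc_sum.py --query "solar energy", but that is an adaptation you would have to make, not existing behaviour.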
If you have a problem with nltk (most likely the datasets have not been downloaded), run
import nltk
nltk.download('wordnet') # required for query expansion
nltk.download('stopwords') # required for both parts
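The same two downloads can be wrapped in a small helper that reports which ones failed. This is a sketch, not project code: ensure_nltk_resources is a hypothetical name, and the download parameter exists only so the helper can be tried without network access. By default it uses nltk.download, which returns True on success.

```python
def ensure_nltk_resources(resources=("wordnet", "stopwords"), download=None):
    """Download each NLTK resource and return the names that could not be fetched."""
    if download is None:
        # Imported lazily so the helper can be defined even before nltk is installed.
        import nltk
        download = nltk.download
    # quiet=True suppresses the progress output of nltk.download.
    return [name for name in resources if not download(name, quiet=True)]
```

Calling ensure_nltk_resources() once before the first run should be enough; an empty list means both datasets are available.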