This is the repository for the paper: Analyzing the Effectiveness of the Underlying Reasoning Tasks in Multi-hop Question Answering - EACL 2023 (Findings).
We use two datasets in our experiments: 2WikiMultihopQA and HotpotQA-small. This repository provides:
- Preprocessed data (.gz files) for the dev and train sets of 2Wiki (please download the raw data from the GitHub repository of the 2WikiMultihopQA dataset)
- Raw and preprocessed data for the dev and train sets of HotpotQA-small
- Debiased data
- Adversarial data
We follow the steps in https://github.com/yuwfan/HGN to obtain the .gz data files from the raw data.
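If you only want to inspect these preprocessed files, the minimal sketch below shows one way to open them. It assumes the .gz files are gzipped pickles, as in the HGN pipeline, and the file path is hypothetical.

```python
# Minimal sketch (assumption): load a preprocessed .gz file, treating it as a
# gzipped pickle as produced by HGN-style preprocessing.
import gzip
import pickle

def load_gz_pickle(path):
    """Return the object stored in a gzipped pickle file."""
    with gzip.open(path, "rb") as f:
        return pickle.load(f)

if __name__ == "__main__":
    examples = load_gz_pickle("data/2wiki/dev_example.pkl.gz")  # hypothetical path
    print(type(examples))
```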
bash install_packages.sh
- Download the bigbird-roberta-base model from https://huggingface.co/google/bigbird-roberta-base (a download sketch follows the run commands below)
- Edit the variables data_dir, pretrained_model_dir, and data_file
- Run:
python3 preprocess.py
python3 main.py
python3 predictor.py $checkpoint $data_file
python3 postprocess.py $prediction_file $processed_data_file $original_data_file
python3 official_evaluation.py path/to/prediction path/to/gold
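For the model-download step above, one option is to fetch and save the model with the transformers library. This is a sketch, not the repository's required procedure; the save directory is an assumption and should match pretrained_model_dir.

```python
# Sketch: download google/bigbird-roberta-base via transformers and save it
# locally so that pretrained_model_dir can point to the directory.
from transformers import AutoModel, AutoTokenizer

model_name = "google/bigbird-roberta-base"
save_dir = "pretrained_models/bigbird-roberta-base"  # hypothetical directory

tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModel.from_pretrained(model_name)

tokenizer.save_pretrained(save_dir)
model.save_pretrained(save_dir)
```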
- Download our checkpoints
- Run the script:
predict_dev_all_settings.sh
(Note: if you want to use this script for the test set of 2Wiki, comment out line #25, which runs the evaluation)
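The official evaluation step reports exact match and F1 over answer strings. The sketch below illustrates token-level EM/F1 with the usual HotpotQA-style normalization; it is an illustration of the metrics, not the code in official_evaluation.py.

```python
# Illustrative exact match and token-level F1, assuming HotpotQA-style
# normalization (lowercase, strip punctuation and articles). Not the official script.
import re
import string
from collections import Counter

def normalize_answer(s):
    s = s.lower()
    s = "".join(ch for ch in s if ch not in set(string.punctuation))
    s = re.sub(r"\b(a|an|the)\b", " ", s)
    return " ".join(s.split())

def exact_match(prediction, gold):
    return float(normalize_answer(prediction) == normalize_answer(gold))

def f1_score(prediction, gold):
    pred_tokens = normalize_answer(prediction).split()
    gold_tokens = normalize_answer(gold).split()
    common = Counter(pred_tokens) & Counter(gold_tokens)
    num_same = sum(common.values())
    if num_same == 0:
        return 0.0
    precision = num_same / len(pred_tokens)
    recall = num_same / len(gold_tokens)
    return 2 * precision * recall / (precision + recall)

print(exact_match("Paris", "paris."))                   # 1.0
print(round(f1_score("in Paris, France", "Paris"), 2))  # 0.5
```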
- Our data preprocessing is based on HGN.
- We reuse the Example class from the HGN model and update it to work with our datasets (a simplified sketch follows).
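For orientation only, here is a heavily simplified, hypothetical sketch of what such an example container can hold for multi-hop QA with supporting facts and evidence triples; the field names are assumptions and do not reproduce HGN's actual class.

```python
# Hypothetical, simplified example container in the spirit of HGN's Example class.
# Field names are assumptions, not HGN's actual API.
from dataclasses import dataclass, field
from typing import List, Tuple

@dataclass
class Example:
    qas_id: str              # unique question id
    question_text: str       # the multi-hop question
    doc_tokens: List[str]    # tokens of the concatenated context
    answer_text: str         # gold answer string
    sup_fact_ids: List[Tuple[str, int]] = field(default_factory=list)           # (title, sentence idx)
    evidence_triples: List[Tuple[str, str, str]] = field(default_factory=list)  # (subject, relation, object), 2Wiki only
```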