Dynamic Coattention Networks For Question Answering

This project aims at implementing a Dynamic Coattention Network proposed by Xionget al.(2017) for Question Answering, learning to find answers spans in a document, given a question from the Stanford Question Answering Dataset (SQuAD2.0), using the PyTorch Framework. Performance is evaluated with the standard tokenwise F1 score and EM (exact match) percentage over the predicted answers.

Several deep learning models have been proposed for question answering. However, due to their single-pass nature, they have no way to recover from local maxima corresponding to incorrect answers. To address this problem, we introduce the Dynamic Coattention Network (DCN) for question answering. The DCN first fuses co-dependent representations of the question and the document in order to focus on relevant parts of both. Then a dynamic pointing decoder iterates over potential answer spans. This iterative procedure enables the model to recover from initial local maxima corresponding to incorrect answers.

Code

The following scripts, stored in this repository, have been developed for implementing Dynamic Coattention Networks for Question Answering using the SQuAD dataset.

preprocessing: Preprocessing done on the dataset including tokenizing the data, loading the GloVE embeddings, getting the embeddings for the data and batching the data for training.
model.py: The Dynamic Coattention Network model, which comprises two components - the Coattention Encoder and the Dynamic Pointer Decoder.
constants.py: Hyper-paramters of the model.
training_pipeline.py: Script to train the model.
produce_answers.py: Script to produce the answers of the given dataset using the model (path) provided.
gen_scores.py: Script to generate scores todo.
evaluate-v2.0.py: Evaluation script to find F1 and EM score of the model.
test_model.py: Testing the model using dummy values (for debugging the model).
gen_plot.py: Script to generate plots in Section 4.
gen_f1_by_length.py: Script to generate plot on distribution of F1 scores with the number of tokens in the predicted answer on the answerable subset of the SQuAD dataset.
index_convergence.py:
make_plot_f1.py:
plot_f1_vs_loss.py:
training_pipeline_freeze_nondpd_weights.py:

The Report for this project can be found here.

Usage

Training

Training model: python3 training_pipeline.py
Resuming training from saved state: python3 training_pipeline.py "model/2020-03-28_22-39-28/epoch0_batch11.par"

Generate scores for a model at different stages throughout its training

Generating scores for a model: python3 gen_scores.py <model_path> <dataset_file_path.json> [optional eval freq.] [optional eval start step].
Example (training set): python3 gen_scores.py ./model/MI1_dropout_encodings_only/ preprocessing/data/subset-1/train-subset-1.json 2000 50000 -- this will eval model at step 50000, 52000, 54000, ... up to the most recent one.
Example (dev set): python3 gen_scores.py ./model/MI1_dropout_encodings_only/ preprocessing/data/dev-v2.0.json.

Note: The dataset file path needs to be something.json and have a corresponding something-tokenized.json for this script to work! The script will generate a file scores_<datasetname>.log in the model folder, as well as two plots (EM and F1).

Produce answer file for evaluation

Generate predictions on SQuAD dev set: python3 produce_answers.py model/2020-04-01_01-07-06/epoch0_batch791.par
Generate predictions on a different dataset: python3 produce_answers.py model/2020-04-01_01-07-06/epoch0_batch791.par preprocessing/data/subset-1/train-subset-1-tokenized.json [optional_prediction_file_path]
Run evaluation: python3 evaluate-v2.0.py preprocessing/data/subset-1/train-subset-1.json predictions.json

Plot F1 score and loss together

First generate the scores log file using gen_scores.py (see separate instructions for that).
Then: python3 plot_f1_vs_loss.py model/mymodel/scores_train-subset-1.log.

Xiong, Caiming, Zhong, Victor, & Socher, Richard. 2017. Dynamic Coattention Networks for Question Answering, International Conference on Learning Representations (ICLR)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Dynamic Coattention Networks For Question Answering

Code

Usage

Training

Generate scores for a model at different stages throughout its training

Produce answer file for evaluation

Plot F1 score and loss together

About

Releases

Packages

Contributors 5

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 408 Commits
.github/workflows		.github/workflows
model		model
preprocessing		preprocessing
.gitignore		.gitignore
README.md		README.md
constants.py		constants.py
evaluate-v2.0.py		evaluate-v2.0.py
gen_f1_by_length.py		gen_f1_by_length.py
gen_plot.py		gen_plot.py
gen_scores.py		gen_scores.py
index_convergence.py		index_convergence.py
loss.log		loss.log
make_plot_f1.py		make_plot_f1.py
model.py		model.py
plot_f1_vs_loss.py		plot_f1_vs_loss.py
produce_answers.py		produce_answers.py
requirements.txt		requirements.txt
test_model.py		test_model.py
training_pipeline.py		training_pipeline.py
training_pipeline_freeze_nondpd_weights.py		training_pipeline_freeze_nondpd_weights.py

asmitapoddar/question-answering-SQuAD2.0-dcn

Folders and files

Latest commit

History

Repository files navigation

Dynamic Coattention Networks For Question Answering

Code

Usage

Training

Generate scores for a model at different stages throughout its training

Produce answer file for evaluation

Plot F1 score and loss together

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 5

Languages

Packages