Paraphrase Identification with multi-perspective CNN and Dynamic k-max/min pooling

This repo contains the Torch implementation of multi-perspective convolutional neural networks with dynamic k-max/min pooling for identifying paraphrase on various dataset including SICK, MSRVID, MSRP and Quora question pair in Kaggle.

Dependencies

Torch
INTEL MKL library
Glove embeddings (run fetech_and_preprocess.sh)

Running for Quora question pair

chmod +x quora_script_train.sh
./quora_script_train.sh

Experiments

We designed 4 different models, those are the following:

Model_kmax : applies the K-Max pooling instead of the simple max-pooling, and then we stack one more convolutional layer and use another max pooling layer to maintain the same dimension.
Model_kmax2 : adding another convolutional layer before choosing maximum k elements in K-Max pooling.
Model_kmaxmin : Adding the min pooling on the model_kmax
Model_kmaxmin2 : Adding the min pooling on the model_kmax2

Results

We are presenting the result of our works based on the MSRVID dataset.

Models	Model_orig	Model_kmax	Model_kmax2	Model_kmaxmin	Model_kmaxmin2
Test score	0.9075	0.9100	0.9088	0.9105	0.9097

Ackowledgement

We thank provider of authors of the implementation codes for Multi-Perspective Convolutional Neural Networks for Modeling Textual Similarity and also author of

Hua He, Kevin Gimpel, and Jimmy Lin. Multi-Perspective Sentence Similarity Modeling with Convolutional Neural Networks. Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP 2015), pages 1576-1586.

We also thank the public data providers and Torch developers for deep learning libaries and specially for implementation of Dynamic K Max Pooling as described in the paper:

Blunsom, Phil, Edward Grefenstette, and Nal Kalchbrenner. A convolutional neural network for modelling sentences Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics. Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014.

Name		Name	Last commit message	Last commit date
Latest commit History 104 Commits
data		data
model		model
predictions		predictions
scripts		scripts
test_result		test_result
util		util
Conv.lua		Conv.lua
CsDis.lua		CsDis.lua
PaddingReshape.lua		PaddingReshape.lua
README.md		README.md
STS_test.lua		STS_test.lua
TemporalDynamicKMinPooling.lua		TemporalDynamicKMinPooling.lua
fetch_and_preprocess.sh		fetch_and_preprocess.sh
models_kmax.lua		models_kmax.lua
models_kmax2.lua		models_kmax2.lua
models_kmaxmin.lua		models_kmaxmin.lua
models_kmaxmin2.lua		models_kmaxmin2.lua
prac.lua		prac.lua
quora_script_test.sh		quora_script_test.sh
quora_script_train.sh		quora_script_train.sh
test_splitter.py		test_splitter.py
trainMSRP.lua		trainMSRP.lua
trainMSRVID2.lua		trainMSRVID2.lua
trainQUORA.lua		trainQUORA.lua
trainSIC.lua		trainSIC.lua

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Paraphrase Identification with multi-perspective CNN and Dynamic k-max/min pooling

Dependencies

Running for Quora question pair

Experiments

Results

Ackowledgement

About

Releases

Packages

Contributors 3

Languages

h-j-han/Paraphrase_Identification_CNN

Folders and files

Latest commit

History

Repository files navigation

Paraphrase Identification with multi-perspective CNN and Dynamic k-max/min pooling

Dependencies

Running for Quora question pair

Experiments

Results

Ackowledgement

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages