This repository contains the code for the mini-project of the MSc course Deep Learning for Natural Language Processing.
We address the classical NLP problem of question classification; the project consists of two parts.
Classification Task

- We choose Facebook's FastText as our baseline and further implement two neural models, an LSTM and a TextCNN.
- The three models are compared in terms of overall classification accuracy and the per-category precision, recall and F1-score.
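Purely as a reference for how such per-category scores can be computed (this is not code from this repository), a minimal scikit-learn sketch with placeholder labels and predictions:

```python
# Minimal sketch (placeholder data, not this repository's evaluation code):
# overall accuracy plus per-category precision/recall/F1 with scikit-learn.
from sklearn.metrics import accuracy_score, classification_report

# Toy gold labels and model predictions for a 3-class example.
y_true = ["LOC", "HUM", "NUM", "LOC", "HUM", "NUM"]
y_pred = ["LOC", "HUM", "NUM", "HUM", "HUM", "NUM"]

print("Accuracy:", accuracy_score(y_true, y_pred))
# classification_report prints precision, recall and F1 for each category.
print(classification_report(y_true, y_pred, digits=3))
```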
Rationale Extraction

- A layer of binary latent variables is added to our neural models; it selects which parts of the input are exposed to the classifier as features. This makes the models' predictions more interpretable.
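As an illustration of the idea only, the PyTorch sketch below gates token embeddings with sampled binary latent variables, using a straight-through estimator to keep the layer trainable. The module name, shapes and dimensions here are assumptions, not this project's actual implementation (see the `Rationale` module for that).

```python
# Illustrative sketch only (assumed names and shapes, not this repository's code):
# a binary latent "rationale" layer that gates token embeddings before classification.
import torch
import torch.nn as nn

class RationaleGate(nn.Module):
    def __init__(self, emb_dim: int):
        super().__init__()
        self.scorer = nn.Linear(emb_dim, 1)  # per-token selection logit

    def forward(self, emb: torch.Tensor):
        # emb: (batch, seq_len, emb_dim)
        probs = torch.sigmoid(self.scorer(emb))  # per-token selection probabilities
        hard = torch.bernoulli(probs)            # binary latent variables (0/1 mask)
        # Straight-through estimator: the forward pass uses the hard mask,
        # the backward pass propagates gradients through the soft probabilities.
        mask = hard + probs - probs.detach()
        return emb * mask, mask.squeeze(-1)

# Toy usage: gate a batch of 4 sentences of length 10 with 50-dim embeddings.
gate = RationaleGate(emb_dim=50)
tokens = torch.randn(4, 10, 50)
gated, mask = gate(tokens)
print(gated.shape, mask.shape)  # torch.Size([4, 10, 50]) torch.Size([4, 10])
```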
The dataset we use can be found here.
- Install the conda environment by running `conda env create -f environment.yml`.
- Activate it with `conda activate dlnlp`.
- [Optional] Download the pre-trained Word2Vec word embeddings and unzip them into the folder `preprocessing` (a generic loading sketch is given after this list).
- [Optional] Preprocess the dataset and extract word embeddings by running `python -m dataset`.
- Test the FastText model by running `python -m FastText --mode=eval`.
- Test the LSTM model by running `python -m LSTM --mode=eval`.
- Test the TextCNN model by running `python -m TextCNN --mode=eval`.
- Test the rationale extraction model by running `python -m Rationale --mode=eval`.
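For the optional word-embedding step above, the following is a generic sketch of loading pre-trained Word2Vec vectors with gensim. The file name (Google News vectors) and the example word are assumptions and may differ from what the repository's preprocessing code actually expects.

```python
# Generic sketch (assumed file name, not necessarily what the `dataset` module expects):
# loading pre-trained Word2Vec vectors with gensim and looking up a word.
from gensim.models import KeyedVectors

# e.g. the Google News vectors, placed under the `preprocessing` folder.
path = "preprocessing/GoogleNews-vectors-negative300.bin"
word_vectors = KeyedVectors.load_word2vec_format(path, binary=True)

vec = word_vectors["question"]  # 300-dimensional numpy array
print(vec.shape)
print(word_vectors.most_similar("question", topn=3))
```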