NLP Paper Summaries

This repository contains a list of NLP paper summaries intended to make NLP techniques and topics more approachable and accessible. Work in progress. There is a lot more to be added/improved in the coming days.

How to contribute

If you have blogged about an NLP paper or technique or find an interesting read out there, I encourage you to share with our community. To add your blog posts or summaries to this list just hit on the edit button on this README.md. You can then add your entry by modiying the readme file and submitting a PR which will be reviewed before going live. Alternatively, we can work on transferring the summaries to this repo itself so as to make them more accessible, which is what I am currently doing with some of my own paper summaries below.

If you would like to contribute by blogging about an NLP paper/technique, you can check out our suggestion/guidance at this issue.

And if you need any ideas on how else to contribute to this repo, take a look in the issues section. We are in need of maintainers.

For now, I have adopted a few tracks from ACL for the grouping but this can change based on the granularity of grouping that is needed. Open to ideas here.

Note that we currently provide the source of the where the summary originated from. We are working with a few authors to migrate the content directly to this repo so that summaries are centralized and easily accessible. This also simplifies the way others can contribute to this project. When a summary has been fully migrated to this repo, we will tag the summary as "GitHub" under the "Summary" tab of the tables below to identify them easily.

We are including an extra TL;DR section wherever applicable. This is not meant as a full-fledged summary but rather covers the key points of each paper and serves as a refresher for those who have previously encountered the paper.

If you are facing any issues submitting your PR, just send me an email at ellfae@gmail.com or DM me on Twitter.

Cognitive Modeling and Psycholinguistics

2018

Title	Summary	Paper Source
Detecting Linguistic Characteristics of Alzheimer’s Dementia by Interpreting Neural Models	dair.ai	Paper

Computational Social Science and Social Media

2017

Title	Summary	Paper Source
Multimodal Classification for Analysing Social Media	dair.ai	Paper

Dialogue and Interactive Systems

2020

Title	Summary	Paper Source
Towards a Human-like Open-Domain Chatbot	Google AI Blog	Paper

2019

Title	Summary	Paper Source	TL;DR
What makes a good conversation? How controllable attributes affect human judgements	Abigail See	Paper	Richard Csaky
Learning from Dialogue after Deployment: Feed Yourself, Chatbot!	-	Paper	Richard Csaky
TransferTransfo: A Transfer Learning Approach for Neural Network Based Conversational Agents	-	Paper	Richard Csaky
Learning to Select Knowledge for Response Generation in Dialog Systems	-	Paper	Richard Csaky
Consistent Dialogue Generation with Self-supervised Feature Learning	-	Paper	Richard Csaky
Evaluating Coherence in Dialogue Systems using Entailment	-	Paper	Richard Csaky
Pretraining Methods for Dialog Context Representation Learning	-	Paper	Richard Csaky
Self-Supervised Dialogue Learning	-	Paper	Richard Csaky
Structured Fusion Networks for Dialog	-	Paper	Richard Csaky
Investigating Evaluation of Open-Domain Dialogue Systems With Human Generated Multiple References	-	Paper	Richard Csaky
Multi-Granularity Representations of Dialog	-	Paper	Richard Csaky

2018

Title	Paper Source	TL;DR
Personalizing Dialogue Agents: I have a dog, do you have pets too?	Paper	Richard Csaky
Building a Conversational Agent Overnight with Dialogue Self-Play	Paper	Richard Csaky
Topic-based Evaluation for Conversational Bots	Paper	Richard Csaky
Improving Variational Encoder-Decoders in Dialogue Generation	Paper	Richard Csaky
A Hierarchical Latent Structure for Variational Conversation Modeling	Paper	Richard Csaky
Unsupervised Discrete Sentence Representation Learning for Interpretable Neural Dialog Generation	Paper	Richard Csaky
Sounding Board: A User-Centric and Content-Driven Social Chatbot	Paper	Richard Csaky
DialogWAE: Multimodal Response Generation with Conditional Wasserstein Auto-Encoder	Paper	Richard Csaky
Reinforcing Coherence for Sequence to Sequence Model in Dialogue Generation	Paper	Richard Csaky
Learning to Ask Questions in Open-domain Conversational Systems with Typed Decoders	Paper	Richard Csaky
Multi-turn Dialogue Response Generation in an Adversarial Learning Framework	Paper	Richard Csaky
Zero-Shot Dialog Generation with Cross-Domain Latent Actions	Paper	Richard Csaky
Towards Explainable and Controllable Open Domain Dialogue Generation with Dialogue Acts	Paper	Richard Csaky
Why Do Neural Response Generation Models Prefer Universal Replies?	Paper	Richard Csaky
Aiming to Know You Better Perhaps Makes Me a More Engaging Dialogue Partner	Paper	Richard Csaky
Generating More Interesting Responses in Neural Conversation Models with Distributional Constraints	Paper	Richard Csaky
Adversarial Over-Sensitivity and Over-Stability Strategies for Dialogue Models	Paper	Richard Csaky
Training Millions of Personalized Dialogue Agents	Paper	Richard Csaky
Generating Informative and Diverse Conversational Responses via Adversarial Information Maximization	Paper	Richard Csaky
Better Conversations by Modeling, Filtering, and Optimizing for Coherence and Diversity	Paper	Richard Csaky
A Dataset for Document Grounded Conversations	Paper	Richard Csaky
Talking to myself: self-dialogues as data for conversational agents	Paper	Richard Csaky
Neural Approaches to Conversational AI	Paper	Richard Csaky
Contextual Topic Modeling for Dialog Systems	Paper	Richard Csaky
Automatic Evaluation of Neural Personality-based Chatbots	Paper	Richard Csaky
NEXUS Network: Connecting the Preceding and the Following in Dialogue Generation	Paper	Richard Csaky
MultiWOZ - A Large-Scale Multi-Domain Wizard-of-Oz Dataset for Task-Oriented Dialogue Modelling	Paper	Richard Csaky
The RLLChatbot: a solution to the ConvAI challenge	Paper	Richard Csaky
Neural Response Ranking for Social Conversation: A Data-Efficient Approach	Paper	Richard Csaky
Generating Multiple Diverse Responses for Short-Text Conversation	Paper	Richard Csaky
Wizard of Wikipedia: Knowledge-Powered Conversational agents	Paper	Richard Csaky
Importance of a Search Strategy in Neural Dialogue Modelling	Paper	Richard Csaky
A Study on Dialogue Reward Prediction for Open-Ended Conversational Agents	Paper	Richard Csaky

2017

Title	Paper Source	TL;DR
RUBER: An Unsupervised Method for Automatic Evaluation of Open-Domain Dialog Systems	Paper	Richard Csaky
Adversarial Learning for Neural Dialogue Generation	Paper	Richard Csaky
Hierarchical Recurrent Attention Network for Response Generation	Paper	Richard Csaky
A Copy-Augmented Sequence-to-Sequence Architecture Gives Good Performance on Task-Oriented Dialogue	Paper	Richard Csaky
Generating High-Quality and Informative Conversation Responses with Sequence-to-Sequence Models	Paper	Richard Csaky
A Knowledge-Grounded Neural Conversation Model	Paper	Richard Csaky
Batch Policy Gradient Methods for Improving Neural Conversation Models	Paper	Richard Csaky
Hybrid Code Networks: practical and efficient end-to-end dialog control with supervised and reinforcement learning	Paper	Richard Csaky
Learning Conversational Systems that Interleave Task and Non-Task Content	Paper	Richard Csaky
Learning Discourse-level Diversity for Neural Dialog Models using Conditional Variational Autoencoders	Paper	Richard Csaky
Emotional Chatting Machine: Emotional Conversation Generation with Internal and External Memory	Paper	Richard Csaky
Not All Dialogues are Created Equal: Instance Weighting for Neural Conversational Models	Paper	Richard Csaky
Chat Detection in an Intelligent Assistant: Combining Task-oriented and Non-task-oriented Spoken Dialogue Systems	Paper	Richard Csaky
A Conditional Variational Framework for Dialog Generation	Paper	Richard Csaky
ParlAI: A Dialog Research Software Platform	Paper	Richard Csaky
Emergence of Language with Multi-agent Games: Learning to Communicate with Sequences of Symbols	Paper	Richard Csaky
Generative Encoder-Decoder Models for Task-Oriented Spoken Dialog Systems with Chatting Capability	Paper	Richard Csaky
Personalization in Goal-Oriented Dialog	Paper	Richard Csaky
Natural Language Does Not Emerge ‘Naturally’ in Multi-Agent Dialog	Paper	Richard Csaky
Deal or No Deal? End-to-End Learning for Negotiation Dialogues	Paper	Richard Csaky
DeepProbe: Information Directed Sequence Understanding and Chatbot Design via Recurrent Neural Networks	Paper	Richard Csaky
Enterprise to Computer: Star Trek chatbot	Paper	Richard Csaky
Domain Aware Neural Dialog System	Paper	Richard Csaky
Towards an Automatic Turing Test: Learning to Evaluate Dialogue Responses	Paper	Richard Csaky
A Deep Reinforcement Learning Chatbot	Paper	Richard Csaky
Challenging Neural Dialogue Models with Natural Data: Memory Networks Fail on Incremental Phenomena	Paper	Richard Csaky
Flexible End-to-End Dialogue System for Knowledge Grounded Conversation	Paper	Richard Csaky
Edina: Building an Open Domain Socialbot with Self-dialogues	Paper	Richard Csaky
Interactive Policy Learning In End-to-End Trainable Task-Oriented Neural Dialog Models	Paper	Richard Csaky
Augmenting End-to-End Dialog Systems with Commonsense Knowledge	Paper	Richard Csaky
Multi-Task Learning for Speaker-Role Adaptation in Neural Conversation Models	Paper	Richard Csaky
DailyDialog: A Manually Labelled Multi-turn Dialogue Dataset	Paper	Richard Csaky
A Dual Encoder Sequence to Sequence Model for Open-Domain Dialogue Modeling	Paper	Richard Csaky
Adversarial Advantage Actor-Critic Model For Task-Completion Dialogue Policy Learning	Paper	Richard Csaky
Customized Nonlinear Bandits for Online Response Selection in Neural Conversation Models	Paper	Richard Csaky
End-to-end Adversarial Learning for Generative Conversational Agents	Paper	Richard Csaky
Fine Grained Knowledge Transfer for Personalized Task-oriented Dialogue Systems	Paper	Richard Csaky
BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems	Paper	Richard Csaky
End-to-End Optimization of Task-Oriented Dialogue Model with Deep Reinforcement Learning	Paper	Richard Csaky
Are You Talking to Me? Reasoned Visual Dialog Generation through Adversarial Learning	Paper	Richard Csaky
RubyStar: A Non-Task-Oriented Mixture Model Dialog System	Paper	Richard Csaky
Examining Cooperation in Visual Dialog Models	Paper	Richard Csaky
Why Do Neural Dialog Systems Generate Short and Meaningless Replies? A Comparison between Dialog and Translation	Paper	Richard Csaky
End-to-End Offline Goal-Oriented Dialog Policy Learning via Policy Gradient	Paper	Richard Csaky
Toward Continual Learning for Conversational Agents	Paper	Richard Csaky
An Ensemble Model with Ranking for Social Dialogue	Paper	Richard Csaky

2016

Title	Paper Source	TL;DR
Incorporating Copying Mechanism in Sequence-to-Sequence Learning	Paper	Richard Csaky
A Persona-Based Neural Conversation Model	Paper	Richard Csaky
How NOT To Evaluate Your Dialogue System: An Empirical Study of Unsupervised Evaluation Metrics for Dialogue Response Generation	Paper	Richard Csaky
StalemateBreaker: A Proactive Content-Introducing Approach to Automatic Human-Computer Conversation	Paper	Richard Csaky
A Network-based End-to-End Trainable Task-oriented Dialogue System	Paper	Richard Csaky
A Hierarchical Latent Variable Encoder-Decoder Model for Generating Dialogues	Paper	Richard Csaky
Deep Reinforcement Learning for Dialogue Generation	Paper	Richard Csaky
An Attentional Neural Conversation Model with Improved Specificity	Paper	Richard Csaky
Topic Aware Neural Response Generation	Paper	Richard Csaky
Multiresolution Recurrent Neural Networks: An Application to Dialogue Response Generation	Paper	Richard Csaky
Neural Discourse Modeling of Conversations	Paper	Richard Csaky
Neural Contextual Conversation Learning with Labeled Question-Answering Pairs	Paper	Richard Csaky
Sequence to Backward and Forward Sequences: A Content-Introducing Approach to Generative Short-Text Conversation	Paper	Richard Csaky
Two are Better than One: An Ensemble of Retrieval- and Generation-Based Dialog Systems	Paper	Richard Csaky
Dialogue Learning With Human-In-The-Loop	Paper	Richard Csaky
Deep Active Learning for Dialogue Generation	Paper	Richard Csaky

2015

Title	Paper Source	TL;DR
Neural Responding Machine for Short-Text Conversation	Paper	Richard Csaky
A Neural Conversational Model	Paper	Richard Csaky
A Neural Network Approach to Context-Sensitive Generation of Conversational Responses	Paper	Richard Csaky
Building End-To-End Dialogue Systems Using Generative Hierarchical Neural Network Models	Paper	Richard Csaky
A Diversity-Promoting Objective Function for Neural Conversation Models	Paper	Richard Csaky
Attention with Intention for a Neural Network Conversation Model	Paper	Richard Csaky
A Survey of Available Corpora for Building Data-Driven Dialogue Systems	Paper	Richard Csaky

Ethics and NLP

2019

Title	Summary	Paper Source
Defending Against Neural Fake News	Ai2 Blog	Paper

2018

Title	Summary	Paper Source
Troubling Trends in Machine Learning Scholarship	dair.ai	Paper
Examining Gender and Race Bias in Two Hundred Sentiment Analysis Systems	dair.ai	Paper

Generation

2019

Title	Summary	Paper Source
Encode, Tag, Realize: High-Precision Text Editing	Google AI Blog	Paper

2016

Title	Paper Source	TL;DR
Latent Predictor Networks for Code Generation	Paper	Richard Csaky

Interpretability and Analysis of Models for NLP

2019

Title	Summary	Paper Source
Revealing the Dark Secrets of BERT	Text Machine Blog	Paper
Probing Neural Network Comprehension of Natural Language Arguments	The Gradient	Paper
What Does BERT Look At? An Analysis of BERT's Attention	dair.ai	Paper
Are Sixteen Heads Really Better than One?	CMU ML Blog	Paper

2018

Title	Summary	Paper Source
Olive Oil is Made of Olives, Baby Oil is Made for Babies: Interpreting Noun Compounds using Paraphrases in a Neural Model	dair.ai	Paper

Language Modeling

2020

Title	Summary	Paper Source
Reformer: The Efficient Transformer	Pragmatic ML	Paper
ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators	Google AI Blog	Paper

2019

Title	Summary	Paper Source	TL;DR
Language Models are Unsupervised Multitask Learners	-	Paper	Richard Csaky
Plug and Play Language Models: A Simple Approach to Controlled Text Generation	Uber Engineering	Paper	-
ALBERT: A Lite BERT for Self-Supervised Learning Of Language Representations	Amit Chaudhary	Paper	-
Fine-Tuning GPT-2 from Human Preferences	Open AI	Paper	-
XLNet: Generalized Autoregressive Pretraining for Language Understanding	dair.ai	Paper	-
Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context	dair.ai	Paper	-

2018

Title	Paper Source	TL;DR
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding	Paper	Richard Csaky

2017

Title	Paper Source	TL;DR
Adversarial Generation of Natural Language	Paper	Richard Csaky
Training RNNs as Fast as CNNs	Paper	Richard Csaky

Machine Translation

2019

Title	Paper Source	TL;DR
Beyond BLEU: Training Neural Machine Translation with Semantic Similarity	Paper	Richard Csaky

2017

Title	Paper Source	TL;DR
Convolutional Sequence to Sequence Learning	Paper	Richard Csaky
Depthwise Separable Convolutions for Neural Machine Translation	Paper	Richard Csaky
Attention Is All You Need	Paper	Richard Csaky
A Continuous Relaxation of Beam Search for End-to-end Training of Neural Sequence Models	Paper	Richard Csaky
Cold Fusion: Training Seq2Seq Models Together with Language Models	Paper	Richard Csaky
Dynamic Evaluation of Neural Sequence Models	Paper	Richard Csaky
Emergent Translation in Multi-Agent Communication	Paper	Richard Csaky
Plan, Attend, Generate: Planning for Sequence-to-Sequence Models	Paper	Richard Csaky
Unsupervised Machine Translation Using Monolingual Corpora Only	Paper	Richard Csaky
Classical Structured Prediction Losses for Sequence to Sequence Learning	Paper	Richard Csaky

2016

Title	Paper Source	TL;DR
Sequence-to-Sequence Learning as Beam-Search Optimization	Paper	Richard Csaky
Deep Recurrent Models with Fast-Forward Connections for Neural Machine Translation	Paper	Richard Csaky
Temporal Attention Model for Neural Machine Translation	Paper	Richard Csaky
Google’s Neural Machine Translation System: Bridging the Gap between Human and Machine Translation	Paper	Richard Csaky
Can Active Memory Replace Attention?	Paper	Richard Csaky
Neural Machine Translation in Linear Time	Paper	Richard Csaky
Unsupervised Pretraining for sequence to sequence learning	Paper	Richard Csaky

2014

Title	Paper Source	TL;DR
Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation	Paper	Richard Csaky
Sequence to Sequence Learning with Neural Networks	Paper	Richard Csaky
Neural Machine Translation by Jointly Learning to Align and Translate	Paper	Richard Csaky

Multi-Task Learning

2018

Title	Summary	Paper Source
A Hierarchical Multi-task Approach for Learning Embeddings from Semantic Tasks	dair.ai	Paper

2017

Title	Paper Source	TL;DR
One Model To Learn Them All	Paper	Richard Csaky

Resources and Evaluation

2018

Title	Summary	Paper Source
nocaps: novel object captioning at scale	dair.ai	Paper

Sentiment Analysis, Stylistic Analysis, and Argument Mining

2019

Title	Summary	Paper Source
Emotion-Cause Pair Extraction: A New Task to Emotion Analysis in Texts	dair.ai	Paper

2018

Title	Summary	Paper Source
DialogueRNN: An Attentive RNN for Emotion Detection in Conversations	dair.ai	Paper
Exploring Emoji Usage and Prediction Through a Temporal Variation Lens	dair.ai	Paper

2017

Title	Summary	Paper Source
Context-Dependent Sentiment Analysis in User-Generated Videos	dair.ai	Paper

2016

Title	Summary	Paper Source
A Deeper Look into Sarcastic Tweets Using Deep Convolutional Neural Networks	dair.ai	Paper

Summarization

2017

Title	Summary	Paper Source
Get To The Point: Summarization with Pointer-Generator Networks	Abigail See	Paper

2016

Title	Paper Source	TL;DR
Abstractive Text Summarization using Sequence-to-sequence RNNs and Beyond	Paper	Richard Csaky

Overviews / Surveys / Highlights

2019

Title	Summary	Paper Source
Generalized Language Models	Lilian Weng	-
10 ML & NLP Research Highlights of 2019	Sebastian Ruder	-
Google Research: Looking Back at 2019, and Forward to 2020 and Beyond	Google AI Blog	-
NLP Year in Review — 2019	dair.ai	-

2018

Title	Summary	Paper Source
A Light Introduction to Transfer Learning for NLP	dair.ai	-
Attention? Attention!	Lilian Weng	-

2017

Title	Summary	Paper Source
Recent Trends in Deep Learning Based Natural Language Processing	dair.ai	Paper

Miscellaneous (To be categorized)

2019

Title	Summary	Paper Source
Parameter-Efficient Transfer Learning for NLP	dair.ai	Paper

TODO List

Categorize papers; I am aiming to categorize paper summaries by using the categories in the NLP Progress or prominent tracks used in NLP conferences; if you have any suggestions/questions please open an issue or send me a message directly on Twitter or via email.
Include more info about each paper summaries, excerpt, authors, source, year, etc.
More paper summaries are coming!
Move all paper summaries to GitHub so others can also edit, contribute, and review

Name		Name	Last commit message	Last commit date
Latest commit History 54 Commits
README.md		README.md

jamshaidsohail5/nlp_paper_summaries

Folders and files

Latest commit

History

Repository files navigation

NLP Paper Summaries

How to contribute

Table of Contents

Cognitive Modeling and Psycholinguistics

2018

Computational Social Science and Social Media

2017

Dialogue and Interactive Systems

2020

2019

2018

2017

2016

2015

Ethics and NLP

2019

2018

Generation

2019

2016

Interpretability and Analysis of Models for NLP

2019

2018

Language Modeling

2020

2019

2018

2017

Machine Translation

2019

2017

2016

2014

Multi-Task Learning

2018

2017

Resources and Evaluation

2018

Sentiment Analysis, Stylistic Analysis, and Argument Mining

2019

2018

2017

2016

Summarization

2017

2016

Overviews / Surveys / Highlights

2019

2018

2017

Miscellaneous (To be categorized)

2019

TODO List

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages