This repository contains implementations of abstractive text summarization using Recurrent Neural Networks (RNNs) with Gated Recurrent Units (GRUs), RNNs with reinforcement learning, and Transformer architectures. The project explores and compares how different neural network models perform in generating concise and coherent summaries from long text documents.
- Transformer/Custom_Transformer_Model.ipynb: Jupyter notebook with a custom Transformer model tailored for text summarization.
- Transformer/T5_implementation.ipynb: Application of the pre-trained T5 model to the text summarization task.
- Transformer/Bart_Implementation.ipynb: Application of the pre-trained BART model to the text summarization task.
- Transformer/BART_T5_GRAPH.ipynb: Plots graphs from the output logs generated by our BART and T5 implementations.
- RNN/RNN_Model.ipynb: Demonstrates abstractive text summarization using a GRU-based RNN.
- RNN/RL_RNN_Implementation.ipynb: Implementation of a hybrid RNN model trained with reinforcement learning (RL).
- README.md: Provides an overview and instructions for setting up and running the models.
Custom Transformer Model
- Implements self-attention and multi-head attention mechanisms to weigh the importance of each word in the text, aiming to capture complex word relationships more effectively than RNNs.
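For reference, the core of self-attention is scaled dot-product attention; multi-head attention runs several of these in parallel over different learned projections. A minimal PyTorch sketch (the tensor shapes and head count here are illustrative, not taken from the notebook):

```python
import torch
import torch.nn.functional as F

def scaled_dot_product_attention(q, k, v, mask=None):
    # q, k, v: (batch, heads, seq_len, d_k)
    d_k = q.size(-1)
    # pairwise affinities between every pair of positions, scaled by sqrt(d_k)
    scores = q @ k.transpose(-2, -1) / d_k ** 0.5
    if mask is not None:
        scores = scores.masked_fill(mask == 0, float("-inf"))
    # softmax turns affinities into per-word importance weights
    weights = F.softmax(scores, dim=-1)
    return weights @ v

# toy usage: 2 sentences, 4 heads, 10 tokens, 16 dimensions per head
q = k = v = torch.randn(2, 4, 10, 16)
print(scaled_dot_product_attention(q, k, v).shape)  # torch.Size([2, 4, 10, 16])
```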
T5 Model
- Utilizes a pre-trained Transformer model fine-tuned for summarization, benefiting from transfer learning to enhance summary quality efficiently.
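For orientation, summarization inference with a pre-trained T5 checkpoint through the Hugging Face transformers library looks roughly like the sketch below; the checkpoint name (t5-small) and generation settings are assumptions, and T5_implementation.ipynb may fine-tune and configure things differently.

```python
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

# "t5-small" is a placeholder checkpoint; the notebook may use a larger variant
tokenizer = AutoTokenizer.from_pretrained("t5-small")
model = AutoModelForSeq2SeqLM.from_pretrained("t5-small")

article = "Your news article text goes here ..."
# T5 is a text-to-text model, so the task is signalled with a "summarize: " prefix
inputs = tokenizer("summarize: " + article, return_tensors="pt",
                   truncation=True, max_length=512)
ids = model.generate(**inputs, max_length=64, num_beams=4, early_stopping=True)
print(tokenizer.decode(ids[0], skip_special_tokens=True))
```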
BART Model
- Implements a denoising autoencoder using a bidirectional encoder to capture rich text features and a left-to-right autoregressive decoder for generating coherent text summaries.
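BART inference follows the same pattern; facebook/bart-large-cnn below is a public checkpoint already fine-tuned on CNN/DailyMail, and whether Bart_Implementation.ipynb starts from it is an assumption:

```python
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

tokenizer = AutoTokenizer.from_pretrained("facebook/bart-large-cnn")
model = AutoModelForSeq2SeqLM.from_pretrained("facebook/bart-large-cnn")

article = "Your news article text goes here ..."
# unlike T5, BART needs no task prefix; the encoder reads the article bidirectionally
inputs = tokenizer(article, return_tensors="pt", truncation=True, max_length=1024)
# the decoder then generates the summary left to right (autoregressively)
ids = model.generate(**inputs, max_length=64, num_beams=4, early_stopping=True)
print(tokenizer.decode(ids[0], skip_special_tokens=True))
```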
RNN Model
- Uses GRUs to sequentially process text data and generate summaries, focusing on capturing temporal text dependencies.
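A minimal sketch of a GRU encoder-decoder in PyTorch, to illustrate the sequential processing described above; the layer sizes are hypothetical, and the notebook's actual architecture (attention, depth, vocabulary) may differ:

```python
import torch
import torch.nn as nn

class GRUEncoder(nn.Module):
    def __init__(self, vocab_size, emb_dim=128, hid_dim=256):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.gru = nn.GRU(emb_dim, hid_dim, batch_first=True)

    def forward(self, src):
        # src: (batch, src_len) token ids; the final hidden state condenses the article
        _, hidden = self.gru(self.embed(src))
        return hidden

class GRUDecoder(nn.Module):
    def __init__(self, vocab_size, emb_dim=128, hid_dim=256):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.gru = nn.GRU(emb_dim, hid_dim, batch_first=True)
        self.out = nn.Linear(hid_dim, vocab_size)

    def forward(self, tok, hidden):
        # one step: previous summary token -> distribution over the next token
        output, hidden = self.gru(self.embed(tok), hidden)
        return self.out(output), hidden

# toy usage: encode a batch of 2 articles, then take one decoding step
enc, dec = GRUEncoder(vocab_size=50_000), GRUDecoder(vocab_size=50_000)
hidden = enc(torch.randint(0, 50_000, (2, 40)))
logits, hidden = dec(torch.randint(0, 50_000, (2, 1)), hidden)
print(logits.shape)  # torch.Size([2, 1, 50000])
```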
RNN with RL
- Integrates reinforcement learning with a recurrent neural network, optimizing the summarization process through reward-based learning to improve summary relevance.
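One common recipe for this is self-critical policy gradient (REINFORCE with a greedy baseline), where the reward is typically the ROUGE score of a sampled summary minus that of the greedy decode; treating this as the notebook's exact formulation is an assumption. A minimal sketch of the loss:

```python
import torch

def rl_loss(sample_log_probs, sample_reward, baseline_reward):
    # sample_log_probs: (batch, seq_len) log-probs of the sampled summary tokens
    # sample_reward / baseline_reward: (batch,), e.g. ROUGE-L of sampled vs. greedy output
    advantage = sample_reward - baseline_reward  # positive when sampling beat greedy
    # scale each sequence's log-likelihood by its advantage (REINFORCE with baseline)
    return -(advantage.unsqueeze(1) * sample_log_probs).sum(dim=1).mean()

# toy usage with stand-in log-probs and rewards
logp = -torch.rand(2, 12)
print(rl_loss(logp, torch.tensor([0.42, 0.35]), torch.tensor([0.40, 0.38])))
```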
To set up this project locally, run the following commands:
git clone git@github.com:Navya0203/Abstractive-Text-Summarization-Using-RNN-and-Transformers.git
cd Abstractive-Text-Summarization-Using-RNN-and-Transformers
Prepare the dataset:
- Download the CNN/DailyMail dataset from Kaggle: https://www.kaggle.com/datasets/gowrishankarp/newspaper-text-summarization-cnn-dailymail
- Place the dataset folder inside a DLProject/ folder at the root of your Google Drive, where the notebooks expect it (see the mounting sketch below).
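Since the notebooks expect the data on Google Drive, they presumably mount Drive from Colab along these lines (the dataset subfolder name is a placeholder; use whatever the Kaggle download unpacks to):

```python
from google.colab import drive

drive.mount('/content/drive')  # prompts for authorization on first run

# DLProject/ at the Drive root, per the setup step above; subfolder name is hypothetical
DATA_DIR = '/content/drive/MyDrive/DLProject/cnn_dailymail'
```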
Navigate to the RNN directory for its implementations:
cd RNN
There you will find two notebooks for the RNN implementations: RNN_Model.ipynb and RL_RNN_Implementation.ipynb.
Navigate to the Transformer directory for its implementations:
cd Transformer
There you will find four notebooks: three Transformer implementations (Custom_Transformer_Model.ipynb, T5_implementation.ipynb, and Bart_Implementation.ipynb), plus BART_T5_GRAPH.ipynb, which plots graphs from the output logs produced by running the other three.
As the model checkpoints total about 3 GB, we could not add them to the repository; they can be accessed via the following links: https://drive.google.com/drive/folders/18zteggWePbrnLoqO1J_YTrO7ABYEycwy?usp=sharing https://drive.google.com/drive/u/0/folders/1V8XbXvDASKm8ewP4pdcSTeHBas43HU3p
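After downloading a checkpoint from the links above, it can be restored locally. A minimal sketch, assuming the Transformer checkpoints were written with save_pretrained; the directory name is a placeholder, and the actual file formats in the Drive folders may differ:

```python
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

# hypothetical local path to an unpacked checkpoint downloaded from Drive
CKPT_DIR = "checkpoints/bart_finetuned"

tokenizer = AutoTokenizer.from_pretrained(CKPT_DIR)
model = AutoModelForSeq2SeqLM.from_pretrained(CKPT_DIR)
model.eval()  # inference mode for generating summaries
```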