LSTM - Neural Network - Hamlet Dataset

The LSTM Next Word Prediction project trained on the Hamlet dataset involves building a deep learning model that predicts the next word in a sequence of text. The project typically follows these steps:

Data Preprocessing: The text of Shakespeare's Hamlet is cleaned and processed, converting it into a sequence of tokens (words). The text is then split into input-output pairs, where the input is a sequence of words, and the output is the next word in the sequence.
Model Architecture: A Long Short-Term Memory (LSTM) network, which is a type of Recurrent Neural Network (RNN), is used due to its ability to learn long-term dependencies in sequential data. The LSTM model is typically designed with an embedding layer, followed by one or more LSTM layers, and a dense output layer with a softmax activation to predict the probability distribution of the next word.
Training: The model is trained on the Hamlet text, with the goal of minimizing the loss, typically using categorical cross-entropy. The training process involves feeding sequences of words into the model and adjusting the weights to improve the accuracy of the next word prediction.
Prediction: Once trained, the model can generate text by predicting the next word in a sequence. Given a starting phrase or sequence of words from Hamlet, the model predicts the most likely next word, which can then be fed back into the model to generate further text.
Evaluation: The model's performance is evaluated based on its accuracy in predicting the correct next word in the sequence. Metrics like perplexity might be used to assess the model's efficiency in language modeling tasks.

This project showcases how LSTM networks can be applied to natural language processing tasks, specifically for generating text and predicting sequences based on historical data.

Live Deployment

This project is hosted on - https://lstm-hamlet-ntk6xvyx48gfumb2aurksb.streamlit.app/

Run Locally

Clone this repo into your system.
Create virtual environment using the command -

 conda create -p myenv python==3.9.0

Now install all the packages which are listed in requirements.txt

 pip install -r requirements.txt

Now run all the cell in the Experiments.ipynb And Prediction.ipynb as per your need.
To run on streamlit -

    streamlit run app.py

Tech Stack

Frontend Client: Streamlit Services

Model Used: RNN-LSTM (Long Short Term Memory) Neural Network

Dataset Used: NLTK Hamlet Dataset

Feedback

If you have any feedback or just to say Hi!, please reach out to me at mahobiashubham4@gmail.com

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
README.md		README.md
app.py		app.py
experiments.ipynb		experiments.ipynb
hamlet.txt		hamlet.txt
next_word_lstm.h5		next_word_lstm.h5
requirements.txt		requirements.txt
tokenizer.pickle		tokenizer.pickle

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LSTM - Neural Network - Hamlet Dataset

Live Deployment

Run Locally

Tech Stack

Feedback

About

Languages

ShubhaMahobia/LSTM-Hamlet

Folders and files

Latest commit

History

Repository files navigation

LSTM - Neural Network - Hamlet Dataset

Live Deployment

Run Locally

Tech Stack

Feedback

About

Topics

Resources

Stars

Watchers

Forks

Languages