Skip to content

This project demonstrates how to use the HuggingFace Transformers library to perform sentiment analysis on text data. Specifically, we classify reviews as either _Positive_ or _Negative_ using a pre-trained model.

License

Notifications You must be signed in to change notification settings

dmickelson/SentimentAnalysisWithHuggingFaceProject

Repository files navigation

Sentiment Analysis with HuggingFace Transformers

This project demonstrates how to use the HuggingFace Transformers library to perform sentiment analysis on text data. Specifically, we classify reviews as either Positive or Negative using a pre-trained model.

This is a basic example of using a non-LLM (e.g., not ChatGPT or other large language models) mechanism to classify reviews. You can replace the provided data file with your own data for analysis.

This is just an example, and you can replace the data file with your data for analysis.

Transformer Model

We are using the HuggingFace Transformer model DistilBERT base uncased finetuned SST-2, which includes both tokenizers and the model, making it straightforward to use:

This model is fine-tuned on the SST-2 dataset for sentiment analysis, making it suitable for classifying text as either Positive or Negative.

Data

The data we are analyzing comes from Kaggle: Top 20 Play Store App Reviews (Daily Update)

In particular, we use the Dropbox reviews from this dataset:

Feel free to replace this data file with your own dataset for analysis.

Installation

To install the required dependencies, ensure you have Python installed and then run:

pip install -r requirements.txt

To run the example type:

python main.p

Output Sample

reviewId content score sentiment
6320 d1c16bb5-1322-4ba0-ad09-4ef98d94fc2a Worst update, the offline files are hard to re... 1 NEGATIVE
5564 db22256c-ecd0-4ba6-b9d7-bce23c21ccdc It's usable 4 POSITIVE
5154 a2d4fce3-ca82-408c-b646-22c949abff32 I deleted all my drop box files to free up spa... 2 NEGATIVE
8719 279265a3-7114-4642-b8b3-06d379cfb683 better 4 POSITIVE
9886 2a6ca7de-0e96-46d8-8460-fde56a348438 fntk app 1 NEGATIVE

"Sample Output"

Summary of Changes:

  1. Project Description: Expanded to provide more context about the project.
  2. Transformer Model: Added more details about the model used and its purpose.
  3. Data: Clarified the source and purpose of the data.
  4. Installation: Provided instructions for installing dependencies.
  5. Usage: Added usage instructions for running the script.
  6. Development Tools: Listed tools used for development and provided instructions.
  7. Example Code: Included example code for clarity.
  8. Formatting: Improved formatting and readability throughout the document.

About

This project demonstrates how to use the HuggingFace Transformers library to perform sentiment analysis on text data. Specifically, we classify reviews as either _Positive_ or _Negative_ using a pre-trained model.

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages