'It wasn’t logic, it was love' and other quotes from #sexandthecity
This project uses Natural Language Processing (NLP) to analyze all the statements made by characters from the Sex and the City TV series.
Project is created with Python - version: 3.8.2.
Python libraries:
- nltk - version 3.5
- pandas - version 1.1.1
- numpy - version 1.19.1
- matplotlib - version 3.3.1
- seaborn - version 0.10.1
The input data consists of a single csv file - SATC_all_lines.csv. This is a Kaggle dataset (https://www.kaggle.com/snapcrack/every-sex-and-the-city-script).
The analysis and its results are stored in the Jupyter Notebook file - sex-and-the-city-lines.ipynb.
Project structure
Characters with the largest number of lines
The 'Sex And The City' quotes search engine
- Quotes about cities and places
- Quotes about men and women
- Quotes about food, fashion and lifestyle
- Quotes about appearance
- Quotes about relationships and emotions
- Other quotes
Counting word and bigram frequencies in the data
This project is complete.