'Beauty is fleeting but a rent-controlled apartment overlooking the park is forever' and other quotes from #sexandthecity 👠

katarzynajanicka/sex-and-the-city-language-analysis


sex-and-the-city-language-analysis

'It wasn’t logic, it was love' and other quotes from #sexandthecity

Table of contents

General info

This project uses Natural Language Processing (NLP) to analyze every line spoken by the characters of the TV series Sex and the City.

Technologies

The project is built with Python 3.8.2.

Python libraries:

  • nltk - version 3.5
  • pandas - version 1.1.1
  • numpy - version 1.19.1
  • matplotlib - version 3.3.1
  • seaborn - version 0.10.1

Setup

The input data is a single CSV file, SATC_all_lines.csv, taken from a Kaggle dataset (https://www.kaggle.com/snapcrack/every-sex-and-the-city-script).

The analysis and its results are in the Jupyter notebook sex-and-the-city-lines.ipynb.

Screenshots

Project structure

Characters with the largest number of lines
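
A ranking like this could be computed roughly as follows. This is a minimal sketch, not the notebook's actual code: it assumes the CSV has a `Speaker` column (an assumption about the dataset's schema), uses the standard library's `csv` and `collections.Counter` instead of pandas so it runs without extra dependencies, and works on made-up placeholder rows rather than the real file. The helper name `top_speakers` is hypothetical.

```python
import csv
import io
from collections import Counter

# Placeholder rows standing in for SATC_all_lines.csv.
# The column names (Speaker, Line) are an ASSUMPTION about the dataset's schema.
SAMPLE_CSV = """Speaker,Line
Carrie,placeholder line 1
Samantha,placeholder line 2
Carrie,placeholder line 3
Miranda,placeholder line 4
Carrie,placeholder line 5
"""


def top_speakers(csv_file, n=3):
    """Return the n speakers with the most lines as (speaker, count) pairs."""
    reader = csv.DictReader(csv_file)
    counts = Counter(row["Speaker"].strip() for row in reader)
    return counts.most_common(n)


ranking = top_speakers(io.StringIO(SAMPLE_CSV))
print(ranking)
```

To run it on the real data, pass an open file handle for SATC_all_lines.csv instead of the `io.StringIO` sample, after adjusting the column name if the file uses a different one.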

The 'Sex And The City' quotes search engine

  1. Quotes about cities and places

  2. Quotes about men and women

  3. Quotes about food, fashion and lifestyle

  4. Quotes about appearance

  5. Quotes about relationships and emotions

  6. Other quotes
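
One simple way to build a quote search over the script lines is keyword matching. The sketch below is an illustration under assumptions, not the notebook's implementation: the function name `search_quotes` is hypothetical, matching is case-insensitive on whole words, and the sample list holds only the two quotes cited in this README.

```python
import re

# The two quotes cited in this README, used as sample data.
QUOTES = [
    "Beauty is fleeting but a rent-controlled apartment overlooking the park is forever",
    "It wasn't logic, it was love",
]


def search_quotes(lines, keywords):
    """Return the lines that contain at least one keyword as a whole word."""
    if not keywords:
        return []
    # Escape each keyword so punctuation in it is matched literally.
    alternatives = "|".join(re.escape(k) for k in keywords)
    pattern = re.compile(r"\b(?:" + alternatives + r")\b", re.IGNORECASE)
    return [line for line in lines if pattern.search(line)]


places = search_quotes(QUOTES, ["apartment", "park"])
emotions = search_quotes(QUOTES, ["love"])
print(places)
print(emotions)
```

Each category in the list above would then correspond to its own keyword list, for example place names for the first category.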

Counting word and bigram frequencies in the data
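
Word and bigram counting can be sketched as below. This is a simplified illustration rather than the notebook's code: a plain regular-expression tokenizer stands in for nltk's tokenizers so the example runs without any downloads, and the helper names `tokenize` and `word_and_bigram_counts` are hypothetical. Bigrams are counted within each line, so they never span two different lines.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into words (letters and apostrophes)."""
    return re.findall(r"[a-z']+", text.lower())


def word_and_bigram_counts(lines):
    """Count single words and adjacent word pairs across all lines."""
    words = Counter()
    bigrams = Counter()
    for line in lines:
        tokens = tokenize(line)
        words.update(tokens)
        # Pair each token with its successor: (t0, t1), (t1, t2), ...
        bigrams.update(zip(tokens, tokens[1:]))
    return words, bigrams


sample_lines = ["It wasn't logic, it was love", "It was love"]
words, bigrams = word_and_bigram_counts(sample_lines)
print(words.most_common(3))
print(bigrams.most_common(2))
```

With nltk installed, `nltk.bigrams` and `nltk.FreqDist` could replace the `zip` and `Counter` calls; filtering out stop words before counting is a common extra step that this sketch leaves out.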

Conclusions

Status

This project is complete.
