Skip to content

Latest commit

 

History

History
105 lines (79 loc) · 7.48 KB

README.md

File metadata and controls

105 lines (79 loc) · 7.48 KB

Netflix_Analysis:

image

Table of Contents:

Introduction

Netflix, a renowned subscription-based streaming service, boasts a vast library of films and television series, including original productions. This project aims to provide insightful data visualizations derived from the Netflix movies and TV shows dataset sourced from Kaggle (https://www.kaggle.com/code/ridwanadejumo/basic-data-visualization-on-the-netflix-dataset).

Leveraging Python programming along with libraries such as Numpy, Pandas, Matplotlib and Seaborn, I delve into various aspects including movies vs. series distribution, content trends over the years, genre preferences, top directors, and more. This project offers a comprehensive exploration of Netflix's content landscape through the lens of data analysis and visualization.

Data Dictionary

Files Description
Data This folder houses a comprehensive collection of Netflix data sourced from Kaggle with data ditionary.
netflix_cleaned_data.ipynb This file contains the code for data cleaning & transformation.
netflix_analysis.ipynb This file encompasses the code for a Netflix data analysis project.
README.md This is the Readme file of the project.

Dataset Description

The Netflix dataset used in this analysis comprises a comprehensive collection of information regarding the streaming platform's content. It includes attributes such as show ID, type (whether it's a movie or TV show), title, director, cast, country of production, date added to Netflix, release year, content rating, duration (in minutes for movies and in seasons for TV shows), genre/category, description, and metadata indicating the month and day the content was added. This dataset provides a rich source of information for exploring trends, preferences, and patterns in Netflix's content library, allowing for in-depth analysis of factors influencing viewer choices and content consumption habits.

Objective

The objective of the Netflix dataset is to analyze and understand the characteristics of the content available on Netflix. The dataset can be used to uncover patterns in the types of TV shows and movies that are popular on the platform, identify trends in the content produced by different countries, and explore the relationship between ratings and other features of the content. The dataset can also be used to develop recommendation systems for users, by analyzing user preferences and matching them with relevant TV shows and movies on the platform. Overall, the Netflix dataset is a valuable resource for data analysts and researchers interested in the streaming media industry and the behavior of consumers in the digital age.

Project Analysis

  • Understanding what content is available in different countries
  • Identifying similar content by matching text-based features
  • Network analysis of Actors / Directors and find interesting insights
  • Does Netflix has more focus on TV Shows than movies in recent years?
  • Which Type of content users watches more?
  • Derive sentiment of content over the years in Netflix
  • Trends in the number of shows and movies added over time
  • The distribution of content across different countries
  • Popular genres and their distribution
  • Content ratings and their distribution
  • Analysis of actors and directors with the most content

Visualizations

image

image

image

image

image

image

image

image

image

image

image

image

image

image

image

image

image

Insights

  • Movies Vs Series Distribution: There are more movies available on Netflix compared to TV shows.
  • Movies Vs Series Distribution Over the Years: In recent years, Netflix has shown a higher investment in producing movies rather than TV shows.
  • Movies Vs Series Distribution by Month: The distribution of movies and TV shows varies by month, with peaks in January and fluctuations throughout the year.
  • Countries with the Most Content (Movies or Series): The United States is the leading producer of both movies and TV shows on Netflix, followed by other countries such as India and the United Kingdom.
  • Top Actors in Each Country (Movies and Series): Certain actors appear more frequently in movies and TV shows across different countries.
  • Distribution of Content by Age Group: Netflix provides a diverse range of content catering to different age groups, with a focus on mature audiences.
  • Top Movie and TV Show Ratings: Content with TV-MA rating dominates both movies and TV shows on Netflix.
  • Content (Genre) Distribution by Country: Different countries prefer certain genres of content, with variations in popularity across regions.
  • Genre Distribution: Drama and International Movies are among the most prevalent genres on Netflix, followed by Comedies and TV Shows.
  • Top Genres of Movies and TV Shows: Documentaries, Stand-up Comedies, and Drama-International Movies are among the top genres for movies, while Kids' TV and International Shows dominate TV shows.
  • Top Directors in Each Country: Certain directors have a significant presence in producing content across different countries.
  • Duration Range for Movies and TV Shows: Most movies on Netflix have a duration between 90 and 100 minutes, while TV shows vary in duration, with some having multiple seasons.