Problem Statement:
- To perform exploratory data analysis on the IMDB movie data-set.
Analysis Approach:
- At first I tried to understand the data better, and then started the cleaning process using python.
- I have used Jupyther Notebook to perform the analysis.
- I have dropped the missing values and duplicate values since they are small in number. Also I have dropped unwanted columns and replaced ‘None’ values with most common values.
- I have then analysed the data using statistical analysis and prepared charts where-ever required.
Insights:
- Top 10 profitable movies and directors
- Top 10 IMDB movies
- Top 10 Movie Genres
- Critic favorite and User favorite actors
- Users vote for movies over the decades
Please have a look at the attached code for detailed analysis and insights.