This is Python for AI/ML Internship 2nd Assignment at Elite Techno Groups. Olympic Games from 1896- 2012. The analysis is carried on Google Colab.
LIBRARY USED : Pandas, Matplotlib, Seaborn, and Numpy
DATA : This is a historical dataset on Olympic games, including all the games from 1896-2012. The data was taken from the Kaggle website.
NOTE : This analysis is primarily on Summer Olympics.
This is Olympics Data Analysis Project created using Python and CSV file.
It is an Exploratory Data Analysis on Olympics Games from 1896 to 2012. I analysized the dataset from the Kaggle
- Data Cleaning and Understanding
- My Observations
- Summer Olympics
- Summer Data Description
- Top Countries Participation
- Gender Participation
- In how many cities Summer Olympics is held so far?
- Which sport is having most number of Gold Medals so far? (Top 5)
- Which sport is having most number of medals so far? (Top 5)
- Which player has won most number of medals? (Top 5)
- Which player has won most number Gold Medals of medals? (Top 5)
- In which year India won first Gold Medal in Summer Olympics?
- Which event is most popular in terms on number of players? (Top 5)
- Which sport is having most female Gold Medalists? (Top 5)
Here are my observations from the three datasets: I'm going to grouped the datasets into two, i.e
- Summer Olympics
- Winter Olympics
I have specifically analysed "SUMMER OLYMPICS".
- dictionary.csv - This file contains Country, Code, Population and GDP per Capita.
- summer.csv - This file contains all summer olympics games. (This contains the Summer Olympics Games database)
- winter.csv - This file contains all winter olympics games.