This repository contains the tasks that I completed while working as an intern for The Sparks Foundation
- Internship Category - Data Science and Business Analytics
- Internship Duration - 1 Month ( August - 2021 )
- Internship Type - Work from Home
In this internship, we were provided a total of 10 Tasks (6 from own domain and 2 from other domains) and I was able to successfully complete all the 8 tasks within the given time-frame.
(Task 1 - Linear_Regression.ipynb)
Please click on the images on right side to view my solution.
- Predict the percentage of marks of an student based on the number of study hours.
- This is a simple linear regression task as it involves just 2 variables.
- Data can be found at http://bit.ly/w
- You can use R, Python, SAS Enterprise Miner or any other tool.
- What will be predicted score if a student studies for 9.25 hrs/ day?
Please click on the images on right side to view my solution.
- From the given ‘Iris’ dataset, predict the optimum number of clusters and represent it visually.
- Use R or Python or perform this task.
- Data can be found at https://bit.ly/3cGyP8j
Please click on the images on right side to view my solution.
- For the given ‘Iris’ dataset, create the Decision Tree classifier and visualize it graphically.
- The purpose is if we feed any new data to this classifier, it would be able to predict the right class accordingly.
- Data can be found at https://bit.ly/3kXTdox
Please click on the images on right side to view my solution.
NOTE : If you face any issue over dataset then please refer to the thread of Issue #2.
- Create a hybrid model for stock price/performance prediction using numerical analysis of historical stock prices, and sentimental analysis of news headlines.
- Stock to analyze and predict SENSEX (S&P BSE SENSEX)
- Download historical stock prices from finance.yahoo.com
- Download textual (news) data from https://bit.ly/36fFPI6
- Use either R or Python, or both for separate analysis and then combine the findings to create a hybrid model.
Please click on the images on right side to view my solution.
- Perform ‘Exploratory Data Analysis’ on the provided dataset ‘SampleSuperstore’
- As a business manager, try to find out the weak areas where you can work to make more profit.
- What all business problems you can derive by exploring the data?
- You can choose any of the tool of your choice (Python/R/Tableau/PowerBI/Excel)
- Dataset link :https://bit.ly/3i4rbWl
- Create storyboards. Screen record along with your audio explaining the charts and interpretations.
Please click on the images on right side to view my solution.
- Create a storyboard showing spread of Covid 19 cases in your country or any region (Asia, Europe, BRICS etc)
- Use animation, timeline and annotations to create attractive and interactive dashboards and story
- Identify interesting patterns and possible reasons helping Covid 19 spread with basic as well as advanced charts
- Use Tableau or Power BI for this task
- Screen record the completed storyboard along with your audio explaining the charts and giving recommendations.
- Dataset: Daily updated .csv file on https://bit.ly/30d2gdi
- Tableau
- Scikit learn