Skip to content

Exploratory Data Analysis of the Popularity of the Google Play Store Apps.

License

Notifications You must be signed in to change notification settings

dimi-fn/Google-Play-Store-Apps

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

52 Commits
 
 
 
 
 
 

Repository files navigation

Google Play Store Apps

*Note:

Due to a GitHub bug (issue #3035 & #3555), sometimes the notebook files (files ending in ".ipynb") may not render. Please reload the page until the content can be displayed, or click here to view the shared Google Colab notebook file.

Source

"Google Play Store Apps - App data of 1.1 Million applications" dataset collected from this kaggle repository and created by G. Prakash.

Description & Objectives

Performed exploratory data analysis with descriptive statistics and data visualization, and applied various machine learning techniques. The analysis undertaken was in the context of the CS982 "Big Data" module of Strathclyde University (Glasgow, UK) in the academic year 2019-20.

Purpose

The results and the diagnostic analysis of the project, using both supervised (classification) and unsupervised methods (clustering), led to insights that can be useful for identifying which attributes are linked with the most popular apps in the Google Play Store.