Skip to content

stacie-kipruto/Global_Health_Trends_Covid_19

Repository files navigation

Exploring COVID-19 Impact: Analyzing Global Health Trends and Big Data Fundamentals

About this Project

  • This project aims to show the positivity and death rate of the COVID-19 virus. The raw data will be preprocessed using EXCEL, SQL workbench in BigQuery.
  • This project also seeks to show the fundamentals of BigQuery which is: a) Storage of Big Data b) Ingestion c) Querying
  • The goal is to analyse the number of deaths globally. Percent Population Infected

Prerequisites

The following tools were used in this project;

  • Excel
  • BigQuery
  • SQL
  • Tableau

Extraction of the Dataset

  • The Covid Dataset includes raw data on confirmed cases and deaths from Johns Hopkins University (JHU) and is publicly available on the website https://ourworldindata.org/
  • The dataset is quite large and coud not be uploaded on GitHub. columns rows

Preprocessing

  • The dataset contains a lot of unnecessary information for the analysis. This project seeks to showcase:
    1. The Total number of deaths per continent.
    2. The Percent Population Infected per country.
  • The dataset was loaded into Excel and slight reformattng was done.
  • Columns that were not necessary were deleted and the main variables that guided this project included: i) Total number of deaths ii) Population iii) Total number of vaccinated persons.
  • To avoid having to join multiple tables on SQL for every variable, only two tables were extracted from the main .csv file

Ingestion - BigQuery

  • The two tables were then loaded onto BigQuery to analyse the data using various SQL queries

Querying

or on the file PortfolioSeries.sql(https://github.com/stacie-kipruto/CovidDeathsSQL/blob/main/PortfolioSeries.sql)

Extraction and Visualisation

Limitations

  • Tableau is not a natively live streaming tool. The data for this project is current and the visuals presented on the dashboard will only show the numbers for the date in which the project was posted.

About

An exploratory data analysis project (using SQL and Tableau) which can be implemented in a business model

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published