Skip to content

Gathered, extracted, transformed and loaded data on California wildfires and median housing prices to help determine the effect of previous and potential wildfire damage.

Notifications You must be signed in to change notification settings

timegnome/ETL-Project

Repository files navigation

ETL-Project

Extraction

Gather the data from the csv's provided by a kaggle data sources and zillows historical housing median prices per month.

Data from the weather api was not accessable without paid subscription.

Transform

Drop unneeded columns and data from each dataframe tables.

Renamed columns to match information and created unique id keys for countys and fire entrys

Match County strings to then hash for primary keys

Created primary table for countys

Melted county tables into 4 columns so dates were a single column

Fill in nan data with 0

Pushed the dataframe tables to a postgres sql database

Load

Pull data from postgres servers using python connections

Merge data into 2 data frames on the county ids

About

Gathered, extracted, transformed and loaded data on California wildfires and median housing prices to help determine the effect of previous and potential wildfire damage.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published