3NF-normalize Yelp data on S3 with Spark and load it into Redshift - automate the whole thing with Apache Airflow
airflow
cloud
sql
spark
nosql
amazon-emr
s3-bucket
data-warehouse
amazon-redshift
data-pipeline
normalization
yelp-dataset
3nf
dimensional-tables
data-marts
etl-process
Updated Aug 17, 2019 - Jupyter Notebook