Skip to content

Developed a batch ETL pipeline to extract, transform, and load transactional data from RDS to Redshift. Used Sqoop to ingest data from RDS to HDFS, PySpark to transform and load data to S3, and Redshift to create and query dimension & fact tables. Performed analytical queries to identify ATMs with inactive transactions, ATM failures by weather, etc

Notifications You must be signed in to change notification settings

urvashiforreal/ETL-Project

About

Developed a batch ETL pipeline to extract, transform, and load transactional data from RDS to Redshift. Used Sqoop to ingest data from RDS to HDFS, PySpark to transform and load data to S3, and Redshift to create and query dimension & fact tables. Performed analytical queries to identify ATMs with inactive transactions, ATM failures by weather, etc

Topics

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published