#

redshift-aws

Here are 5 public repositories matching this topic...


Daily incremental-load ETL pipeline for an e-commerce company using AWS Lambda and an AWS EMR cluster, deployed using Apache Airflow in a Docker container.

  • Updated Mar 17, 2023
  • Python

In this project I build a batch ETL pipeline that reads transactional data from Amazon RDS, transforms it into a usable format, and loads it into an Amazon S3 bucket. The data is then loaded into Redshift tables, after which I run analytical queries on it to gain insights.

  • Updated Oct 5, 2023
  • Jupyter Notebook

Developed a batch ETL pipeline to extract, transform, and load transactional data from RDS to Redshift. Used Sqoop to ingest data from RDS into HDFS, PySpark to transform the data and load it to S3, and Redshift to create and query dimension and fact tables. Performed analytical queries to identify ATMs with inactive transactions, ATM failures by weather, etc.

  • Updated Oct 31, 2023
  • Jupyter Notebook
