IMDB Movie Data ETL Pipeline using S3, Glue, Redshift, EventBridge, SNS
Updated Aug 6, 2024 - Python
This workshop builds a serverless data lake architecture using Amazon Kinesis Firehose for streaming data ingestion, AWS Glue for data integration (ETL and catalog management), Amazon S3 for data lake storage, and Amazon Athena for SQL big data analytics.
Fully automated end-to-end ETL for airline data ingestion
Terraform implementation of a commonly used AWS architecture pattern: performing streaming ETL on a Kinesis data stream using a Glue job.
Spark and Data Lakes Project: STEDI Human Balance Analytics (Udacity Data Engineering with AWS Nanodegree)
Serverless Data Lake on AWS. Slideshare: https://www.slideshare.net/SmartBizVN/serverless-data-lake-on-aws
Git repo to accompany the AWS DevOps Blog: Using AWS DevOps Tools to model and provision AWS Glue workflows