Serverless Data Lake on AWS. Slideshare: https://www.slideshare.net/SmartBizVN/serverless-data-lake-on-aws
-
Updated
May 14, 2020 - Jupyter Notebook
Serverless Data Lake on AWS. Slideshare: https://www.slideshare.net/SmartBizVN/serverless-data-lake-on-aws
Terraform implementation of a commonly-used AWS architecture pattern of performing streaming ETL on a Kinesis data stream using a Glue job.
Git repo to accompany the AWS DevOps Blog: Using AWS DevOps Tools to model and provision AWS Glue workflows
This workshop is to build a serverless data lake architecture using Amazon Kinesis Firehose for streaming data ingestion, AWS Glue for Data Integration (ETL, Catalogue Management), Amazon S3 for data lake storage, Amazon Athena for SQL big data analytics.
Spark and Data Lakes Project: STEDI Human Balance Analytics (Udacity Data Engineering with AWS Nanodegree)
Fully Automation end-to-end ETL airlines data ingestion
IMDB Movie Data ETL Pipeline using S3, Glue, Redshift, EventBridge, SNS
Add a description, image, and links to the glue-etl topic page so that developers can more easily learn about it.
To associate your repository with the glue-etl topic, visit your repo's landing page and select "manage topics."