
End-to-end pipeline with stream & batch processing. Google Pub/Sub | Airflow | dbt | BigQuery | Airbyte


ranggaaldosas/elt-finance-stream_processing


Project ETL Stock IDX - Stream Processing

End-to-end data pipeline built on the Extract-Load-Transform (ELT) approach

  • Data Stack
    • Orchestration = Apache Airflow
    • Connector = Airbyte
    • Data Transformation = dbt
    • OLAP/Data Warehouse = Google BigQuery
    • Stream Processing = Google Pub/Sub

A. System Design

[Figure: system design diagram]

B. Airflow Graph

[Figure: Airflow DAG graph]

C. BigQuery Data Warehouse

[Figures: two BigQuery data warehouse screenshots]

D. Temporary Dashboard (soon to be updated)

[Figure: temporary dashboard screenshot]

E. Discord Notification Webhook Integration

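  1. Add the webhook URLs as Airflow Variables: in the Airflow UI, go to Admin -> Variables and add DISCORD_SUCCESS_WEBHOOK_URL and DISCORD_FAILURE_WEBHOOK_URL.

Or set them with the CLI: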

airflow variables set DISCORD_SUCCESS_WEBHOOK_URL "https://discord.com/api/webhooks/xxxxx"
airflow variables set DISCORD_FAILURE_WEBHOOK_URL "https://discord.com/api/webhooks/xxxxx"
  2. Read the variables in the DAG file
from airflow.models import Variable

DISCORD_SUCCESS_WEBHOOK_URL = Variable.get("DISCORD_SUCCESS_WEBHOOK_URL")
DISCORD_FAILURE_WEBHOOK_URL = Variable.get("DISCORD_FAILURE_WEBHOOK_URL")
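
The README does not show how the two webhook URLs are used once they are loaded. The following is a minimal sketch of one way to do it, not this repository's actual code. It assumes notifications are sent by POSTing a JSON body with a `content` field to the Discord webhook URL. The helper names (`build_message`, `build_request`, `notify_discord`), the message format, and the `User-Agent` value are all hypothetical.

```python
import json
import urllib.request
from types import SimpleNamespace


def build_message(context, status):
    """Format a short status line from an Airflow callback context (hypothetical format)."""
    ti = context["task_instance"]
    return f"DAG {ti.dag_id}, task {ti.task_id}: {status}"


def build_request(webhook_url, message):
    """Build the POST request: Discord webhooks accept a JSON body with a 'content' field."""
    body = json.dumps({"content": message}).encode("utf-8")
    return urllib.request.Request(
        webhook_url,
        data=body,
        headers={
            "Content-Type": "application/json",
            # Assumption: an explicit User-Agent, since some services reject urllib's default.
            "User-Agent": "airflow-discord-notifier",
        },
        method="POST",
    )


def notify_discord(webhook_url, message, timeout=10):
    """Send the message and return the HTTP status code."""
    with urllib.request.urlopen(build_request(webhook_url, message), timeout=timeout) as resp:
        return resp.status


if __name__ == "__main__":
    # Offline demo with a stand-in for the Airflow context; no network call is made.
    fake_context = {"task_instance": SimpleNamespace(dag_id="example_dag", task_id="load")}
    print(build_message(fake_context, "succeeded"))
```

In a DAG, helpers like these could be attached through Airflow's `on_success_callback` and `on_failure_callback` parameters, for example `on_failure_callback=lambda ctx: notify_discord(DISCORD_FAILURE_WEBHOOK_URL, build_message(ctx, "failed"))`.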

dwh
