Skip to content

unicef/giga-dagster

Repository files navigation

Giga DataOps Platform: Dagster

The Giga DataOps Platform is a data platform developed by Thinking Machines Data Science in coordination with UNICEF Giga. The objective of the platform is to ingest school data from various sources, applying concepts from master data management and data governance in order to produce a single source of truth—the School Master Data—which will then be used by consumers and downstream applications.

This repository contains the code for the Dagster, Spark, and Hive Metastore services of the Platform, which handles data orchestration, data pipelines, and distributed compute.

Table of Contents

  1. Architecture
  2. Data Flow
  3. Development
  4. Deployment
  5. Support

Jump to other platform services

Releases

No releases published

Packages

No packages published