Skip to content

Real time analytics: Kafka + Druid + Superset. Setting up streaming analytics using open source technologies -> Apache {Kafka, Superset, Druid} to set up a system that allows you to get a visualization of some data.

Notifications You must be signed in to change notification settings

desi109/kafka-druid-superset

Repository files navigation

Real time analytics: Kafka + Druid + Superset

What is Druid?

Apache Druid (incubating) is a real-time analytics database designed for fast slice-and-dice analytics ("OLAP" queries) on large data sets. Druid is most often used as a database for powering use cases where real-time ingest, fast query performance, and high uptime are important. As such, Druid is commonly used for powering GUIs of analytical applications, or as a backend for highly-concurrent APIs that need fast aggregations.

  • Columnar storage format
  • Scalable distributed system
  • Realtime or batch ingestion
  • Self-healing, self-balancing, easy to operate
  • Cloud-native, fault-tolerant architecture that won't lose data
  • Indexes for quick filtering
  • Time-based partitioning
  • Approximate algorithms
  • Automatic summarization at ingest time

What is Superset?

Apache Superset is a data exploration and visualization web application and provides an intuitive interface to explore and visualize datasets, and create interactive dashboards.

  • A rich set of data visualizations
  • An easy-to-use interface for exploring and visualizing data

About

Real time analytics: Kafka + Druid + Superset. Setting up streaming analytics using open source technologies -> Apache {Kafka, Superset, Druid} to set up a system that allows you to get a visualization of some data.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages