Managed Offline Flow

Pinot is most commonly used to provide real-time analytics on streaming data, which can be achieved using a real-time table. However, after running such a system for a while, we may want to update data that has already been ingested into the table. Perhaps a value in a column has been renamed, or we want to remove some duplicate records.

Segments in real-time tables can't be replaced, but segments in offline tables can. The managed offline flow is Pinot's mechanism for moving data from real-time tables to offline tables.
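To make the idea of moving data from real-time to offline tables more concrete, here is a minimal sketch of how such a flow might choose which time window to move next. It assumes a simplified model in which data is moved one fixed-size time bucket at a time and a window is only moved once it is older than a safety buffer. The function name and parameters (`next_offline_window`, `watermark_ms`, `bucket_ms`, `buffer_ms`) are hypothetical and chosen for illustration; they are not taken from this recipe or from Pinot's source code.

```python
from typing import Optional, Tuple

HOUR_MS = 3_600_000  # one hour in milliseconds


def next_offline_window(
    watermark_ms: int, bucket_ms: int, buffer_ms: int, now_ms: int
) -> Optional[Tuple[int, int]]:
    """Return the [start, end) window to move next, or None if it is too recent.

    Hypothetical model: the watermark marks where the previous run stopped,
    and each run handles exactly one bucket of data.
    """
    window_start = watermark_ms
    window_end = watermark_ms + bucket_ms
    # Skip windows that reach into the buffer period, because recent data
    # may still be arriving in the real-time table.
    if window_end > now_ms - buffer_ms:
        return None
    return window_start, window_end


# With a one-hour bucket and a one-hour buffer, three hours after time zero:
print(next_offline_window(0, HOUR_MS, HOUR_MS, 3 * HOUR_MS))            # (0, 3600000)
print(next_offline_window(2 * HOUR_MS, HOUR_MS, HOUR_MS, 3 * HOUR_MS))  # None
```

In this model, each successful run would advance the watermark to the end of the window it just moved, so repeated runs walk forward through the real-time data one bucket at a time.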

In this recipe, we'll learn how to use Pinot's managed offline flow.

Property	Value
Pinot Version	0.9.3
Schema	schema
Table Config	Offline
Table Config	Realtime

This is the code for the following recipe: https://dev.startree.ai/docs/pinot/recipes/real-time-offline-job

Makefile

flowchart LR

Producer-->Kafka-->p[Pinot Table]

make recipe

Running this recipe starts the components shown in the diagram above and begins producing data into Kafka.
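For readers who want a feel for the producer side of the diagram, the following is a rough sketch of a generator that emits timestamped JSON events, one per line. It is not the recipe's actual data generator: the field names (`ts`, `id`, `value`) and the function `generate_events` are hypothetical, and the sketch only prints to standard output rather than talking to Kafka.

```python
import json
import random
from typing import Iterator


def generate_events(count: int, start_ms: int, step_ms: int = 1000, seed: int = 0) -> Iterator[str]:
    """Yield `count` JSON-encoded events with increasing timestamps.

    All field names here are hypothetical, chosen only for illustration.
    """
    rng = random.Random(seed)  # seeded so that runs are repeatable
    for i in range(count):
        event = {
            "ts": start_ms + i * step_ms,  # event time in epoch milliseconds
            "id": i,
            "value": rng.randint(0, 100),
        }
        yield json.dumps(event)


# Print three sample events; in a real setup these lines could be
# handed to a Kafka producer instead of being printed.
for line in generate_events(3, start_ms=1_640_995_200_000):
    print(line)
```

A time column such as the hypothetical `ts` field matters for this recipe, because moving data from a real-time table to an offline table is organized around time windows.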

Run the next Make task: