🚧 This repository is currently in a draft version
Distributed transactions are a common challenge in distributed systems. When multiple services or systems need to coordinate to ensure consistent data, failures—like communication issues or incomplete updates—can cause problems.
Much like horcruxes, distributed transactions are tricky to manage, dangerous if left unchecked, and can create chaos in your system if not handled properly.
This is particularly relevant when working with external systems, such as process engines like Zeebe or Camunda 7, where state is managed independently. If proper strategies aren’t in place, workflows or interactions may lead to inconsistencies between the process engine and your application’s database.
This repository aims to provide practical examples and proven patterns to handle these distributed transaction problems effectively in a Spring Boot and Zeebe environment.
Distributed transactions happen when a single operation spans multiple systems, such as:
- A database for storing application data.
- A process engine like Zeebe for orchestrating workflows.
- Other systems like APIs or message brokers.
Problems arise when we can’t guarantee that all systems succeed or fail together. This can lead to:
- Inconsistent states: Some systems finish their operations, while others fail.
- Duplicate actions: Systems retry failed tasks, causing duplicate operations.
- Data conflicts: Tasks execute out of order or with incomplete data.
Zeebe is a powerful process engine, but it assumes that tasks can start as soon as they are triggered. If you notify Zeebe before your database transaction is committed, or if your transaction fails afterward, you risk creating inconsistencies.
For example:
- You save data to the database and immediately notify Zeebe to start a process.
- If the database transaction fails, Zeebe has already started tasks based on incomplete or incorrect data.
Without proper strategies, this can cause:
- Tasks to fail unexpectedly.
- The database and process engine states to drift apart.
- Retrying the same operation to produce duplicates.
💡 Want to understand these challenges in more detail?
Check out the detailed breakdown of distributed transaction challenges, which includes examples, diagrams, and a reference architecture to explain these problems in depth.
Distributed transactions are a common problem in software architecture, and luckily, there are some well-known and tested solutions to address them. These solutions are applicable across many distributed systems and can also be adapted to work with Zeebe.
Here are some of the most effective patterns, we want to explore:
Trigger Zeebe only after the database transaction is successfully committed. This avoids notifying Zeebe with incomplete or uncommitted data.
Save Zeebe messages in an "outbox" table as part of the same database transaction. A scheduler or worker then reliably sends these messages to Zeebe after the transaction is complete.
Break workflows into smaller steps. Each step has a compensating action to undo its work if something goes wrong, ensuring consistency across systems without requiring a global transaction.
These patterns are widely used in distributed systems and provide different trade-offs between simplicity, reliability, and performance.
This repository currently contains examples of the following patterns to help you get started:
- ✅ After-Transaction Hook: Ensuring Zeebe interactions occur only after the transaction commits.
- 📦 Outbox Pattern: Using a database outbox and scheduler for reliable message delivery.
Getting started with the examples is simple! Follow these steps:
1: Start the Infrastructure:
Navigate to the stack
folder and bring up the infrastructure (Zeebe, Operate, etc.) using Docker Compose:
cd stack
docker-compose up
2: Run the Example:
Go to the folder of the example you want to try and start the application by running the ExampleApplication.kt
main
class.
Each example is a standard Spring Boot application,
so you can run it using your preferred IDE or command line.
3: Interact with the Process:
Each example uses the same newsletter subscription process.
To interact with the process, you need to send requests to the REST API provided by the example services.
- To make this easier, there are predefined API call files located in the bruno directory.
- If you use Bruno, simply open the folder and execute these files.
- Alternatively, you can use any other tool like curl or Postman to send the requests manually.
4: Monitor the Processes:
Once the infrastructure is running, you can monitor and debug workflows using Operate
at http://localhost:9091.
The credentials are demo/demo
.
Distributed transactions are tricky, but together we can solve them! If you have ideas, improvements, or new examples, feel free to contribute by opening a pull request or issue.