Process Kafka messages in parallel #1512

Merged: 1 commit, May 2, 2024

Conversation

@radazen (Contributor) commented May 2, 2024

What's new?

This PR adds a worker pool to each message batcher, so messages are now processed in parallel across multiple workers. The processed messages are still submitted as a batch, in the order they were originally read. As a result, we can scale message processing up considerably by increasing the number of vCPUs per container instance. Even without changing the number of vCPUs, we'll likely see roughly a 5x increase in processing throughput.
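
For readers unfamiliar with the pattern, here is a minimal sketch of a worker pool that processes a batch in parallel while preserving read order. It is written in Python purely for illustration and is not the code from this PR; `process_batch`, `handler`, and `num_workers` are hypothetical names, and the Kafka read and submit steps are left out.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Iterable, List, TypeVar

T = TypeVar("T")
R = TypeVar("R")


def process_batch(
    messages: Iterable[T],
    handler: Callable[[T], R],
    num_workers: int = 8,  # hypothetical default, not taken from the PR
) -> List[R]:
    """Run handler on every message in parallel and return the results
    in the same order the messages were read."""
    with ThreadPoolExecutor(max_workers=num_workers) as pool:
        # Executor.map yields results in input order, even when
        # later messages finish processing before earlier ones.
        return list(pool.map(handler, messages))
```

The caller can then submit the returned list as a single batch, since its order matches the input. Note that in CPython a thread pool mainly helps I/O-bound handlers; a CPU-bound handler would need something like `ProcessPoolExecutor` to benefit from extra vCPUs.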

@radazen radazen requested a review from jarrel-b as a code owner May 2, 2024 17:18
@radazen radazen merged commit f2aacd0 into main May 2, 2024
5 checks passed
@radazen radazen deleted the ezra/kafka-parallel-processing branch May 2, 2024 17:18