Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix(ci): Changes from v2 for release 2.8.5 (3) #6023

Merged
merged 3 commits into from
Nov 1, 2024
Merged

fix(ci): Changes from v2 for release 2.8.5 (3) #6023

merged 3 commits into from
Nov 1, 2024

Conversation

sakoush
Copy link
Member

@sakoush sakoush commented Nov 1, 2024

What this PR does / why we need it:

Which issue(s) this PR fixes:

Fixes #

Special notes for your reviewer:

sakoush and others added 3 commits November 1, 2024 11:41
…6020)


* enforce serial order on agent connections

* add serial order for dataflow engine

* move logging after acquiring lock

* lint fixes

* add tests for chainer subscribe

* increase sleep to fix flaky test
* fix flaky test

* cap execution of test in case of failures
Following #6020, it was no longer possible to have multiple replicas of dataflow-engine subscribing simultaneously to the scheduler, because all were connecting with the same subscriber name, and a lock was added per name, first waiting the disconnection of the old subscriber before allowing a new one to progress.

We update the dataflow-engine code so that each replica connects with its own hostname as the subscriber name. If the hostname can not be determined, we subscribe with the name seldon-dataflow-engine- followed by the canonical string representation of a UUID v4.

The subscriber name can also be explicitly controlled by passing the --dataflow-replica-id argument or the DATAFLOW_REPLICA_ID environment variable, wich will take precedence, in that order, to setting the value as the hostname.
@sakoush sakoush requested a review from lc525 as a code owner November 1, 2024 15:11
@sakoush sakoush added the v2 label Nov 1, 2024
Copy link
Member

@lc525 lc525 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

@sakoush sakoush merged commit e3691bb into release-2.8 Nov 1, 2024
17 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants