
Only enforce scheduler_key unique index on first execution #6

Merged

Conversation

jujustayfly
Contributor

Currently, if a job is periodically scheduled but also long-running (spanning multiple workloads), part of its execution gets skipped. This happens because we have a unique index on scheduler_key, which gets copied over to subsequent workloads. So if the next scheduled job is already enqueued (no matter how far in the future), any new workload from the currently executing job gets swallowed by the conflict on the unique index.

This is particularly an issue when using job-iteration, which relies on interrupting and re-enqueueing many workloads for the same Active Job over a long period of time.

The proposed change is to restrict the unique index so that it only applies to workloads whose executions count is 0. This is acceptable because jobs will still respect the execution_concurrency_key and enqueue_concurrency_key unique indexes. Retry policies can also be configured on the jobs themselves if needed.
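The intended semantics of such a partial unique index can be sketched as a small in-memory model in plain Ruby. This is not Gouda's actual code (in the real schema the constraint would live in PostgreSQL as a partial unique index, e.g. with a `WHERE` clause limiting it to first executions); the `Workload` struct and `WorkloadQueue` class here are hypothetical names used only for illustration:

```ruby
# In-memory sketch of the proposed dedup rule: a workload conflicts on
# scheduler_key only when it is a first execution (executions == 0).
# Retried or continued workloads (executions > 0) always get through.
Workload = Struct.new(:scheduler_key, :executions, keyword_init: true)

class WorkloadQueue
  def initialize
    @workloads = []
  end

  # Returns true if the workload was enqueued, false if it was rejected
  # by the (simulated) partial unique index on scheduler_key.
  def enqueue(workload)
    if workload.executions.zero? &&
       @workloads.any? { |w| w.executions.zero? && w.scheduler_key == workload.scheduler_key }
      return false # simulated unique-index violation: first execution already enqueued
    end
    @workloads << workload
    true
  end
end
```

Under this rule, a long-running job that re-enqueues itself (executions > 0) is no longer swallowed when the next scheduled run (executions == 0) is already waiting in the queue.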

@svanhesteren
Collaborator

Hey, cool find! A couple of things: I'm not 100% sure what all the implications will be; a test showing the issue and its resolution would definitely help.

Next to that, just changing the index in the Gouda class won't update the existing indexes in the database. We need a gouda:update task next to the gouda:install task. GoodJob has this too, so we can do it the same way. Perhaps I'll implement that separately, outside of this PR.

@julik
Contributor

julik commented Jun 14, 2024

What happens if there is no serialized_params, or no executions value within it? How will PG react? If we need to know which Workload was the first for a particular job, maybe pull that into the table proper, so that there is always a value of the correct type?

@jujustayfly
Contributor Author

I wonder if a lighter solution would be to not copy the scheduler_key onto workloads created by retries 🤔 The original job could still be tracked via its active_job_id.

@jujustayfly
Contributor Author

I have opted for the lighter solution: only setting a scheduler_key on the initial execution of an Active Job. This way no migration is required. @svanhesteren, I see we had explicitly written code to always copy the scheduler_key on all executions, but I'm not sure that is the desired behavior, given that we also have the enqueue and execution concurrency mechanisms. Wdyt?
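The adopted approach can be sketched as follows. The function name and hash shape are hypothetical, not Gouda's actual API; the point is only that a follow-up workload keeps its link to the original job via active_job_id while dropping the scheduler_key:

```ruby
# Sketch of the adopted lighter solution (names hypothetical): when
# building a follow-up workload from an existing one, do not copy the
# scheduler_key, so only first executions carry it and only they can
# conflict on the unique index.
def build_followup_workload(original)
  {
    active_job_id: original[:active_job_id],  # still links back to the original job
    scheduler_key: nil,                       # deliberately not copied from the original
    executions: original[:executions] + 1
  }
end
```

Because retried workloads carry no scheduler_key, no change to the existing unique index (and hence no migration) is needed.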

@jujustayfly jujustayfly requested a review from svanhesteren June 17, 2024 13:50
Collaborator

@svanhesteren svanhesteren left a comment


This is great! A nice and simple solution. 🚀

@jujustayfly jujustayfly merged commit 74f53a9 into main Jun 18, 2024
2 checks passed