
[BUG] The number of unavailable instances may exceed the maxUnavailable limit during upgrades #8078

Open
YTGhost opened this issue Sep 3, 2024 · 1 comment
Labels
kind/bug Something isn't working ks Stale

Comments


YTGhost (Contributor) commented Sep 3, 2024

Describe the bug
When performing an upgrade, if the user upgrades to a faulty image that causes the container to crash shortly after it starts, there is a brief window (before the container crashes) in which the Pod has already been upgraded and still appears healthy. During this window the next Pod's upgrade may be permitted, because currentUnavailable is 0 and the previous Pod passes the isHealthy check. As a result, the final number of unavailable instances can exceed the maxUnavailable limit.

Expected behavior
In addition to the isHealthy check, the controller should also rely on MinReadySeconds through an isRunningAndAvailable check, so that a Pod only counts as available after it has been ready for a minimum period. It must also ensure that isRunningAndAvailable is called only after the kubelet has finished updating the Pod status; otherwise the check may be evaluated against stale status.

@YTGhost YTGhost added the kind/bug Something isn't working label Sep 3, 2024
@free6om free6om added the ks label Sep 3, 2024

github-actions bot commented Oct 7, 2024

This issue has been marked as stale because it has been open for 30 days with no activity.

@github-actions github-actions bot added the Stale label Oct 7, 2024