Skip to content
This repository has been archived by the owner on Oct 22, 2024. It is now read-only.

Readiness probes of some components not correct #71

Open
TilBlechschmidt opened this issue Dec 7, 2021 · 1 comment
Open

Readiness probes of some components not correct #71

TilBlechschmidt opened this issue Dec 7, 2021 · 1 comment
Labels
Priority: High Highest priority that will be addressed first once contribution time is available Status: Pending Initial issue stage waiting for further evaluation Type: Bug Issues that affect the operation of the software

Comments

@TilBlechschmidt
Copy link
Owner

🐛 Bug description

It appears that some components do not propagate the correct readiness probe state regarding connectivity to the Redis server. It seems to be affecting the manager, orchestrator, and gangway. The collector and api probably suffer the same issue but in the observed scenario they kept on crashing because the mongodb server was unavailable.

🦶 Reproduction steps

Steps to reproduce the behavior:

  1. Deploy a webgrid fresh
  2. Make sure the redis and/or MongoDB don't come up
  3. Watch it burn 🔥

🎯 Expected behaviour

This is more of a philosophical discussion on whether the software should crash upon encountering an error or just report a negative readiness state. Probably the latter, however, even that is currently not given. Redis connectivity should be reflected in the readiness state!

📺 Screenshots

image

@TilBlechschmidt TilBlechschmidt added Type: Bug Issues that affect the operation of the software Priority: High Highest priority that will be addressed first once contribution time is available Status: Pending Initial issue stage waiting for further evaluation labels Dec 7, 2021
@TilBlechschmidt TilBlechschmidt added this to the Stable release milestone Dec 7, 2021
@TilBlechschmidt TilBlechschmidt self-assigned this Dec 7, 2021
@TilBlechschmidt
Copy link
Owner Author

image

After giving it a few minutes to settle down, it appears as though the api and collector crash (as expected), the gangway correctly report the readiness, and the manager and orchestrator behave incorrectly.

@TilBlechschmidt TilBlechschmidt removed their assignment Dec 17, 2021
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
Priority: High Highest priority that will be addressed first once contribution time is available Status: Pending Initial issue stage waiting for further evaluation Type: Bug Issues that affect the operation of the software
Projects
None yet
Development

No branches or pull requests

1 participant