Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix test_change_client_ocs_version_and_stop_heartbeat test #9395

Merged
merged 9 commits into from
May 23, 2024

Conversation

fbalak
Copy link
Contributor

@fbalak fbalak commented Feb 29, 2024

Signed-off-by: fbalak <fbalak@redhat.com>
@fbalak fbalak added Squad/Blue provider-client Provider-client solution labels Feb 29, 2024
@fbalak fbalak self-assigned this Feb 29, 2024
@fbalak fbalak requested a review from a team as a code owner February 29, 2024 13:16
Signed-off-by: fbalak <fbalak@redhat.com>
@pull-request-size pull-request-size bot added size/S PR that changes 10-29 lines and removed size/XS labels Feb 29, 2024
@pull-request-size pull-request-size bot added size/XS and removed size/S PR that changes 10-29 lines labels Feb 29, 2024
Signed-off-by: fbalak <fbalak@redhat.com>
@pull-request-size pull-request-size bot added size/S PR that changes 10-29 lines and removed size/XS labels Feb 29, 2024
Signed-off-by: fbalak <fbalak@redhat.com>
nehaberry
nehaberry previously approved these changes Mar 29, 2024
Copy link
Contributor

@nehaberry nehaberry left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM.

@@ -1177,7 +1177,7 @@ def change_client_version():
nonlocal client
nonlocal original_cluster
# run_time of operation
run_time = 60 * 3
run_time = 60 * 7
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@fbalak why is this value set at 60*7 ? Could you please clarify to me?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

StorageClientHeartbeatMissed - description has 120(s) for Warning and 300(s) for Critical.

Expressions in Prometheus rules confirming. Waring expression for reference.
[(time() - 120) > (ocs_storage_client_last_heartbeat > 0)](https://console-openshift-console.apps.ibm-baremetal1.qe.rh-ocs.com/monitoring/query-browser?query0=(time()%20-%20120)%20%3E%20(ocs_storage_client_last_heartbeat%20%3E%200))

image
image

StorageClientIncompatibleOperatorVersion immediate as far as I understand. No interval in the description.

So all together should be 420sec enough, which is the same as Filip set here

dahorak
dahorak previously approved these changes Apr 12, 2024
Copy link
Contributor

@dahorak dahorak left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@fbalak fbalak dismissed stale reviews from dahorak and nehaberry via aaa2be6 May 15, 2024 11:21
@fbalak fbalak requested a review from a team as a code owner May 15, 2024 11:21
@pull-request-size pull-request-size bot added size/M PR that changes 30-99 lines and removed size/S PR that changes 10-29 lines labels May 15, 2024
@openshift-ci openshift-ci bot removed the lgtm label May 15, 2024
Signed-off-by: fbalak <fbalak@redhat.com>
DanielOsypenko
DanielOsypenko previously approved these changes May 15, 2024
Signed-off-by: fbalak <fbalak@redhat.com>
Signed-off-by: fbalak <fbalak@redhat.com>
Signed-off-by: fbalak <fbalak@redhat.com>
@fbalak
Copy link
Contributor Author

fbalak commented May 23, 2024

@fbalak fbalak added the Verified Mark when PR was verified and log provided label May 23, 2024
@fbalak fbalak changed the title update message of StorageClientHeartbeatMissed alert Fix test_change_client_ocs_version_and_stop_heartbeat test May 23, 2024
@openshift-ci openshift-ci bot added the lgtm label May 23, 2024
Copy link

openshift-ci bot commented May 23, 2024

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: dahorak, DanielOsypenko, ebondare, fbalak

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@fbalak fbalak merged commit da3464e into red-hat-storage:master May 23, 2024
5 of 6 checks passed
amr1ta pushed a commit to amr1ta/ocs-ci that referenced this pull request May 28, 2024
…torage#9395)

* update message of StorageClientHeartbeatMissed alert

Signed-off-by: fbalak <fbalak@redhat.com>

* remove dot from the alert message

Signed-off-by: fbalak <fbalak@redhat.com>

* update alert data

Signed-off-by: fbalak <fbalak@redhat.com>

* increase alert collecting time

Signed-off-by: fbalak <fbalak@redhat.com>

* update alert messages

Signed-off-by: fbalak <fbalak@redhat.com>

* update check_alert_list to reflect multiple messages for one alert

Signed-off-by: fbalak <fbalak@redhat.com>

* specify namespace in patch command

Signed-off-by: fbalak <fbalak@redhat.com>

* fix alert dictionary keys

Signed-off-by: fbalak <fbalak@redhat.com>

* fix severity level

Signed-off-by: fbalak <fbalak@redhat.com>

---------

Signed-off-by: fbalak <fbalak@redhat.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
lgtm provider-client Provider-client solution size/M PR that changes 30-99 lines Squad/Blue Verified Mark when PR was verified and log provided
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Fix alert message for StorageClientHeartbeatMissed
5 participants