Skip to content

Commit

Permalink
fixed the alert expression for node which are not sending metrics wit…
Browse files Browse the repository at this point in the history
…hin the timefram
  • Loading branch information
ashish1099 committed Aug 23, 2024
1 parent d706ddc commit e40cfb9
Showing 1 changed file with 3 additions and 2 deletions.
5 changes: 3 additions & 2 deletions argocd-helm-charts/prometheus-linuxaid/rules/prometheus.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -3,8 +3,9 @@ groups:
rules:
- alert: monitor::metrics::missing::pushprox
expr: |
(up{job="pushprox"} == 0) * on (certname) group_left(job) (node_time_seconds - node_boot_time_seconds > 600)
and on(certname) obmondo_monitoring{alert_id="monitor::metrics::missing::pushprox"} > 0
(100 * (count(up == 0) BY (certname, job) / count(up) BY (certname, job)) > 10)
* on (certname) group_right(job) (node_time_seconds - node_boot_time_seconds > 600)
and on(certname) obmondo_monitoring{alert_id="monitor::prometheus::metrics"} > 0
for: 30m
labels:
severity: critical
Expand Down

0 comments on commit e40cfb9

Please sign in to comment.