From parent task:
monitor/traffic.pp: monitoring::grafana_alert { 'varnish-http-requests': monitor/traffic.pp: monitoring::grafana_alert { 'ping-offload': monitor/traffic.pp: monitoring::grafana_alert { 'rpki':
From parent task:
monitor/traffic.pp: monitoring::grafana_alert { 'varnish-http-requests': monitor/traffic.pp: monitoring::grafana_alert { 'ping-offload': monitor/traffic.pp: monitoring::grafana_alert { 'rpki':
Status | Subtype | Assigned | Task | ||
---|---|---|---|---|---|
Open | None | T321808 Port most/all Icinga checks to Prometheus/Alertmanager | |||
Open | None | T288622 All Prometheus based alerts move from Icinga to alert manager exclusively | |||
Resolved | lmata | T281359 Onboard teams with Grafana alerts to AlertManager | |||
Resolved | fgiunchedi | T282806 Port traffic/netops grafana alerts to AlertManager |
Change 695367 had a related patch set uploaded (by Ema; author: Ema):
[operations/puppet@production] alertmanager: route Traffic team alerts
Change 695367 merged by Ema:
[operations/puppet@production] alertmanager: route Traffic team alerts
I've added a always-firing test alert on Grafana with the following tags: team: traffic, severity: critical. Shortly after I did so, we received an alert both via email and IRC, confirming that alertmanager routing works as expected.
09:34 -!- jinxer-wm [~jinxer-wm@user/jinxer-wm] has joined #wikimedia-traffic 09:34 < jinxer-wm> (EmaTestingAlertManager) firing: EmaTestingAlertManager - https://alerts.wikimedia.org
Change 696384 had a related patch set uploaded (by Ema; author: Ema):
[operations/puppet@production] icinga: remove Grafana alerts for Traffic/Netops
OK so it turns out that defining the alerts in Grafana is possible but not recommended, and the right thing to do is adding them to the operations/alert repo instead. My bad!
Change 696468 had a related patch set uploaded (by Ema; author: Ema):
[operations/alerts@master] Traffic team alerts
Change 697710 had a related patch set uploaded (by Ema; author: Ema):
[operations/alerts@master] Netops team alert: ping offload
Change 697721 had a related patch set uploaded (by Filippo Giunchedi; author: Filippo Giunchedi):
[operations/puppet@production] alertmanager: attach runbook/dashboard URLs to IRC messages
Change 697722 had a related patch set uploaded (by Filippo Giunchedi; author: Filippo Giunchedi):
[operations/puppet@production] alertmanager: add a sample JSON alert and instruction on how to test IRC format changes
Change 697737 had a related patch set uploaded (by Filippo Giunchedi; author: Filippo Giunchedi):
[operations/puppet@production] alerts: reload prometheus instances after deploy
Change 697721 merged by Filippo Giunchedi:
[operations/puppet@production] alertmanager: attach runbook/dashboard URLs to IRC messages
Change 697722 merged by Filippo Giunchedi:
[operations/puppet@production] alertmanager: add a sample alert and test instructions
Change 697737 merged by Filippo Giunchedi:
[operations/puppet@production] alerts: reload prometheus instances after deploy
Change 697924 had a related patch set uploaded (by Filippo Giunchedi; author: Filippo Giunchedi):
[operations/puppet@production] alertmanager: highlight 'instance' label in alerts dashboard
Change 697924 merged by Filippo Giunchedi:
[operations/puppet@production] alertmanager: highlight 'instance' label in alerts dashboard
Change 698459 had a related patch set uploaded (by Filippo Giunchedi; author: Filippo Giunchedi):
[operations/puppet@production] alertmanager: print link separators on IRC when needed
Change 698491 had a related patch set uploaded (by Ema; author: Ema):
[operations/puppet@production] alertmanager: define IRC and page routes for sre team
Change 698491 merged by Ema:
[operations/puppet@production] alertmanager: define IRC and page routes for sre team
Change 697710 merged by Ema:
[operations/alerts@master] Netops team alert: ping offload
Change 698548 had a related patch set uploaded (by Filippo Giunchedi; author: Filippo Giunchedi):
[operations/alerts@master] pipeline: use bullseye to get newer prometheus
Change 698548 merged by Filippo Giunchedi:
[operations/alerts@master] pipeline: use bullseye to get newer prometheus
Change 698459 merged by Filippo Giunchedi:
[operations/puppet@production] alertmanager: print link separators on IRC when needed
Change 700649 had a related patch set uploaded (by Ayounsi; author: XioNoX):
[operations/alerts@master] Move RPKI alerts to Prometheus/AM
Change 700649 merged by Ayounsi:
[operations/alerts@master] Move RPKI alerts to Prometheus/AM
Change 702688 had a related patch set uploaded (by Ayounsi; author: Ayounsi):
[operations/puppet@production] Remove old RPKI Grafana alerts
Change 702688 merged by Ayounsi:
[operations/puppet@production] Remove old RPKI Grafana alerts
Change 708081 had a related patch set uploaded (by Filippo Giunchedi; author: Filippo Giunchedi):
[operations/puppet@production] icinga: remove grafana alerts for Traffic, moved to alertmanager
Change 708081 merged by Filippo Giunchedi:
[operations/puppet@production] icinga: remove grafana alerts for Traffic, moved to alertmanager