Page MenuHomePhabricator

Port swift prometheus-based alerts from icinga to alertmanager
Closed, ResolvedPublic

Description

The following alerts are based on check_prometheus and will benefit from being ported to alertmanager / alerts.git. (cfr docs at https://wikitech.wikimedia.org/wiki/Alertmanager#Create_alerts (and the whole page really))

modules/profile/manifests/swift/alerts.pp:        monitoring::check_prometheus { "swift-${site}-container-availability":
modules/profile/manifests/swift/alerts.pp:        monitoring::check_prometheus { "swift-${site}-object-availability":
modules/profile/manifests/swift/alerts.pp:        monitoring::check_prometheus { "swift-${site}-media-uploads":
modules/profile/manifests/swift/alerts.pp:    monitoring::check_prometheus { 'mw-objects-diff-eqiad-codfw':
modules/profile/manifests/swift/alerts.pp:    monitoring::check_prometheus { 'mw-objects-diff-codfw-eqiad':

Event Timeline

Change 812883 had a related patch set uploaded (by Mark Bergsma; author: Mark Bergsma):

[operations/alerts@master] sre: Port swift container_availability from Icinga

https://gerrit.wikimedia.org/r/812883

lmata triaged this task as Medium priority.Aug 1 2022, 8:24 PM

from patch notes alerts have been downgraded from page to critical for testing.

Change 902410 had a related patch set uploaded (by Jbond; author: jbond):

[operations/alerts@master] swift: convert monitoring::check_prometheus checkes

https://gerrit.wikimedia.org/r/902410

Change 902410 abandoned by Jbond:

[operations/alerts@master] swift: convert monitoring::check_prometheus checkes

Reason:

see https://gerrit.wikimedia.org/r/c/operations/alerts/+/812883

https://gerrit.wikimedia.org/r/902410

Change 812883 merged by Filippo Giunchedi:

[operations/alerts@master] sre: Port swift alerts from Icinga

https://gerrit.wikimedia.org/r/812883

Change 922499 had a related patch set uploaded (by Filippo Giunchedi; author: Filippo Giunchedi):

[operations/puppet@production] icinga: remove swift alerts, moved to alertmanager

https://gerrit.wikimedia.org/r/922499

Change 922499 merged by Filippo Giunchedi:

[operations/puppet@production] icinga: remove swift alerts, moved to alertmanager

https://gerrit.wikimedia.org/r/922499

fgiunchedi claimed this task.

This is done