The etcdmirror systemd status check needs to page. Currently if an etcdmirror is down, only an irc alert is sent
PROBLEM - etcdmirror-conftool-eqiad-wmnet service on conf2005 is CRITICAL: CRITICAL - Expecting active but unit etcdmirror-conftool-eqiad-wmnet is failed
Description
Description
Details
Details
Related Changes in Gerrit:
| Subject | Repo | Branch | Lines +/- | |
|---|---|---|---|---|
| sre: add paging alert for etcdmirror down | operations/alerts | master | +28 -0 |
| Status | Subtype | Assigned | Task | ||
|---|---|---|---|---|---|
| Resolved | jijiki | T317340 Incident: 2022-09-08 codfw appservers degradation | |||
| Resolved | Clement_Goubert | T317402 Page on etcdmirror critical status |
Event Timeline
Comment Actions
Change 831103 had a related patch set uploaded (by Clément Goubert; author: Giuseppe Lavagetto):
[operations/alerts@master] sre: add paging alert for etcdmirror down
Comment Actions
Change 831103 merged by jenkins-bot:
[operations/alerts@master] sre: add paging alert for etcdmirror down