Page MenuHomePhabricator

Database alerting
Open, MediumPublic

Description

This is an epic task to gather all alerting (not including monitoring of trends/graphing, only potential emergencies a.k.a. icinga) of databases. There is currently too many false positives, and some gaps on the alerting, so tools and model has to change.

Related incident: https://wikitech.wikimedia.org/wiki/Incident_documentation/2017-07-28_s5_(WikiData_and_dewiki)_read-only

Related Objects

StatusSubtypeAssignedTask
OpenNone
Resolvedjcrespo
Resolvedjcrespo
DeclinedNone
Resolvedaaron
Resolvedjcrespo
ResolvedDzahn
ResolvedCDanis
ResolvedVolans
ResolvedCDanis
ResolvedCDanis
ResolvedMarostegui
OpenNone
ResolvedLadsgroup
ResolvedNone
ResolvedMarostegui
ResolvedMarostegui
ResolvedMarostegui
ResolvedMarostegui
ResolvedMarostegui
ResolvedMarostegui
ResolvedReedy
ResolvedMarostegui
ResolvedMarostegui
ResolvedMarostegui
DeclinedNone
ResolvedMarostegui
ResolvedMarostegui
ResolvedLadsgroup
ResolvedMarostegui
ResolvedMarostegui
ResolvedMarostegui
ResolvedMarostegui
ResolvedMarostegui
ResolvedMarostegui
ResolvedMarostegui
ResolvedLadsgroup
ResolvedMarostegui
ResolvedMarostegui
ResolvedLadsgroup
ResolvedKormat
OpenMarostegui
ResolvedKormat
ResolvedMarostegui
ResolvedKormat
Resolvedjcrespo
OpenNone
ResolvedKormat
Resolvedjcrespo
Resolved hashar
DeclinedLSobanski
OpenNone
ResolvedKormat
OpenNone
ResolvedKormat

Event Timeline

jcrespo triaged this task as Medium priority.Aug 4 2017, 8:41 AM
jcrespo moved this task from Triage to Meta/Epic on the DBA board.

Change 595149 had a related patch set uploaded (by Jcrespo; owner: Jcrespo):
[operations/puppet@production] monitoring: remove usages of 'dba' contact group

https://gerrit.wikimedia.org/r/595149

Change 595153 had a related patch set uploaded (by Jcrespo; owner: Jcrespo):
[operations/puppet@production] mariadb: Default monitor disk paging to false

https://gerrit.wikimedia.org/r/595153

Change 595149 merged by Jcrespo:
[operations/puppet@production] monitoring: remove usages of 'dba' contact group

https://gerrit.wikimedia.org/r/595149

Change 595153 merged by Jcrespo:
[operations/puppet@production] mariadb: Default monitor disk & process paging to false

https://gerrit.wikimedia.org/r/595153

LSobanski renamed this task from Improve database alerting (tracking) to Database alerting.May 14 2021, 10:37 AM