User story
AS a WME engineer and as a PM I want to know what current alerts we are receiving in PD to make informed decisions on what alerts we want to implement. This work is the first step towards monitoring alerts, and later on, associating them with a service and assigning an escalation policy.
To do
- 1. Go to PagerDuty and understand the configuration, and the alert system we have in place at the moment
- 2. Make a list of what current alerts we are receiving in PD
- QA
Update
Current list of alarms are docurneted in the wiki of the Incident management project.
https://gitlab.enterprise.wikimedia.com/wikimedia-enterprise/incident-management/-/wikis/Alarms