Page MenuHomePhabricator

Monitor and alert on rollbacks in maps cluster
Open, Needs TriagePublic

Description

In T301664 the clearest way that errors manifested was as rollbacks. We should create an icinga check that will alert on consistent rollback events over a small period of time.

Event Timeline

@hnowlan: Removing task assignee as this open task has been assigned for more than two years - see the email sent to all task assignees on 2024-04-15.
Please assign this task to yourself again if you still realistically [plan to] work on this task - it would be welcome! :)
If this task has been resolved in the meantime, or should not be worked on by anybody ("declined"), please update its task status via "Add Action… 🡒 Change Status".
Also see https://www.mediawiki.org/wiki/Bug_management/Assignee_cleanup for tips how to best manage your individual work in Phabricator. Thanks!