Page MenuHomePhabricator

Alert SRE if Wikipedia Maps replication sync lag exceeds 1 day
Closed, InvalidPublic

Description

Wikipedia Maps sync lag has exceeded the expected time of 1 day. This is resulting in the updates on OSM such as name updates not being reflected. Screnshot of https://grafana.wikimedia.org/d/000000305/maps-performances?orgId=1&var-cluster=maps1 is enclosed.

I think the operations team should have an alert if the sync lags exceeds 1 day.

Grafana maps Screenshot from 2019-11-04 06-42-37.png (461×983 px, 42 KB)

Event Timeline

Arjunaraoc renamed this task from Wikipedia Maps sync lag with OSM exceeded 5 days to Wikipedia Maps replication failed.Nov 4 2019, 1:24 AM
Arjunaraoc added a project: SRE.
Aklapper renamed this task from Wikipedia Maps replication failed to Alert SRE if Wikipedia Maps replication sync lag exceeds 1 day.Nov 4 2019, 8:31 AM
Aklapper removed a project: SRE.
Aklapper added a project: SRE.
Mathew.onipe raised the priority of this task from Medium to High.Nov 4 2019, 8:41 AM
Mathew.onipe added subscribers: Gehel, MSantos.
Mathew.onipe removed subscribers: MSantos, Gehel.

I'm closing this task as there are icinga alerts for osm sync