
Polish script that checks eventlogging lag to use it for alarming
Closed, Declined, Public

Description

Polish script that checks eventlogging lag to use it for alarming

Event Timeline

Nuria raised the priority of this task to Needs Triage.
Nuria updated the task description.
Nuria added a project: Analytics-Kanban.
Nuria added subscribers: Nuria, Ottomata, DBA, elukey.
Nuria set Security to None.

I took a look at the script, and it would be really great to push metrics to statsd about the lag observed for each table. After that, alarming with graphite/icinga should be super easy.

The new metrics would be added to https://grafana.wikimedia.org/dashboard/db/eventlogging
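
For illustration, a rough sketch of what pushing per-table lag to statsd could look like. This is not the existing script: the metric prefix `eventlogging.lag`, the function names, and the `localhost:8125` endpoint are all assumptions made for the example. It only relies on the plain StatsD gauge format (`name:value|g`) sent over UDP.

```python
import socket


def format_lag_gauge(table, lag_seconds, prefix="eventlogging.lag"):
    """Build a StatsD gauge line for one table (prefix is hypothetical)."""
    # Dots separate graphite path components, so replace them in table names.
    safe_table = table.replace(".", "_")
    # Lag is truncated to whole seconds.
    return f"{prefix}.{safe_table}:{int(lag_seconds)}|g"


def send_lag_metrics(lags, host="localhost", port=8125):
    """Send one gauge per table over UDP (fire-and-forget).

    `lags` maps table name -> lag in seconds. Host and port are assumed
    defaults, not the production statsd endpoint.
    """
    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
    try:
        for table, lag in lags.items():
            payload = format_lag_gauge(table, lag).encode("ascii")
            sock.sendto(payload, (host, port))
    finally:
        sock.close()
```

With gauges like these in graphite, an icinga check could alert when a table's lag stays above a chosen threshold.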

Reedy renamed this task from "Polish script that checks eventlogging lag to use it for alarming" to "Polish script that checks eventlogging lag to use it for alarming". Jan 22 2016, 1:37 PM
This comment was removed by elukey.
Milimetric triaged this task as Medium priority. Feb 11 2016, 6:03 PM
Milimetric moved this task from Incoming to Event Platform on the Analytics board.
Milimetric moved this task from Event Platform to Analytics Query Service on the Analytics board.
Milimetric subscribed.

We postponed this for too long; we're likely to change how people look at EL data before we get to this improvement.