Page MenuHomePhabricator

Create runbook for VarnishTrafficDrop alert, change dashboard link
Closed, ResolvedPublic

Description

The VarnishTrafficDrop alert currently has no associated runbook, making it hard for SREs to know what to do about it.

Also, the dashboard currently linked is useful but varnish-caching-last-week-comparison, specific to the affected DC if possible, would be better.

Event Timeline

ema triaged this task as Medium priority.Oct 8 2021, 11:53 AM

Change 730193 had a related patch set uploaded (by Ema; author: Ema):

[operations/alerts@master] VarnishTrafficDrop: add runbook and change dashboard link

https://gerrit.wikimedia.org/r/730193

Change 730193 merged by Ema:

[operations/alerts@master] VarnishTrafficDrop: add runbook and change dashboard link

https://gerrit.wikimedia.org/r/730193

ema claimed this task.

Runbook and updated dashboard link are shown correctly. Closing.

06:00 < jinxer-wm> (VarnishTrafficDrop) resolved: 68% GET drop in text@codfw during the past 30 minutes - 
                   https://wikitech.wikimedia.org/wiki/Monitoring/VarnishTrafficDrop - 
                   https://grafana.wikimedia.org/d/000000541/varnish-caching-last-week-comparison?viewPanel=5&var-cluster=text&var-site=codfw - 
                   https://alerts.wikimedia.org