Page MenuHomePhabricator

Automate and/or better-document varnish ban procedure for operations staff, so it can be accomplished with more speed and confidence in outage conditions
Closed, ResolvedPublic

Description

This is a followup task, generated from https://wikitech.wikimedia.org/wiki/Incident_documentation/20160126-20160126-WikimediaDomainRedirection#Actionables.

During a recent outage condition, operations had to perform multiple cache purges/bans. The actionables include better documentation of this procedure, as wikitech currently is somewhat lacking. Wikitech's info mainly comprises of: One-off purges - Don't do this. Consult a varnish specialist first. It also includes very limited info about varnishadm ban but not a detailed use case for our systems.

Event Timeline

RobH assigned this task to BBlack.
RobH raised the priority of this task from to Medium.
RobH updated the task description. (Show Details)
RobH added projects: SRE, Documentation, Traffic.
RobH added subscribers: RobH, akosiaris, mark, Joe.

I initially assigned this to @BBlack, but it can be accomplished by anyone who understands varnish and the use of ban commands. (I've CC'd Mark, Alex, and Giuseppe since they were all responsive during the outage and/or sending these commands.)