Page MenuHomePhabricator

Varnish rate limiting has broken beta
Closed, ResolvedPublic

Description

http://meta.wikimedia.beta.wmflabs.org/wiki/Special:UserRights

Error: 429, Request Rate Exceeded at Wed, 11 Nov 2015 01:10:36 GMT

Event Timeline

Reedy raised the priority of this task from to Needs Triage.
Reedy updated the task description. (Show Details)
Reedy added subscribers: Reedy, BBlack.
Reedy triaged this task as High priority.Nov 11 2015, 1:31 AM
Reedy set Security to None.

Other victims are RESTBase integration tests, from either the office or travis. About 40 API requests distributed across a ~50 second test run.

Maybe oauth from labs is affected. Getting reports that login screens like:

https://tools.wmflabs.org/wikidata-game/

using the "widar" tool (https://tools.wmflabs.org/widar/) are broken:

< Reedy> The URI you have requested, /widar/index.php?action=authorize, appears to be non-functional at this time

edit: made a separate ticket for that: T118363

In case it helps with debugging, there seems to have been a multi-hour delay between the deploy (around 13:46 UTC) & blocking starting in the office. I noticed the first full block when running RESTBase tests just before 00:56 UTC. The same tests had been working okay about 40 minutes earlier.

Change 252385 had a related patch set uploaded (by Faidon Liambotis):
Revert "varnish: misspass limiter"

https://gerrit.wikimedia.org/r/252385

GWicke claimed this task.

Confirmed resolved for RESTBase tests as well. Most of those are actually hitting labs these days, so they were affected by the labs-specific outage.

Krenair subscribed.
<Krenair> I took a look through deployment-cache-text04:/root/.bash_history and found the command they used
<Krenair> Ran it on deployment-cache-mobile04
<Reedy> what was it?
<Krenair> #1447207753
<Krenair> service  varnish restart
<Krenair> #1447207761
<Krenair> service  varnish-frontend restart

Change 252385 abandoned by Faidon Liambotis:
Revert "varnish: misspass limiter"

Reason:
Everything seems to be working fine so far.

https://gerrit.wikimedia.org/r/252385