Improve visibility of WDQS inaccessibility
Closed, Resolved · Public

Description

During the last WDQS outage it was difficult to correctly ascertain its impact. The current graphs show error rates, but they also include business-as-usual (BAU) errors, such as rejections caused by throttling. Other than eyeballing the graphs or using more complicated statistical methods, it is impossible to determine which errors are caused by an outage. Because of this, it is hard to assess the general deterioration of the service. We need a way (a metric and/or a graph) to do that.

AC:

  • Provide a way to determine how many requests were denied because of an outage.
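
One possible shape for such a metric, as a rough sketch only: split responses by HTTP status so that BAU rejections are counted separately from outage-related failures. The mapping below is an assumption, not something stated in this task: throttling is assumed to surface as 429, banned clients as 403, and outage-related denials as 5xx. The names `classify` and `outage_denial_rate` are hypothetical.

```python
from collections import Counter

# Assumption: throttling rejections are reported as HTTP 429 and banned
# clients as 403; both are treated as business-as-usual (BAU) here.
BAU_STATUSES = {403, 429}


def classify(status: int) -> str:
    """Map an HTTP status code to a coarse category (hypothetical scheme)."""
    if status in BAU_STATUSES:
        return "bau_rejection"
    if 500 <= status <= 599:
        # Assumption: server-side errors are the ones caused by an outage.
        return "outage_error"
    return "ok" if status < 400 else "client_error"


def outage_denial_rate(statuses):
    """Return (denied_count, denied_fraction) for outage-related errors only."""
    counts = Counter(classify(s) for s in statuses)
    total = sum(counts.values())
    denied = counts["outage_error"]
    return denied, (denied / total if total else 0.0)
```

Graphing the `outage_error` count (or fraction) on its own, instead of a combined error rate, would keep throttling noise out of the outage signal. Whether 5xx is a good enough proxy for "denied because of an outage" would need checking against real traffic.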

Event Timeline

Restricted Application added a subscriber: Aklapper.
Gehel triaged this task as Medium priority. Sep 8 2020, 7:19 PM
Gehel claimed this task.
Gehel subscribed.

This is addressed as part of the WDQS Uptime SLO (T313751).