Page MenuHomePhabricator

Create WDQS Uptime SLO dashboard in Grizzly
Closed, ResolvedPublic3 Estimated Story Points

Description

Event Timeline

Gehel triaged this task as High priority.Nov 21 2022, 4:26 PM
Gehel moved this task from Incoming to Current work on the Wikidata-Query-Service board.
Gehel set the point value for this task to 3.Nov 21 2022, 4:45 PM

Change 862178 had a related patch set uploaded (by Ryan Kemper; author: Ryan Kemper):

[operations/grafana-grizzly@master] [WIP] add grizzly dashboard for WDQS uptime

https://gerrit.wikimedia.org/r/862178

Change 862178 merged by Ryan Kemper:

[operations/grafana-grizzly@master] add grizzly dashboard for WDQS uptime

https://gerrit.wikimedia.org/r/862178

ryankemper@grafana1002:/srv/grafana-grizzly$ grr diff slo_dashboards.jsonnet
Dashboard/slo-logstash-tmpl no differences
Dashboard/slo-trafficserver-tmpl no differences
Dashboard/slo-varnish-tmpl no differences
Dashboard/slo-wdqs-tmpl not present in Dashboard
Dashboard/slo-apigw no differences
Dashboard/slo-etcd-tmpl no differences
Dashboard/slo-haproxy-tmpl no differences
ryankemper@grafana1002:/srv/grafana-grizzly$ grr apply slo_dashboards.jsonnet
Dashboard/slo-logstash-tmpl no differences
Dashboard/slo-trafficserver-tmpl no differences
Dashboard/slo-varnish-tmpl no differences
Dashboard/slo-wdqs-tmpl added
Dashboard/slo-apigw no differences
Dashboard/slo-etcd-tmpl no differences
Dashboard/slo-haproxy-tmpl no differences
RKemper renamed this task from Create WDQS Uptime SLO && WDQS/WCQS update lag SLO dashboards in Grizzly to Create WDQS Uptime SLO dashboard in Grizzly.Dec 8 2022, 8:18 PM
RKemper updated the task description. (Show Details)
RKemper updated the task description. (Show Details)

Looking at https://grafana.wikimedia.org/d/slo-wdqs-tmpl/wdqs-slos-grizzly-template?orgId=1&var-datasource=thanos&var-site=All&var-cluster=All, I only see data for "Request Error Budget Remaining". The other graphs don't seem to be timing out, so I suspect there is an issue with the queries. I have not investigated further.

Moving this ticket back to needs review.

Change 867695 had a related patch set uploaded (by Ryan Kemper; author: Ryan Kemper):

[operations/grafana-grizzly@master] wdqs: fix request request error ratio sli pane

https://gerrit.wikimedia.org/r/867695

Change 879599 had a related patch set uploaded (by Ryan Kemper; author: Ryan Kemper):

[operations/puppet@production] [WIP] wdqs: add recording rule for req success ratio

https://gerrit.wikimedia.org/r/879599

Change 879606 had a related patch set uploaded (by Ryan Kemper; author: Ryan Kemper):

[operations/grafana-grizzly@master] [WIP] wdqs: use pre-computed wdqs recording rules

https://gerrit.wikimedia.org/r/879606

Change 879599 merged by Ryan Kemper:

[operations/puppet@production] wdqs: add recording rule for req success ratio

https://gerrit.wikimedia.org/r/879599

Change 883610 had a related patch set uploaded (by Ryan Kemper; author: Ryan Kemper):

[operations/puppet@production] wdqs: add recording rule for req success ratio

https://gerrit.wikimedia.org/r/883610

Change 883610 merged by Ryan Kemper:

[operations/puppet@production] wdqs: add recording rule for req success ratio

https://gerrit.wikimedia.org/r/883610

Change 879606 merged by Ryan Kemper:

[operations/grafana-grizzly@master] wdqs: use pre-computed wdqs recording rules

https://gerrit.wikimedia.org/r/879606

Change 867695 abandoned by Ryan Kemper:

[operations/grafana-grizzly@master] wdqs: fix request request error ratio sli pane

Reason:

obsoleted by https://gerrit.wikimedia.org/r/c/operations/grafana-grizzly/+/879606

https://gerrit.wikimedia.org/r/867695

Change 912944 had a related patch set uploaded (by Ryan Kemper; author: Ryan Kemper):

[operations/grafana-grizzly@master] wdqs: make uptime sli a %

https://gerrit.wikimedia.org/r/912944

Change 912944 merged by Ryan Kemper:

[operations/grafana-grizzly@master] wdqs: make uptime sli a %

https://gerrit.wikimedia.org/r/912944

Forgot to link patch but here's the (hopefully final) grizzly patch to get this where we want it: https://gerrit.wikimedia.org/r/c/operations/grafana-grizzly/+/917938

(patch already merged & deployed)