We collect the metrics for the following wikis, and we should also add alerts for them:
- dewiki
- eswiki
- frwiki
- jawiki
- nlwiki
- ruwiki
- svwiki
- zhwiki
We collect the metrics for the following wikis, and we should also add alerts for them:
Status | Subtype | Assigned | Task | ||
---|---|---|---|---|---|
Resolved | Peter | T197799 Add performance test for more wikis than enwiki | |||
Resolved | Peter | T198287 Add alerts for all Browsertime/WebPageReplay wikis | |||
Resolved | Peter | T225987 Create runbook on what to do when we get an alert |
Updated today and will add the last one tomorrow. Updated the docs: https://wikitech.wikimedia.org/wiki/Performance/WebPageReplay/Alerts
I can do this after the vacation, lets think about, could there be a smarter way to do the alerts, instead of creating a dashboard for each? Templates aren't supported for alerts at the moment in Grafana.
We don't have a good way to do to add a lot of alerts (since templating doesn't work in Grafana). That means that to setup alerts for a new Wikipedia we probably should create a new dashboard so we don't overlap the current one. Let us wait for a while and see which is the best way forward.
These lives in a alert folder in Grafana: https://grafana.wikimedia.org/d/2kP3FjAZE/webpagereplay-ca-wikipedia-org-alerts?orgId=1
RU and AR is missing at the moment (we need more data to be collected) and then the icinga matching is missing.
Change 556108 had a related patch set uploaded (by Phedenskog; owner: Phedenskog):
[operations/puppet@production] icinga: Add all WebPageReplay alerts.
Change 556108 merged by Filippo Giunchedi:
[operations/puppet@production] icinga: Add all WebPageReplay alerts.
Change 558339 had a related patch set uploaded (by Phedenskog; owner: Phedenskog):
[operations/puppet@production] icinga: Add WebPageReplay alerts for ru.wiki
Change 558339 merged by Filippo Giunchedi:
[operations/puppet@production] icinga: Add WebPageReplay alerts for ru.wiki