Dashboard link => https://grafana-rw.wikimedia.org/d/xiWr1c5Iz/search-slos?forceLogin&forceLogin&orgId=1&from=now-1y&to=now
All SLOs defined in T335498 are measured and exposed as dashboards so that we can ensure that SLOs are met or that actions are taken to meet them.
Note that at the moment we have defined the SLIs (what we want to measure), but the SLOs themselves (the levels we want to achieve) aren't entirely clear yet. We will define them after seeing the current measurements. They will probably take the form "latency is below 100 milliseconds for the 95th percentile of requests, 99.9% of the time, over 3 months" (numbers subject to change). It would be useful to expose those numbers as parameters in the dashboards so that we can play with them until we decide on a final SLO.
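To make the shape of such an SLO concrete, here is a minimal Python sketch of how the three tunable numbers (latency threshold, percentile, target fraction of time) could be treated as parameters. It is only an illustration: the names `SloParams`, `nearest_rank_percentile`, `window_is_good` and `slo_report` are hypothetical, and it assumes "of the time" is evaluated over fixed time windows (for example one window per minute), which has not been decided.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class SloParams:
    """Tunable SLO numbers (hypothetical names; defaults are the draft values)."""
    latency_threshold_ms: float = 100.0
    percentile: float = 95.0
    target_fraction: float = 0.999


def nearest_rank_percentile(samples, percentile):
    """Return the nearest-rank percentile of a non-empty list of samples."""
    if not samples:
        raise ValueError("cannot compute a percentile of an empty window")
    ordered = sorted(samples)
    rank = max(1, math.ceil(percentile * len(ordered) / 100.0))
    return ordered[rank - 1]


def window_is_good(latencies_ms, params):
    """A window is good when its percentile latency is below the threshold."""
    observed = nearest_rank_percentile(latencies_ms, params.percentile)
    return observed < params.latency_threshold_ms


def slo_report(windows, params):
    """Return (fraction of good windows, whether the target is met)."""
    if not windows:
        raise ValueError("no windows to evaluate")
    good = sum(1 for w in windows if window_is_good(w, params))
    fraction = good / len(windows)
    return fraction, fraction >= params.target_fraction
```

Calling `slo_report` with different `SloParams` values on the same measurements shows how strict or loose a candidate SLO would be, which is the kind of exploration the dashboard parameters are meant to allow.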
Our approach to SLOs is documented on Wikitech. Dashboards are usually implemented as Grizzly dashboards. In this case, the data is likely to come from the search satisfaction schema or web request logs, so it might be easier to track them as Superset dashboards, but this would make integration with other SLOs and reporting more complex.
AC
- A decision is made on where to create the dashboards (Superset / Grafana Grizzly dashboards)
- Dashboards exist for all the SLOs defined in T335498