We have a few dashboards for maps services, but none give quite the right information to track user-facing service reliability. For example, we should be able to track the count and proportion of static mapframe thumbnail requests which fail, or are missing mapdata.
Remaining work:
- Verify service request metrics on maps-performances board against webrequest logs
- The current metrics seems to not be accurate, they could be as much as 3x higher than the actual request counts.
- Panels showing the error rate per service and proportion of requests affected
- Not making panels yet. Possibly inaccurate and not very useful prototype panel here.
- Geoshapes have roughly a 15% error rate, snapshots are at 7%.