During this stress test (https://grafana-admin.wikimedia.org/dashboard/db/ores?orgId=1&from=1513256100000&to=1513257600000),
the timeout error graph shows about 40 timeout errors, but the testing tool reported more than 13,000.
There's one code path that can throw a TimeoutError without adding to this metric: the outer timeout in ores/util.py. It's interesting that we're hitting this code; I don't think that should be happening.
In the meantime, I'll have that code path emit metrics.
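For illustration only, here is a minimal sketch of an outer timeout that counts the error before re-raising it. This is not the code in ores/util.py: `CountingMetrics`, `call_with_outer_timeout`, and the metric name `timeout_error.outer` are hypothetical, and the thread-based timeout is just a stand-in technique to keep the example self-contained.

```python
import concurrent.futures
import time
from collections import Counter


class CountingMetrics:
    """Hypothetical in-memory stand-in for a statsd-style metrics collector."""

    def __init__(self):
        self.counts = Counter()

    def incr(self, name):
        self.counts[name] += 1


def call_with_outer_timeout(func, seconds, metrics, *args, **kwargs):
    """Run func(*args, **kwargs); count and raise TimeoutError if it exceeds `seconds`."""
    executor = concurrent.futures.ThreadPoolExecutor(max_workers=1)
    future = executor.submit(func, *args, **kwargs)
    try:
        return future.result(timeout=seconds)
    except concurrent.futures.TimeoutError:
        # Record the timeout before propagating it, so this path is never silent.
        metrics.incr("timeout_error.outer")
        raise TimeoutError("timed out after {0} seconds".format(seconds))
    finally:
        # Do not block on a worker thread that may still be running.
        executor.shutdown(wait=False)
```

With something like this, every timeout raised by the outer wrapper would show up in the graph, so a gap between the graph and the testing tool would have to come from somewhere else.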
The last comment was wrong: I now see how the timeout is caught and the metrics are recorded. I currently can't find any code path that would explain the missing metrics.