I'm seeing a maps tile generation error on Icinga. It reports CRITICAL.
'CRITICAL: 100.00% of data under the critical threshold [5.0]'
I'm guessing this has to do with response time when generating map tiles.
I'm creating this to track it.
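For anyone reading the alert text later: the wording suggests the check fires when datapoints fall below 5.0, not above it. Here is a rough sketch of how a percentage-under-threshold check like this could work. The function names, the `max_percent` cutoff, and the handling of null datapoints are all assumptions for illustration, not the actual Icinga check code.

```python
def percent_under(datapoints, threshold):
    """Return the percentage of non-null datapoints strictly below threshold."""
    # Assumption: null datapoints (gaps in the metric) are ignored.
    values = [v for v in datapoints if v is not None]
    if not values:
        return None
    under = sum(1 for v in values if v < threshold)
    return 100.0 * under / len(values)


def evaluate(datapoints, critical, max_percent=50.0):
    """Build a status line; max_percent is a hypothetical cutoff."""
    pct = percent_under(datapoints, critical)
    if pct is None:
        return "UNKNOWN: no valid datapoints"
    status = "CRITICAL" if pct > max_percent else "OK"
    return f"{status}: {pct:.2f}% of data under the critical threshold [{critical}]"


if __name__ == "__main__":
    # Made-up sample values, all below 5.0.
    print(evaluate([0.0, 1.2, None, 3.4], 5.0))
```

With every sample below 5.0 this produces the same message as the alert above.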
Description
Related Objects
- Duplicates Merged Here
- T215629: OSM replication lag on both maps clusters
Event Timeline
This issue fell through the cracks and hasn't happened recently, but I'll braindump some stuff for the future:
- These alerts no longer page as of https://gerrit.wikimedia.org/r/c/operations/puppet/+/639154
- There was no related incident, but this kind of error might occur when the cluster is under heavy load: tilerator uses 50% of ncpu and has to compete for resources with the tile server