Right now we don't have anything other than basic host / disk space checks.
Description
Description
| Status | Subtype | Assigned | Task | ||
|---|---|---|---|---|---|
| Resolved | None | T90534 Make toolforge reliable enough (tracking) | |||
| Invalid | None | T90845 Set up sufficient monitoring for toollabs | |||
| Resolved | yuvipanda | T90847 Monitor toollabs home page to make sure it is up | |||
| Declined | None | T90850 Monitor bigbrother |
Event Timeline
Comment Actions
No, this is primarily about getting notified when something of tool labs is down. Keeping historic information and making stats from them would be nice in general for our monitoring, but AFAIK is also not there for production. Some monitoring tools do that at the same time (certain anomaly detection requires historic information).