This task captures the current (May 2023) status of cloudmetrics hosts and their components, and opens the discussion on what their future might be. Please feel free to edit the description if I (@fgiunchedi) missed something.
There are two cloudmetrics hosts, each running the following:
- graphite/statsd
- grafana
- prometheus
Given the big progress that's happened lately (on a multiple-months scale that is, also thanks to the excellent work by @taavi) with metricsinfra project, I'd like to pose the following questions:
- Is graphite/statsd on cloudmetrics still actively consumed/useful? (i.e. does anyone look at the metrics?)
- Ditto for grafana on cloudmetrics? (i.e. https://grafana-cloud.wikimedia.org). I believe the answer is likely "on its way out" given T333568: Move WMCS dashboards to grafana.wmcloud.org
Ideally both answers are 'no', but regardless I'd like to move the labs instance of Prometheus off cloudmetrics and onto the Prometheus production hardware. My understanding is that this move will help both WMCS (one less component to think about) and Observability (less variance/snowflakes).