Context
Since Graphite has long been EOL and statsd (The API we use to post metrics to the Grafana backend) will be replaced with a new one, supporting Prometheus (statslib), we will embark on a series of tasks to perform the migration and also clean up our metrics and dashboards in Grafana.
WMF: T228380 Tech debt: sunsetting of Graphite
Main Objectives
-
Clean up some deprecated and unused tables in graphite so that they are not migrated to Prometheus.- Because of the deadline we just migrated all of it
- All of the API calls are made to a Prometheus backend (and alongside to Graphite, for an interim phase) using the new statslib API across the variety of extensions supporting Wikidata.
- 🚧 Ensure that Grafana dashboards are querying Prometheus instead of Graphite.
- 🚧 Standardization of the presentation of and context for metrics across Grafana Dashboards.
- 🚧 Remove all unused Prometheus/Graphite data processes given new Grafana dashboards.
- 🚧 Mark historical dashboards that Product wants to keep as archived/deprecated (ex: WD co-editors)
- Eventually deprecate the copying of stats to Graphite.
Mitigated Risks
- Graphite is EOL and achieving these objectives will allow us to stop using it.
Statsd uses abandoned libraries in it's upstream and so migrating to statslib will mean we will have one fewer unmaintained weak points in out dependencies. T326607
- Additional stakeholders from MediaWiki Ecosystem will not have to perform these tasks for us with little context.
- Broken Dashboards and Tables in Grafana erodes the trust in the Analytics data we are presenting to Users and Engineers
Potential Tasks
- See WMDE Grafana Cleanup Notes for an overview of the current issues with Grafana