As an Engineering Manager of the team building Wikibase REST API I would like to know how it works and is being used in production environments so that the team can notice unexpected behaviour and have a basic understanding of API's usage in the particular production environment.
For the immediate need we'll utilize the data Mediawiki itself sends to WMF Graphite, and plot it in Grafana dashboard, understanding the limitations of that data. Create the Grafana dashboard in a new "WMDE Wikibase Product Platform" folder.
Data to monitor
- sum of all requests to Wikibase REST API in total and per endpoint
- number of errors (4xx, 5xx) in total and per endpoint
- number of success responses (one bucket containing everything in the 200-399 range) in total and per endpoint
Unless technically difficult getting numbers collected with the time granularity to a second would be ideal but we shall not intend to do more than the Mediawiki+Graphite allows.
Links: