Add monitoring of API endpoints for API gateway
Open, Needs Triage, Public

Description

Because /feeds/ was not monitored, a bad config push left this endpoint broken for a period of time. We should add monitoring that alerts us when /feeds/, /core, or other endpoints stop serving responses.

Ideally we would also have per-service monitoring for discovery services, but there is less flexibility there.
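As a rough illustration of the kind of endpoint check described above, here is a minimal Python sketch of a probe that requests each path and reports the ones that do not serve a successful response. The base URL, the helper names (`http_status`, `probe`, `failing`), and the rule that only 2xx/3xx counts as healthy are assumptions made for this example, not the actual gateway configuration or an existing monitoring tool.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List, Optional
from urllib.error import HTTPError
from urllib.request import urlopen

# Hypothetical values: the real gateway host and endpoint list may differ.
BASE_URL = "https://api.example.org"
ENDPOINTS = ["/feeds/", "/core"]


@dataclass
class ProbeResult:
    path: str
    status: Optional[int]  # None when no HTTP response was received at all
    ok: bool


def http_status(url: str, timeout: float = 5.0) -> Optional[int]:
    """Return the HTTP status code for url, or None if no response arrived."""
    try:
        with urlopen(url, timeout=timeout) as resp:
            return resp.status
    except HTTPError as err:
        # The server answered, but with an error status (4xx/5xx).
        return err.code
    except OSError:
        # DNS failure, refused connection, timeout, and similar.
        return None


def probe(
    paths: Iterable[str],
    fetch: Callable[[str], Optional[int]] = http_status,
    base_url: str = BASE_URL,
) -> List[ProbeResult]:
    """Check each path; assumption: only 2xx and 3xx count as healthy."""
    results = []
    for path in paths:
        status = fetch(base_url + path)
        ok = status is not None and 200 <= status < 400
        results.append(ProbeResult(path, status, ok))
    return results


def failing(results: Iterable[ProbeResult]) -> List[str]:
    """Return the paths that should trigger an alert."""
    return [r.path for r in results if not r.ok]
```

Calling `failing(probe(ENDPOINTS))` on a schedule and alerting on a non-empty result would cover the /feeds/ case. The `fetch` parameter is injectable so the check can be exercised without network access.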

Event Timeline

For reference, metrics for public REST endpoints backed by MediaWiki were added last week: https://grafana.wikimedia.org/d/2pCoanoVk/mw-rest-api?orgId=1 (T321969)

There is a partially defunct dashboard for endpoints routed through RESTbase: https://grafana.wikimedia.org/d/000000559/api-requests-breakdown?orgId=1

As API Gateway is now owned by serviceops, I am adding the serviceops project tag to open API Gateway tasks that carry the deprecated/archived "Platform Team Initiatives (API Gateway)" tag (https://phabricator.wikimedia.org/project/profile/4321/), as part of Phabricator Housekeeping.

@hnowlan: Removing task assignee as this open task has been assigned for more than two years - See the email sent on 2025-05-22.
Please assign this task to yourself again if you still realistically [plan to] work on this task - it would be welcome!
If this task has been resolved in the meantime, or should not be worked on by anybody ("declined"), please update its task status via "Add Action… 🡒 Change Status".
Also see https://www.mediawiki.org/wiki/Bug_management/Assignee_cleanup for tips on how to best manage your individual work in Phabricator. Thanks!