Page MenuHomePhabricator

Add monitoring of API endpoints for API gateway
Open, Needs TriagePublic

Description

Lack of monitoring of /feeds/ meant that a bad config push left this endpoint broken for a period of time. We should have monitoring that will alert us to changes in behaviour that fail to serve responses at /feeds/, /core and other endpoints.

Ideally we would also have monitoring on a per-service basis for discovery services but there is less flexibility there.

Event Timeline

For reference, metrics for public REST endpoints backed by MediaWiki have been added last week: https://grafana.wikimedia.org/d/2pCoanoVk/mw-rest-api?orgId=1 (T321969)

There is a partially defunct dashboard for endpoints routed through RESTbase: https://grafana.wikimedia.org/d/000000559/api-requests-breakdown?orgId=1

As API Gateway is nowadays owned by serviceops, adding the serviceops project tag to open API Gateway tasks tagged with the deprecated/archived "Platform Team Initiatives (API Gateway)" tag at https://phabricator.wikimedia.org/project/profile/4321/, as part of Phabricator Housekeeping.