Create an alarm if we are blocked by 429s
Open, High · Public · 3 Estimated Story Points

Description

When our infrastructure is being blocked by 429 (Too Many Requests) responses, I want to see an alarm triggered in PagerDuty, so that I can react to the event in a timely manner.

Problem: When we are blocked by 429 responses, we wait until we are allowed to query the API again.

Acceptance criteria
When we are blocked by 429s, the support engineer gets a notification.

ToDo

  • add an alarm to IaC
  • trigger PagerDuty alarm based on that
  • test it

Notes
We can monitor the logs of the structured-data service to do this. All of the log entries that we need to catch will contain 429 Too Many Requests. Here's an example:

/app/submodules/wmf/wmf.go:589: 429 Too Many Requests:
{
    "error": {
        "code": "too_many_requests",
        "message": "A limited number of parallel connections per IP is allowed."
    }
}
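
The log-matching idea in the notes can be sketched in Go as follows. This is only an illustration of the matching rule, not the service's actual code: the function names count429 and shouldAlarm and the threshold parameter are hypothetical, and the real alarm would more likely be a log filter defined in IaC than application code.

```go
package main

import (
	"bufio"
	"fmt"
	"strings"
)

// marker is the substring that, per the notes, every relevant log entry contains.
const marker = "429 Too Many Requests"

// count429 scans log text line by line and returns the number of lines
// that contain the marker. (Hypothetical helper, for illustration only.)
func count429(logs string) int {
	n := 0
	sc := bufio.NewScanner(strings.NewReader(logs))
	for sc.Scan() {
		if strings.Contains(sc.Text(), marker) {
			n++
		}
	}
	return n
}

// shouldAlarm reports whether the number of matching lines reaches the
// threshold. The threshold is an assumption; the task does not define one.
func shouldAlarm(logs string, threshold int) bool {
	return count429(logs) >= threshold
}

func main() {
	sample := "/app/submodules/wmf/wmf.go:589: 429 Too Many Requests:\n" +
		"{\n" +
		"    \"error\": {\n" +
		"        \"code\": \"too_many_requests\"\n" +
		"    }\n" +
		"}\n"
	fmt.Println(count429(sample), shouldAlarm(sample, 1))
}
```

Matching on the full phrase rather than on the bare number 429 avoids false positives from unrelated numbers in the logs, such as line numbers or IDs.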

Related task

https://phabricator.wikimedia.org/T342171

Event Timeline

Protsack.stephan renamed this task from "Add an alarm for if we are blocked by 429s" to "Create an alarm if we are blocked by 429s". Aug 16 2023, 12:20 PM
Protsack.stephan updated the task description.
Protsack.stephan updated the task description.
JArguello-WMF lowered the priority of this task from High to Medium. Aug 16 2023, 2:44 PM
JArguello-WMF updated the task description.
JArguello-WMF set the point value for this task to 3.
JArguello-WMF lowered the priority of this task from Medium to Low. Aug 29 2023, 2:12 PM
JArguello-WMF raised the priority of this task from Low to High. Mar 21 2024, 2:22 PM