Page MenuHomePhabricator

Create a script that'll allow easier swatting of queries that're overloading ToolsDB
Closed, InvalidPublic

Description

So right now spotting queries that are being too resource intensive for no reason (such as: running 9 concurrent and exactly-the-same queries!) and swatting them is a bit hard. It involves:

  1. Finding which user is running these
  2. Killing the query
  3. Commenting out crontabs / suspending / killing jobs
  4. Emailing the user

We need to automate a fair bit more of this.

Event Timeline

yuvipanda raised the priority of this task from to Needs Triage.
yuvipanda updated the task description. (Show Details)
yuvipanda added projects: Cloud-Services, DBA.
yuvipanda added subscribers: yuvipanda, jcrespo, Legoktm.
chasemp subscribed.

With the deployment of T183983: Re-institute query killer for the analytics WikiReplica there is now a log placed at labsdb10XX:/var/log/wmf-pt-kill/wmf-pt-kill.log which can be parsed if someone from cloud-services-team still think this is a valid ticket.
I doubt the priority is high, as the query killer has been doing its job since we set up the new labsdb hosts, but not touching the priority myself here.

bd808 renamed this task from Create a script that'll allow easier swatting of queries that're overloading labsdb to Create a script that'll allow easier swatting of queries that're overloading ToolsDB.Jan 4 2020, 9:32 PM
bd808 edited projects, added Data-Services; removed Cloud-Services.
bd808 moved this task from Backlog to ToolsDB on the Data-Services board.