Page MenuHomePhabricator

Throttle big monthly jobs - network saturation
Closed, ResolvedPublic

Description

4 monthly jobs shuffle a lot of data:

  • cassandra mediarequests top files
  • cassandra pageview top articles
  • unique devices per project family
  • unique devices per domain

When the jobs are run on their usual schedule, they have to share the cluster resources, making them naturally throttled. However this month we had to run one job at a different time and it saturated the network, impacting production traffic.
We need to throttle the jobs to make sure they don't saturate the network as for the moment we don't have QoS setup that would prioritize production traffic against analytics traffic.

Event Timeline

Change 789797 had a related patch set uploaded (by Joal; author: Joal):

[analytics/refinery@master] Throttle heavy monthly jobs

https://gerrit.wikimedia.org/r/789797

Change 789797 merged by Mforns:

[analytics/refinery@master] Throttle heavy monthly jobs

https://gerrit.wikimedia.org/r/789797