Page MenuHomePhabricator

Lower the available slots for the dump of enwiki to lower presure on databases
Closed, ResolvedPublic

Description

Coming back to:

I think actually @Ladsgroup's proposal seems like the easier solution on the short term. @xcollazo do you see any complication with reducing parallelism of the enwiki dumps?

After perusing code, the only existing mechanism to reduce parallelism in the current dumps infrastructure seems to be slots mechanism. For each snapshot server, we can control how many concurrent jobs are allowed by modifying this variable. The current maxslots is set to 28 slots per host in the profile::dumps::generation::worker::dumper::maxjobs puppet config.

However, this config is global, meaning all jobs from the dumps will be affected. Dumping already takes around ~18 days to finish a full dump, and around ~8 days to finish a partial dump, which leaves us with just a couple more days in the month to do reruns or restarts when needed.

I want to offer a solution though, in next comment.

If we can modify the puppet config to accept a lower amount of slots for snapshot1012, which runs enwiki dumps, we may be able to reduce load on the databases and avoid issues such as T368098#10038621 and T368098#10111093.

Event Timeline

CC @BTullis.

I think we should be able to do this and set it to, say, 18 (66% of current) for a whole run and see how late enwiki is and whether this avoids the lag issues seen in T368098#10038621 and T368098#10111093.

BTullis triaged this task as High priority.

Yes, I can take a look at this.

Change #1070558 had a related patch set uploaded (by Btullis; author: Btullis):

[operations/puppet@production] Lower the number of slots that the enwiki dump uses

https://gerrit.wikimedia.org/r/1070558

Change #1070558 merged by Btullis:

[operations/puppet@production] Lower the number of slots that the enwiki dump uses

https://gerrit.wikimedia.org/r/1070558

Change #1080265 had a related patch set uploaded (by Btullis; author: Btullis):

[operations/puppet@production] Revert "Lower the number of slots that the enwiki dump uses"

https://gerrit.wikimedia.org/r/1080265

Change #1080265 merged by Btullis:

[operations/puppet@production] Revert "Lower the number of slots that the enwiki dump uses"

https://gerrit.wikimedia.org/r/1080265