Splitting into a separate task for better tracking.
Tagging Platform and serviceops to see if we can get more runners for the Linter job queue temporarily.
This was also reported at https://en.wikipedia.org/wiki/Wikipedia_talk:Linter#Linter_slow
I think the task title is inaccurate.
Specifically:
So now we have a backlog of 9 million items. I'll double the concurrency of this job to try to reduce the strain.
Change 754096 had a related patch set uploaded (by Giuseppe Lavagetto; author: Giuseppe Lavagetto):
[operations/deployment-charts@master] cpjobqueue: double the concurrencty for recordlintjob
Change 754096 merged by Giuseppe Lavagetto:
[operations/deployment-charts@master] cpjobqueue: double the concurrencty for recordlintjob
We're now reducing the number of backlogged items at a rate of 25k/minute. At this pace, the backlog should be back near zero in 6 hours. I think this is a reasonable time for resolution. Leaving the task open so we can come back and assess further.
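For anyone wanting to redo the estimate as the numbers change, here is a minimal sketch of the arithmetic behind the 6-hour figure. It assumes the 25k/minute figure is the net drain rate (processing minus new inserts) and that it stays constant; the function name is made up for illustration.

```python
def minutes_to_drain(backlog_items: int, net_drain_per_minute: int) -> float:
    """Estimate how long a backlog takes to clear at a constant net drain rate."""
    if net_drain_per_minute <= 0:
        # With a zero or negative net rate the backlog never shrinks.
        raise ValueError("net drain rate must be positive for the backlog to shrink")
    return backlog_items / net_drain_per_minute


# Figures quoted in this task: ~9 million backlogged items, ~25k/minute net drain.
backlog = 9_000_000
rate = 25_000

minutes = minutes_to_drain(backlog, rate)
print(f"{minutes:.0f} minutes, about {minutes / 60:.0f} hours")
```

If the insert rate changes while the backlog drains, the net rate changes with it, so this is only a rough estimate.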
Some graphs for posterity:
- The processing *time* remained the same, even went down a bit
- The insert rate of linting jobs clearly went through the roof.
Kind of. The peaks are a bit higher, but there are also no significant gaps like there used to be, which probably gave it room to process any backlog. Off the top of my head, I'm not really sure why this would happen, short of editing patterns changing.
Change 754110 had a related patch set uploaded (by Arlolra; author: Arlolra):
[mediawiki/services/parsoid@master] Stop emitting "inline-media-caption" lints
Change 754110 merged by jenkins-bot:
[mediawiki/services/parsoid@master] Stop emitting "inline-media-caption" lints
Given that the lint is being removed, I think we're going to see the same activity in jobs, just in the other direction (deleting rows rather than inserting), so it would be nice to keep the extra concurrency through this week too.
I'd also suggest that the above Parsoid patch be backported to wmf.17, so that way if the train ends up getting rolled back again, even temporarily, we don't have unnecessary churn with wmf.18 deleting lints, wmf.17 re-adding them, and then wmf.18 deleting them again, etc.
My idea was to take the concurrency a bit higher than it is today in normal conditions,
I'd also suggest that the above Parsoid patch be backported to wmf.17, so that way if the train ends up getting rolled back again, even temporarily, we don't have unnecessary churn with wmf.18 deleting lints, wmf.17 re-adding them, and then wmf.18 deleting them again, etc.
That's for the Release-Engineering-Team and Parsoid to decide. @ssastry @thcipriani what do you think?
I think backporting the linter category suppression to wmf.17 is a good idea to avoid the back-and-forth in the event of train rollbacks.
But, practically, backporting the Parsoid change to wmf.17 is tricky since Parsoid goes through vendor (although in this instance the change is to a single file, so maybe we can scap just that file). I think it might be simpler to write a Linter extension patch to drop lint events against this category and then backport that to wmf.17. @Arlolra thoughts?
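To illustrate the idea of dropping lint events for one category before they are recorded, here is a rough sketch in Python. It is not the actual Linter extension code (which is PHP); the function name, the `SUPPRESSED_CATEGORIES` set, and the shape of the lint records (a dict with a `type` key) are all assumptions made up for this example.

```python
# Hypothetical set of lint categories to ignore; only 'inline-media-caption'
# is named in this task.
SUPPRESSED_CATEGORIES = {"inline-media-caption"}


def drop_suppressed(lint_errors):
    """Return only the lint records whose category is not suppressed.

    Each record is assumed to be a dict with a 'type' key naming its category.
    """
    return [e for e in lint_errors if e.get("type") not in SUPPRESSED_CATEGORIES]


# Illustrative input: one suppressed record and one that should be kept.
incoming = [
    {"type": "inline-media-caption", "location": [0, 10]},
    {"type": "missing-end-tag", "location": [12, 30]},
]

kept = drop_suppressed(incoming)
print([e["type"] for e in kept])
```

Filtering on the receiving side means the older branch stops recording the category even if Parsoid still emits it, which is the point of backporting to wmf.17.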
Change 754558 had a related patch set uploaded (by Subramanya Sastry; author: Subramanya Sastry):
[mediawiki/extensions/Linter@master] Drop 'inline-media-caption' lint requests
Change 754564 had a related patch set uploaded (by Subramanya Sastry; author: Subramanya Sastry):
[mediawiki/vendor@master] Bump wikimedia/parsoid to 0.15.0-a15
Change 754144 had a related patch set uploaded (by Subramanya Sastry; author: Subramanya Sastry):
[mediawiki/extensions/Linter@wmf/1.38.0-wmf.17] Drop 'inline-media-caption' lint requests
Change 754558 merged by jenkins-bot:
[mediawiki/extensions/Linter@master] Drop 'inline-media-caption' lint requests
Change 754564 merged by jenkins-bot:
[mediawiki/vendor@master] Bump wikimedia/parsoid to 0.15.0-a15
Change 754144 merged by jenkins-bot:
[mediawiki/extensions/Linter@wmf/1.38.0-wmf.17] Drop 'inline-media-caption' lint requests
Mentioned in SAL (#wikimedia-operations) [2022-01-18T08:12:50Z] <ladsgroup@deploy1002> Synchronized php-1.38.0-wmf.17/extensions/Linter/includes/RecordLintJob.php: Backport: [[gerrit:754144|Drop 'inline-media-caption' lint requests (T297443 T299302)]] (duration: 00m 52s)