I noticed KubernetesAPILatency firing relatively frequently, to the point that it is essentially noise.
On further inspection, API latency appears to spike whenever scap runs, e.g. in codfw:
And the SAL entries related to scap line up pretty well:
2023-10-04
22:02 <brennen@deploy2002> Finished scap: Backport for [[gerrit:963351|Revert "Deprecate TOC mutation in OutputPageParserOutput hook" (T348134)]] (duration: 09m 13s) [production]
21:53 <brennen@deploy2002> Started scap: Backport for [[gerrit:963351|Revert "Deprecate TOC mutation in OutputPageParserOutput hook" (T348134)]] [production]
20:54 <urbanecm@deploy2002> Finished scap: Backport for [[gerrit:963347|SpecialManageMentors: Skip OOUI initialization when transcluding (T346760)]], [[gerrit:963348|SpecialManageMentors: Skip OOUI initialization when transcluding (T346760)]], [[gerrit:963349|Fix phan for GrowthExperiments (T347571)]] (duration: 07m 49s) [production]
20:46 <urbanecm@deploy2002> Started scap: Backport for [[gerrit:963347|SpecialManageMentors: Skip OOUI initialization when transcluding (T346760)]], [[gerrit:963348|SpecialManageMentors: Skip OOUI initialization when transcluding (T346760)]], [[gerrit:963349|Fix phan for GrowthExperiments (T347571)]] [production]
14:31 <lucaswerkmeister-wmde@deploy2002> Finished scap: Backport for [[gerrit:963307|prod: Enable wgCampaignEventsEnableEmail in meta and officewiki (T347065)]] (duration: 18m 26s) [production]
14:12 <lucaswerkmeister-wmde@deploy2002> Started scap: Backport for [[gerrit:963307|prod: Enable wgCampaignEventsEnableEmail in meta and officewiki (T347065)]] [production]
14:00 <lucaswerkmeister-wmde@deploy2002> Finished scap: Backport for [[gerrit:963305|beta: Explicitly assign campaignevents-email-participants to all users (T336939)]], [[gerrit:963306|metawiki: Restrict campaignevents-email-participants right (T336939)]] (duration: 10m 40s) [production]
13:49 <lucaswerkmeister-wmde@deploy2002> Started scap: Backport for [[gerrit:963305|beta: Explicitly assign campaignevents-email-participants to all users (T336939)]], [[gerrit:963306|metawiki: Restrict campaignevents-email-participants right (T336939)]] [production]
13:46 <lucaswerkmeister-wmde@deploy2002> Finished scap: Backport for [[gerrit:963036|fonwiki: add wgSiteName, wgMetaNamespace and timezone (T347939)]] (duration: 13m 46s) [production]
13:33 <lucaswerkmeister-wmde@deploy2002> Started scap: Backport for [[gerrit:963036|fonwiki: add wgSiteName, wgMetaNamespace and timezone (T347939)]] [production]
13:20 <lucaswerkmeister-wmde@deploy2002> Finished scap: Backport for [[gerrit:963066|fonwiki: add logos (T347939)]] (duration: 11m 43s) [production]
13:08 <lucaswerkmeister-wmde@deploy2002> Started scap: Backport for [[gerrit:963066|fonwiki: add logos (T347939)]]
The spike could certainly be investigated, though in the meantime I think we can bump either the alert threshold or the "for" clause to avoid spurious alerts.
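
To illustrate the two options, here is a rough sketch of how a threshold and a "for" duration interact. It is a simplified model of pending/firing alert semantics, not the actual KubernetesAPILatency rule; the function name, the latency values, the thresholds and the durations are all made up for illustration.

```python
def firing_times(samples, threshold, for_seconds):
    """Return the evaluation timestamps at which the alert would fire.

    samples: list of (timestamp_seconds, latency_seconds), one per rule
    evaluation. The alert goes pending when latency exceeds the threshold
    and fires once it has stayed above it for at least for_seconds.
    """
    pending_since = None
    fired = []
    for timestamp, latency in samples:
        if latency > threshold:
            if pending_since is None:
                pending_since = timestamp
            if timestamp - pending_since >= for_seconds:
                fired.append(timestamp)
        else:
            # Condition cleared: the pending state resets.
            pending_since = None
    return fired


# Hypothetical data: one evaluation per minute for 30 minutes, with a
# 10-minute spike (1.2s) on top of a 0.2s baseline.
samples = [(t, 1.2 if 300 <= t < 900 else 0.2) for t in range(0, 1800, 60)]

current = firing_times(samples, threshold=0.5, for_seconds=300)
longer_for = firing_times(samples, threshold=0.5, for_seconds=900)
higher_threshold = firing_times(samples, threshold=1.5, for_seconds=300)

print(len(current), len(longer_for), len(higher_threshold))
```

In this toy data both changes silence the deploy-time spike. A longer "for" only helps if it exceeds how long the spike lasts, while a higher threshold only helps if it sits above the spike's peak, so the choice depends on what the real graphs show.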