Page MenuHomePhabricator

Bug in client error logging stops any errors from being logged in group 0 wikis
Closed, ResolvedPublic

Description

The new method shouldLog returns false if shouldLogFileUrl returns true.
AS a result of this no errors are being logged in group 0 wikis

IT should be the other way round.

We should fix this before rolling the train further.

Event Timeline

Change 670569 had a related patch set uploaded (by Jdlrobson; owner: Jdlrobson):
[mediawiki/extensions/WikimediaEvents@master] Error in shouldLog logic drops most errors

https://gerrit.wikimedia.org/r/670569

Jdlrobson triaged this task as Unbreak Now! priority.Mar 10 2021, 7:59 PM

Change 670569 merged by jenkins-bot:
[mediawiki/extensions/WikimediaEvents@master] Error in shouldLog logic drops most errors

https://gerrit.wikimedia.org/r/670569

Change 670529 had a related patch set uploaded (by DannyS712; owner: Jdlrobson):
[mediawiki/extensions/WikimediaEvents@wmf/1.36.0-wmf.34] Error in shouldLog logic drops most errors

https://gerrit.wikimedia.org/r/670529

Change 670529 merged by jenkins-bot:
[mediawiki/extensions/WikimediaEvents@wmf/1.36.0-wmf.34] Error in shouldLog logic drops most errors

https://gerrit.wikimedia.org/r/670529

Mentioned in SAL (#wikimedia-operations) [2021-03-10T20:41:54Z] <brennen@deploy1002> Synchronized php-1.36.0-wmf.34/extensions/WikimediaEvents/modules/ext.wikimediaEvents/clientError.js: Backport: [[gerrit:670529|Error in shouldLog logic drops most errors (T277094)]] (duration: 01m 14s)

Change 670533 had a related patch set uploaded (by Urbanecm; owner: Urbanecm):
[mediawiki/extensions/WikimediaEvents@wmf/1.36.0-wmf.34] Revert "Error in shouldLog logic drops most errors"

https://gerrit.wikimedia.org/r/670533

Mentioned in SAL (#wikimedia-operations) [2021-03-10T21:30:08Z] <brennen> train status: 1.36.0-wmf.34 (T274938): logstash client error board was set up incorrectly; reverting earlier patch for T277094 and will proceed to group1.

Change 670533 merged by jenkins-bot:
[mediawiki/extensions/WikimediaEvents@wmf/1.36.0-wmf.34] Revert "Error in shouldLog logic drops most errors"

https://gerrit.wikimedia.org/r/670533

Mentioned in SAL (#wikimedia-operations) [2021-03-10T21:40:51Z] <brennen@deploy1002> Synchronized php-1.36.0-wmf.34/extensions/WikimediaEvents/modules/ext.wikimediaEvents/clientError.js: Backport: [[gerrit:670533|Revert "Error in shouldLog logic drops most errors" (T277094)]] (duration: 01m 08s)

Mentioned in SAL (#wikimedia-operations) [2021-03-10T21:43:22Z] <brennen> train status: 1.36.0-wmf.34 (T274938): client errors may still be missing for group0; continuing to hold for T277094 until we know what's broken.

Change 670585 had a related patch set uploaded (by Jdlrobson; owner: Jdlrobson):
[mediawiki/extensions/WikimediaEvents@master] Fix client error logging

https://gerrit.wikimedia.org/r/670585

Change 670535 had a related patch set uploaded (by Brennen Bearnes; owner: Jdlrobson):
[mediawiki/extensions/WikimediaEvents@wmf/1.36.0-wmf.34] Fix client error logging

https://gerrit.wikimedia.org/r/670535

Change 670535 merged by jenkins-bot:
[mediawiki/extensions/WikimediaEvents@wmf/1.36.0-wmf.34] Fix client error logging

https://gerrit.wikimedia.org/r/670535

Mentioned in SAL (#wikimedia-operations) [2021-03-10T22:25:16Z] <brennen@deploy1002> Synchronized php-1.36.0-wmf.34/extensions/WikimediaEvents/modules/ext.wikimediaEvents/clientError.js: Backport: [[gerrit:670535|Fix client error logging (T277094)]] (duration: 01m 09s)

Mentioned in SAL (#wikimedia-operations) [2021-03-10T22:26:37Z] <brennen> train status: 1.36.0-wmf.34 (T274938): T277094 believed resolved, promoting to group1.

Discovered we were dropping errors (around 500) for group 0 wikis due to eventgate_validation_error :
https://logstash.wikimedia.org/app/dashboards#/view/AXN5OoJu3_NNwgAUlbUT?_g=h@0e494b1&_a=h@ab7de48

Screen Shot 2021-03-10 at 2.17.07 PM.png (1×2 px, 246 KB)

The message was incorrectly being passed (due to type checking problem - something that typescript would help us with)
Rather than blocking the train we backported the fix to wmf34. Leaving as UBN as https://gerrit.wikimedia.org/r/c/mediawiki/extensions/WikimediaEvents/+/670585 still needs to be merged.

brennen subscribed.

Removing as a blocker for wmf.34 and adding for wmf.35, since it's been handled with the backport for .34.

Change 670585 merged by jenkins-bot:
[mediawiki/extensions/WikimediaEvents@master] clientError: Restore client error logging output

https://gerrit.wikimedia.org/r/670585

Jdlrobson lowered the priority of this task from Unbreak Now! to Needs Triage.Mar 12 2021, 3:22 PM

We are seeing errors for all group wikis now.