Page MenuHomePhabricator

Create an easy way to observe/monitor Wikibase REST API errors happening on Wikidata
Closed, ResolvedPublic

Description

As an Engineering in the team building Wikibase REST API I would like to know what errors happen in production environments so that I can understand what requests led to an error and reason about those cases.

Errors on Wikidata are all sent to WMF's Logstash: https://logstash.wikimedia.org/
Having a way to quickly see and further filter all errors that happen on test.wikidata.org and www.wikidata.org in requests to REST API would likely suffice for the purpose of that story

Event Timeline

Change 883139 had a related patch set uploaded (by Jakob; author: Jakob):

[mediawiki/extensions/Wikibase@master] REST: Use error log level for unexpected errors

https://gerrit.wikimedia.org/r/883139

Change 883139 merged by jenkins-bot:

[mediawiki/extensions/Wikibase@master] REST: Use error log level for unexpected errors

https://gerrit.wikimedia.org/r/883139

Change 883224 had a related patch set uploaded (by Jakob; author: Jakob):

[mediawiki/extensions/Wikibase@wmf/1.40.0-wmf.20] REST: Use error log level for unexpected errors

https://gerrit.wikimedia.org/r/883224

Change 883224 merged by jenkins-bot:

[mediawiki/extensions/Wikibase@wmf/1.40.0-wmf.20] REST: Use error log level for unexpected errors

https://gerrit.wikimedia.org/r/883224

Mentioned in SAL (#wikimedia-operations) [2023-01-25T14:21:10Z] <urbanecm@deploy1002> Started scap: Backport for [[gerrit:883224|REST: Use error log level for unexpected errors (T327490)]], [[gerrit:883547|User impact: amend incorrect parameter for the single day streak text (T327824)]]

Mentioned in SAL (#wikimedia-operations) [2023-01-25T14:39:58Z] <urbanecm@deploy1002> jakob and sgimeno and urbanecm: Backport for [[gerrit:883224|REST: Use error log level for unexpected errors (T327490)]], [[gerrit:883547|User impact: amend incorrect parameter for the single day streak text (T327824)]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet

Mentioned in SAL (#wikimedia-operations) [2023-01-25T14:53:31Z] <urbanecm@deploy1002> Finished scap: Backport for [[gerrit:883224|REST: Use error log level for unexpected errors (T327490)]], [[gerrit:883547|User impact: amend incorrect parameter for the single day streak text (T327824)]] (duration: 32m 21s)

WMDE-leszek claimed this task.