Page MenuHomePhabricator

MediaWiki periodic job startupregistrystats-testwiki failed
Closed, ResolvedPublic

Description

Common information

  • alertname: MediaWikiCronJobFailed
  • label_cronjob: startupregistrystats-testwiki
  • label_team: mediawiki-platform
  • prometheus: k8s
  • severity: task
  • site: codfw
  • source: prometheus
  • team: mediawiki-platform

Firing alerts


  • dashboard: https://w.wiki/DocP
  • description: Use kube-env mw-cron codfw; kubectl get jobs -l team=mediawiki-platform,cronjob=startupregistrystats-testwiki --field-selector status.successful=0 to see failures
  • runbook: https://wikitech.wikimedia.org/wiki/Periodic_jobs#Troubleshooting
  • summary: MediaWiki periodic job startupregistrystats-testwiki failed
  • alertname: MediaWikiCronJobFailed
  • label_cronjob: startupregistrystats-testwiki
  • label_team: mediawiki-platform
  • prometheus: k8s
  • severity: task
  • site: codfw
  • source: prometheus
  • team: mediawiki-platform
  • Source

Event Timeline

Krinkle subscribed.

From the Logstash mw-cron dashboard, we can see the blameStartupRegistry.php maintenance script invocation has been failing since Tue Nov 4, as follows:

Script '/srv/mediawiki/php-1.46.0-wmf.1/extensions/WikimediaMaintenance/blameStartupRegistry.php' not found (tried path '/srv/mediawiki/php-1.46.0-wmf.1/extensions/WikimediaMaintenance/blameStartupRegistry.php' and class '/srv/mediawiki/php-1\46\0-wmf\1/extensions/WikimediaMaintenance/blameStartupRegistry\php').

Running it manually via mwscript on the deployment serve shows that it does return an error code:

[22:02 UTC] krinkle at deploy1003.eqiad.wmnet
$ mwscript extensions/WikimediaMaintenance/blameStartupRegistry.php testwiki
[22:02 UTC] krinkle at deploy1003.eqiad.wmnet
(exit=1) $ echo $?
1

This was cauased by these patches on Oct 29:

It went out on Tue 4 Nov with the train: https://sal.toolforge.org/production?p=0&q=%221.46.0-wmf.1%22&d=

Fixed on Sat 8 Nov by Taavi shortly after I reported this breakage on IRC (mediawiki-core): https://gerrit.wikimedia.org/r/c/mediawiki/extensions/WikimediaMaintenance/+/1203232

Other information

Change #1202872 had a related patch set uploaded (by Krinkle; author: Zabe):

[operations/puppet@production] mediawiki: Update location of startupregistrystats script

https://gerrit.wikimedia.org/r/1202872

Change #1203281 had a related patch set uploaded (by Krinkle; author: Majavah):

[mediawiki/extensions/WikimediaMaintenance@wmf/1.46.0-wmf.1] Fix symbolic links

https://gerrit.wikimedia.org/r/1203281

Change #1202872 merged by Effie Mouzeli:

[operations/puppet@production] mediawiki: Update location of startupregistrystats script

https://gerrit.wikimedia.org/r/1202872

Krinkle claimed this task.