Page MenuHomePhabricator

Add a jobrunner server to the Scap canary pool
Open, LowPublic

Description

Follows-up:

  1. Ensure that (if not already) at least one job runner is included in the list of canary servers that Scap uses for deploying MediaWiki code. This alone will already be an improvement, as any hits for mediawiki/exception, mediawiki/error or hhvm that only happen in job runner context would then be caught early.
  2. Include ERROR (and higher) severity messages from the mediawiki/runJobs channel in the Logstash query for canary monitoring.
  3. Once the jobrunner and jobchron service logs are indexed by Logstash, include ERROR (and higher) severity messages in the Logstash query.

Note that the jobrunner and jobchron services are independent PHP CLI programs (not MediaWiki cli scripts) so their logs will have a different type, and are not presently included anywhere else.

Event Timeline

greg subscribed.

Adding our Release-Engineering-Team (Kanban) project as we would like to work on this in the coming quarter or two (no promises though, this is not a "goal" only "other hoped for work").

Krinkle renamed this task from Add jobrunners to Scap canary process to Add jobrunner servers to Scap canary process.Jul 12 2018, 3:59 AM
Krinkle added a project: WMF-JobQueue.
Krinkle moved this task from Untriaged to Meta on the WMF-JobQueue board.
Krinkle renamed this task from Add jobrunner servers to Scap canary process to Add a jobrunner server to the Scap canary pool.May 7 2020, 10:28 PM