Page MenuHomePhabricator

[scap] Suppress/de-emphasize errors from hosts marked has being under maintenance
Open, MediumPublic

Description

The management of the dsh group files used to select hosts that should be synced by scap has been moved to a resource collection based system in operations/puppet (https://gerrit.wikimedia.org/r/#/c/179121/). This may lead to hosts being in the mediawiki-installation group which are known to be broken.

To prevent error fatigue on the part of deployers, it would be nice to find a reasonable way to let the deployer know that although an error was seen syncing host X it was in some way expected. @Joe has suggested that checking icinga may give a good indication of expected down/broken hosts. This would require some amount of reworking the error signalling mechanisms internally in scap.

Event Timeline

bd808 raised the priority of this task from to Needs Triage.
bd808 updated the task description. (Show Details)
bd808 changed Security from none to None.
bd808 subscribed.
greg triaged this task as High priority.Dec 11 2014, 7:33 PM
greg moved this task from To Triage to Next: Maintenance on the Deployments board.
greg added subscribers: mmodell, Reedy.
greg subscribed.
greg lowered the priority of this task from High to Medium.Apr 8 2015, 10:33 PM
greg moved this task from Next: Maintenance to Backlog (Tech) on the Deployments board.
greg added a subscriber: Joe.
hashar renamed this task from [scap] Suppress/de-emphasize errors from hosts marked for maintenance in icinga to [scap] Suppress/de-emphasize errors from hosts marked has being under maintenance.May 29 2015, 12:21 PM