Migrate the toolschecker monitoring system to Debian Stretch instances in the the eqiad1-r region with the Stretch job grid as the target for any grid engine monitoring.
Description
Details
Subject | Repo | Branch | Lines +/- | |
---|---|---|---|---|
toolschecker: Typo fix | operations/puppet | production | +1 -1 | |
wmcs: Migrate tools-checker to Stretch | operations/puppet | production | +993 -886 |
Status | Subtype | Assigned | Task | ||
---|---|---|---|---|---|
Resolved | • Bstorm | T204530 cloudvps: tools and toolsbeta trusty deprecation | |||
Resolved | aborrero | T187219 Remove support for Trusty Grid Engine exec hosts | |||
Resolved | bd808 | T219243 Migrate tools-checker system to Stretch | |||
Resolved | • Bstorm | T219817 Update grid-configurator to keep tools-checker nodes submit nodes |
Event Timeline
Apparently the check for the wikilabels DB is functioning but not added here yet:
https://github.com/wikimedia/puppet/blob/production/modules/icinga/manifests/monitor/toollabs.pp
It does currently work http://checker.tools.wmflabs.org/labsdb/wikilabelsrw
Just adding it to the pile.
Change 500095 had a related patch set uploaded (by BryanDavis; owner: Bryan Davis):
[operations/puppet@production] wmcs: Migrate tools-checker to Stretch
Mentioned in SAL (#wikimedia-cloud) [2019-03-29T20:22:38Z] <bd808> Disabled puppet on tools-checker-0{1,2} to make testing new role::wmcs::toolforge::checker easier (T219243)
Mentioned in SAL (#wikimedia-cloud) [2019-03-29T20:24:29Z] <bd808> Cherry-picked https://gerrit.wikimedia.org/r/#/c/operations/puppet/+/500095/ to tools-puppetmaster-01 for testing (T219243)
Mentioned in SAL (#wikimedia-cloud) [2019-03-29T20:32:36Z] <bd808> Creating tools-checker-03 with role::wmcs::toolforge::checker (T219243)
Mentioned in SAL (#wikimedia-cloud) [2019-03-29T21:08:58Z] <bd808> Updated cherry-pick of https://gerrit.wikimedia.org/r/#/c/operations/puppet/+/500095/ on tools-puppetmaster-01 (T219243)
Mentioned in SAL (#wikimedia-cloud) [2019-04-01T19:43:16Z] <bd808> Shutdown tools-checker-02 via Horizon (T219243)
Mentioned in SAL (#wikimedia-cloud) [2019-04-01T19:44:03Z] <bd808> Deleted tools-checker-02 via Horizon (T219243)
Mentioned in SAL (#wikimedia-cloud) [2019-04-02T03:54:58Z] <bd808> Added etcd service group to tools-k8s-etcd-* (T219243)
Mentioned in SAL (#wikimedia-operations) [2019-04-02T12:11:42Z] <arturo> icinga downtime toolschecker for 1 month T219243
Mentioned in SAL (#wikimedia-cloud) [2019-04-02T12:11:56Z] <arturo> icinga downtime toolschecker for 1 month T219243
Change 500095 merged by Andrew Bogott:
[operations/puppet@production] wmcs: Migrate tools-checker to Stretch
Change 501034 had a related patch set uploaded (by Bstorm; owner: Bstorm):
[operations/puppet@production] toolschecker: Typo fix
Change 501034 merged by Bstorm:
[operations/puppet@production] toolschecker: Typo fix