Page MenuHomePhabricator

Replace Toolforge Stretch grid instances that were on failing cloudvirt
Closed, ResolvedPublic

Description

  • tools-sgewebgrid-generic-0901
  • tools-sgewebgrid-lighttpd-0901
  • tools-sgebastion-06

Event Timeline

tools-sgeexec-0902 and tools-sgeexec-0903 seem to be AWOL too.

Mentioned in SAL (#wikimedia-cloud) [2019-03-28T00:23:52Z] <bstorm_> T216060 created tools-sgewebgrid-generic-0901...again!

tools-sgewebgrid-generic-0901 is fully restored to service. The grid-configurator script wants to delete the two sssd-test nodes because it doesn't find files for them (something I should probably fix), but it was still handy because running in --dry-run mode gave me the command args to run by hand :-D It only left out the modification needed of the queue.

This was partially done and things have been fine for grid capacity.