Page MenuHomePhabricator

Move or recable labstore1004 to 10Gbps rack (if needed) and ethernet
Closed, ResolvedPublic

Description

To be done during [insert scheduled downtime]

  • - put system offline in all checks for maint window
  • - stop backups from (labs|cloud)tore1005
  • - failover to labstore1005/cloudstore1005 to prepare for shutdown
  • - rename/rebuild system from labstore1004 to cloudstore1004, preserving /srv storage
  • - relocate to 10G rack and update netbox
  • - enable PXE for 10G interfaces.
  • - update switch configuration for new primary 10G NIC
  • - remove old switch port info
  • - PXE boot and reimage system
  • - WMCS to fail system back
  • - reconnect DRBD
  • - once DRBD is caught up, re-enable backups

Event Timeline

This server can stay in C2 and can be converted anytime.

Change 647815 had a related patch set uploaded (by Bstorm; owner: Bstorm):
[operations/puppet@production] partman: build a recipe to re-image nfs servers

https://gerrit.wikimedia.org/r/647815

Change 647815 merged by Bstorm:
[operations/puppet@production] partman: build a recipe to re-image nfs servers

https://gerrit.wikimedia.org/r/647815

Change 651545 had a related patch set uploaded (by Bstorm; owner: Bstorm):
[operations/puppet@production] cloud nfs: switch to the 10G interface for booting labstore1004

https://gerrit.wikimedia.org/r/651545

Mentioned in SAL (#wikimedia-operations) [2020-12-22T16:15:16Z] <bstorm> downtimed and stopped puppet on labstore1004 and labstore1005 for failover T266202

Ok, that has gone very badly. Working to fix.

Change 651564 had a related patch set uploaded (by Andrew Bogott; owner: Andrew Bogott):
[operations/puppet@production] labstore1004: move to 'insetup' role

https://gerrit.wikimedia.org/r/651564

Change 651564 merged by Bstorm:
[operations/puppet@production] labstore1004: move to 'insetup' role

https://gerrit.wikimedia.org/r/651564

Change 651545 merged by Bstorm:
[operations/puppet@production] cloud nfs: switch to the 10G interface for booting labstore1004

https://gerrit.wikimedia.org/r/651545

Change 651568 had a related patch set uploaded (by Bstorm; owner: Bstorm):
[operations/puppet@production] cloud nfs: get primary interface from puppet and create mountpoints

https://gerrit.wikimedia.org/r/651568

Change 651568 merged by Andrew Bogott:
[operations/puppet@production] cloud nfs: get primary interface from puppet and create mountpoints

https://gerrit.wikimedia.org/r/651568

Change 651575 had a related patch set uploaded (by Bstorm; owner: Bstorm):
[operations/puppet@production] labstore: remove excess variable from hiera

https://gerrit.wikimedia.org/r/651575

Change 651575 merged by Bstorm:
[operations/puppet@production] labstore: remove excess variable from hiera

https://gerrit.wikimedia.org/r/651575

Change 651608 had a related patch set uploaded (by Bstorm; owner: Bstorm):
[operations/puppet@production] cloud nfs: make sure the /srv/scratch dir is there and another fix

https://gerrit.wikimedia.org/r/651608

Change 651608 merged by Bstorm:
[operations/puppet@production] cloud nfs: make sure the /srv/scratch dir is there and another fix

https://gerrit.wikimedia.org/r/651608