Move labstore1005 to 10Gbps rack and ethernet
Closed, Resolved (Public)

Description

To be done during [insert scheduled downtime]

  • Put the system offline in all checks for the maintenance window
  • Stop Puppet, replication, and backups to codfw so the server can be taken down
  • Apply BIOS/iLO updates
  • Rebuild the system from labstore1005 to cloudstore1005, preserving /srv storage
  • Relocate to the 10G rack and update Netbox
  • Enable PXE for the 10G interfaces
  • Update the switch configuration for the new primary 10G NIC
  • Connect a crossover cable from labstore1004 to the secondary 10G NIC
  • Remove the old switch port info
  • PXE boot and reimage the system
  • Update Netbox with the new name and location
  • Reconnect DRBD
  • Update the switch and physical labels with the new name and location
  • Once DRBD is caught up, re-enable backups
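
The last step depends on knowing when DRBD is "caught up". As a rough sketch only, assuming DRBD 8.x-style `/proc/drbd` output (the ticket does not say which DRBD version or tooling is in use), the check could look like the following. The function name `drbd_caught_up` is hypothetical and not part of any existing tooling.

```python
import re

# Assumption: DRBD 8.x-style /proc/drbd, where each resource has a line like
#  0: cs:Connected ro:Secondary/Primary ds:UpToDate/UpToDate C r-----
_RESOURCE_LINE = re.compile(r"^\s*\d+:\s+cs:(\S+)(.*)$")
_DISK_STATE = re.compile(r"\bds:(\S+)")


def drbd_caught_up(status_text):
    """Return True only if every resource is Connected and UpToDate/UpToDate.

    Hypothetical helper: it errs on the side of "not ready" when nothing
    can be parsed or when a resource reports no disk state.
    """
    states = []
    for line in status_text.splitlines():
        match = _RESOURCE_LINE.match(line)
        if not match:
            continue  # version header or statistics line
        disk = _DISK_STATE.search(match.group(2))
        states.append((match.group(1), disk.group(1) if disk else None))

    if not states:
        return False  # no resources found: do not re-enable backups
    return all(
        cs == "Connected" and ds == "UpToDate/UpToDate" for cs, ds in states
    )
```

On the replica, this could be fed the contents of `/proc/drbd`, and backups re-enabled only once it returns `True`.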

Event Timeline

@Bstorm Do we need to update both 1004 and 1005 to 10G at the same time? I can convert 1005 to 10G anytime.

@Cmjohnson not really at the same time, no. If the 1Gb crossover cable works after converting the primary interface to 10Gb, then that's pretty easy to do. We don't know for sure if it will work because we've had issues in the past having 10Gb and 1Gb ports enabled in the BIOS at the same time :)

The main thing here is just making sure we are ready to re-image it safely (preserving the data) and doing that part fairly quickly so that we can get replication back up (this is a replica of labstore1004). If we can move just the primary interface, and things work, then it can be done well before labstore1004. If that fails, then we'll need to figure out a 10Gb crossover cable for the two of them and do them at the same time.

@Bstorm Sorry, I mixed up which one was already in a 10G rack. I need to confirm that 1004 is in C2 and can stay, and that 1005 is in C5 and needs to move.

Ahhh, ok. That makes it a bit trickier. This is the one that should move first because it is the standby.

@Bstorm I just found a space for labstore1005. Let's schedule the move for Monday if that works for you. It will go to C4.

1004 in C2
1005 in C4

I don't know if the re-image is ready at this time (haven't synced up with @Andrew on that), so today would probably not have worked anyway. I can verify by Monday, I think :) Monday works for me.

Change 647815 had a related patch set uploaded (by Bstorm; owner: Bstorm):
[operations/puppet@production] partman: build a recipe to re-image nfs servers

https://gerrit.wikimedia.org/r/647815

Change 647815 merged by Bstorm:
[operations/puppet@production] partman: build a recipe to re-image nfs servers

https://gerrit.wikimedia.org/r/647815

Change 650513 had a related patch set uploaded (by Bstorm; owner: Bstorm):
[operations/puppet@production] cloud nfs: change primary interface of labstore1005 to 10G

https://gerrit.wikimedia.org/r/650513

Change 650513 merged by Bstorm:
[operations/puppet@production] cloud nfs: change primary interface of labstore1005 to 10G

https://gerrit.wikimedia.org/r/650513

Change 650556 had a related patch set uploaded (by Andrew Bogott; owner: Andrew Bogott):
[operations/puppet@production] Change labstore1005 to role(insetup) so that we can log in

https://gerrit.wikimedia.org/r/650556

Change 650556 merged by Andrew Bogott:
[operations/puppet@production] Change labstore1005 to role(insetup) so that we can log in

https://gerrit.wikimedia.org/r/650556

Change 650562 had a related patch set uploaded (by Bstorm; owner: Bstorm):
[operations/puppet@production] cloud nfs: correct interfaces for 10G change

https://gerrit.wikimedia.org/r/650562

Change 650562 merged by Bstorm:
[operations/puppet@production] cloud nfs: correct interfaces for 10G change

https://gerrit.wikimedia.org/r/650562

Change 650586 had a related patch set uploaded (by Bstorm; owner: Bstorm):
[operations/puppet@production] cloud nfs: fix broken file reference

https://gerrit.wikimedia.org/r/650586

Change 650586 merged by Bstorm:
[operations/puppet@production] cloud nfs: fix broken file reference

https://gerrit.wikimedia.org/r/650586

Change 650599 had a related patch set uploaded (by Bstorm; owner: Bstorm):
[operations/puppet@production] cloud nfs: fix custom fact output while syncing drbd

https://gerrit.wikimedia.org/r/650599

Change 650599 merged by Bstorm:
[operations/puppet@production] cloud nfs: fix custom fact output while syncing drbd

https://gerrit.wikimedia.org/r/650599

This has been completed.