Page MenuHomePhabricator

Move WMCS servers out of eqiad row B
Closed, ResolvedPublic

Description

Looking at T297083: [ceph] Getting rack level HA and subsequent servers moving tasks reminded me of this.

From Netbox, then manually curating the list to remove any hosts in the private or public vlans.

That leaves us with the cloud-hosts and cloud-storage WMCS servers still present in eqiad row B. Those were not moved on day 1 (when creating C8/D5) as they were too many at the time.
Moving those hosts to the dedicated WMCS racks would allow us to stop stretching the cloud-hosts and cloud-storage vlans between those racks. And remove the link connecting row B to C8 (and removing this now "snowflake").

NameRackPurchase date
cloudvirt1017B72017-05-05
cloudvirt1020B72017-07-21
cloudvirt1019B42017-07-21
cloudvirt1022B72017-12-19
cloudvirt1021B42017-12-19
cloudvirt1024B22018-06-06
cloudvirt1023B72018-06-06
cloudcephmon1001B72019-05-15
cloudvirt-wdqs1001B32019-10-16
cloudvirt-wdqs1002B52019-10-16
cloudvirt-wdqs1003B62019-10-16

All the ones purchased in 2017 and 2018 are due for a refresh, so no need to move them as their replacement will arrive in the proper racks. Is there a task tracking those decom/replacements?

That leave us with the 4 bolded hosts, so I'm wondering if cloudcephmon1001 could be moved as part of T297083: [ceph] Getting rack level HA and the cloudvirt-wdqs100x hosts when possible.

Event Timeline

cloudcephmon1001 can be moved yes, though it's not needed or a blocker for HA, so I'd do it after (HA is blocking upgrading the switches)

Actually, looking at https://wikitech.wikimedia.org/wiki/Network_design_-_Eqiad_WMCS_Network_Infra#/media/File:WMCS_network-L1.png, if x goes down, two mon hosts go down, cloudcephmon1001 and cloudcephmon1003, so to achieve rack HA we need cloudcephmon1001 moved to rack F or E, so re-prioritizing the subtask to move it as part of getting rack HA \o/

Andrew claimed this task.
Andrew subscribed.

None of the servers listed here are racked anymore, so I think this can be closed.