This task will track the work required to prepare/stage, and then swap out the failed PDU tower in a2-eqiad. Details are as follows:
* ps1-a2-eqiad is a dual input (tower A and B combined in a single PDU chassis) with 24 ports per tower.
* ps1-a2-eqiad has had failures occur on its phases, either due to the PDU failing, or due to phase imbalance that cannot be corrected due to the limited number of power plugs per tower (only 24).
** Chris will swap out the existing/failing ps1-a2-eqiad and put in a spare dual wide, 42 port per tower PDU. This isn't as ideal as a brand new PDU (via T210776), but the new PDU has a 30 day lead time.
* All systems in a2-eqiad have to be reviewed, as downtime could result.
** All precautions will be taken to try to migrate PDUs without downtime, but nothing is a certainty when dealing with the power feeds into our rack.
[] - list off all systems in a2-eqiad, check with service owners and schedule a downtime date before Chris leaves for all hands.
[] - @cmjohnson stages new PDU adjacent or in rack, and unplugs the failed side of the existing PDU, plugging in one side of the replacement PDU
[] - @cmjohnson migrates the now de-energized side of the old PDU plugs into the replacement PDU, returning redundant power to all devices
[] - @cmjognson de-energizes the remaining side of old PDU, energizing the replacement PDU fully, and migrates all remaining power to the new PDU
== Servers & Devices in A2-eqiad ==
https://netbox.wikimedia.org/dcim/racks/2/
Network Devices:
asw2-a2-eqiad
asw-a2-eqiad
msw-a2-eqiad
Servers:
an-worker1078
an-worker1079
cloudelastic1001
conf1001
db1074
db1075 - this is a master, cannot lose power
db1079
db1080
db1081
db1082
db1107
es1011
es1012
kafka1012
kafka1013
kafka1023
kafka-jumbo1002
ms-be1019
ms-be1044
ms-be1045
tungsten