This task will track the migration of the ps1 and ps2 to be replaced with new PDUs in rack A4-eqiad.
Each server & switch will need to have potential downtime scheduled, since this will be a live power change of the PDU towers.
These racks have a single tower for the old PDU (with and A and B side), with the new PDUs having independent A and B towers.
- - schedule downtime for the entire list of switches and servers.
- - Wire up one of the two towers, energize, and relocate power to it from existing/old pdu tower (now de-energized).
- - confirm entire list of switches, routers, and servers have had their power restored from the new pdu tower
- - Once new PDU tower is confirmed online, move on to next steps.
- - Wire up remaining tower, energize, and relocate power to it from existing/old pdu tower (now de-energized).
- - confirm entire list of switches, routers, and servers have had their power restored from the new pdu tower
- - setup all remote configuration options for new pdu. (network, snmp, login, etc...)
List of routers, switches, and servers
device | role | SRE team coordination | notes |
asw2-a4-eqiad | asw | @ayounsi | |
rdb1003 | |||
ms-be1046 | ms-be | @fgiunchedi | |
stat1004 | analytics | Analytics | |
kafka1001 | kafka | @herron | |
restbase1007 | @fgiunchedi | ||
wdqs1003 | wdqs | ||
labstore1006 (and two arrays) | labstore | cloud-services-team | |
ganeti1005 | ganeti node | @akosiaris | host will need to be emptied in advance |
contint1001 | #rel-eng | ||
oresrdb1002 | ores | @akosiaris | Fine to reboot at anytime. Caution: Not the case with oresrdb1001 |
netmon1002 | |||
lvs1003 | lvs | Traffic | |
lvs1002 | lvs | Traffic | |
lvs1001 | lvs | Traffic | |
cp1076 | cp | Traffic | |
rhenium | |||
db1111 | db | DBA | |
conf1004 | zookeeper/etcd | serviceops Analytics | |
ms-fe1006 | ms-fe | @fgiunchedi | |
cp1075 | cp | Traffic | |
labservices1002 | labservices | cloud-services-team | |
an-worker1080 | analytics | Analytics | |
maps1001 | |||
oxygen | |||
analytics1070 | analytics | Analytics | |
snapshot1005 | |||
kubestage1001 | kubernetes staging | serviceops | |
logstash1004 | |||
scb1001 | |||
aqs1004 | analytics | Analytics | |
druid1001 | analytics | Analytics |