
Issues reimaging servers in codfw
Closed, ResolvedPublic

Description

I am currently reimaging some old appservers as k8s workers as part of T351074. In the last batch, two servers failed to reimage successfully: mw2369 and mw2367. The hosts get to the PXE boot stage in the BIOS and then fail to pick up a lease, eventually falling back to the currently installed OS.

This might be a re-run of T355333 but, given the rack position of the hosts, it is not a recurrence of the vlan/ip issues seen in T357539.

I've done a racreset on the hosts and attempted re-imaging again, to no avail. Please check the network cabling etc. and firmware versions. Thanks!

Event Timeline

@hnowlan I've replaced the network cable on both of these. These are both connected to a 1G switch so there is no SFP to replace in this case.

If this does not fix the issue lmk and we can upgrade the idrac and bios firmware.

hnowlan claimed this task.

This worked, thank you!