cloudvirtlocal1001.eqiad.wmnet tends to get stuck on boot
Closed, Resolved, Public

Description

Now and then cloudvirtlocal1001 won't boot; it just gets stuck on the boot screen.

cloudvirtlocal1001.png (1×2 px, 678 KB)

Event Timeline

Change 908848 had a related patch set uploaded (by Andrew Bogott; author: Andrew Bogott):

[operations/puppet@production] Move cloudvirtlocal1001 back to 'insetup'

https://gerrit.wikimedia.org/r/908848

Change 908848 merged by Andrew Bogott:

[operations/puppet@production] Move cloudvirtlocal1001 back to 'insetup'

https://gerrit.wikimedia.org/r/908848

When I run

cookbook sre.hosts.dhcp --os bullseye cloudvirtlocal1001

I am able to reboot the server as many times as I want and hit F12, and the server does not hang.

When I run

sudo cookbook sre.hosts.reimage

If I cancel it while the OS install is in progress and re-run it, the server hangs after it reboots.
If I let the OS install complete, then cancel the cookbook and re-run it, the server doesn't hang after it reboots.

@Papaul Updated Netbox; the server has been relocated to another switch.

@Andrew see above: the cable has been moved to cloudsw1-c8-eqiad.

The first re-image didn't hang. I cancelled it while the installation was in progress and relaunched the re-image; the second time there was no hang either. It looks like moving the server interface from cloudsw2-d5 to cloudsw1-d5 fixed the issue.