
cloudcephosd1025 won't reimage
Closed, Resolved (Public)

Description

As part of the cleanup for T348643 I'm trying to put cloudcephosd1025 back into service, but I can't reimage it. It fails during PXE boot.

I upgraded the NIC firmware to the approved version (21.85.21.92), but it didn't help.

Related Objects

Status      Assigned
Resolved    dcaro
Resolved    taavi
Resolved    Jclark-ctr

Event Timeline

I see the drives are not set up correctly, and some are listed in a foreign state.

Cleared the foreign configs and converted the drives to non-RAID.

@Jclark-ctr The cables are plugged into the wrong switch ports: NIC 1 is connected to port xe-0/0/21 and NIC 2 is connected to port xe-0/0/20. It should be the other way around; see Netbox:
https://netbox.wikimedia.org/dcim/devices/3980/interfaces/
The output on the switch shows that the MAC address ending in 91, which is NIC 2, is connected to xe-0/0/20:

cloudsw1-e4-eqiad> show ethernet-switching table interface xe-0/0/20 

MAC database for interface xe-0/0/20

MAC database for interface xe-0/0/20.0

MAC flags (S - static MAC, D - dynamic MAC, L - locally learned, P - Persistent static, C - Control MAC
           SE - statistics enabled, NM - non configured MAC, R - remote PE MAC, O - ovsdb MAC)


Ethernet switching table : 1 entries, 1 learned
Routing instance : default-switch
    Vlan                MAC                 MAC         Age    Logical                NH        RTR 
    name                address             flags              interface              Index     ID
    cloud-hosts1-e4-eqiad xx:xx:xx:4d:91 D             -   xe-0/0/20.0            0         0
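
The check above (compare the port where the switch learned a MAC against the port Netbox documents for that NIC) could be scripted roughly as follows. This is only a sketch: the function names, the regular expression, and the `EXPECTED` mapping are hypothetical, and the expected port for NIC 2 is typed in by hand from the comment above rather than fetched from Netbox.

```python
import re

# Hypothetical pattern for one data row of `show ethernet-switching table`.
# Accepts "xx" octets because the MAC in this ticket is partially redacted.
ROW = re.compile(
    r"^\s*(?P<vlan>\S+)\s+"
    r"(?P<mac>(?:[0-9a-fx]{2}:)+[0-9a-fx]{2})\s+"
    r"(?P<flags>\S+)\s+\S+\s+"
    r"(?P<ifname>[a-z]+-\d+/\d+/\d+)\.\d+",
    re.IGNORECASE,
)

# Trimmed copy of the switch output pasted in this task.
SAMPLE = """\
Ethernet switching table : 1 entries, 1 learned
Routing instance : default-switch
    Vlan                MAC                 MAC         Age    Logical                NH        RTR
    name                address             flags              interface              Index     ID
    cloud-hosts1-e4-eqiad xx:xx:xx:4d:91 D             -   xe-0/0/20.0            0         0
"""

# Assumption: NIC 2 (MAC ending 4d:91) belongs on xe-0/0/21, as stated in the comment.
EXPECTED = {"4d:91": "xe-0/0/21"}


def learned_macs(output):
    """Return {mac: physical_port} for every data row in the switch output."""
    found = {}
    for line in output.splitlines():
        match = ROW.match(line)
        if match:
            found[match.group("mac").lower()] = match.group("ifname")
    return found


def miscabled(output, expected):
    """List (mac, actual_port, expected_port) for MACs seen on the wrong port."""
    problems = []
    for mac, port in learned_macs(output).items():
        for suffix, want in expected.items():
            if mac.endswith(suffix) and port != want:
                problems.append((mac, port, want))
    return problems


if __name__ == "__main__":
    for mac, actual, want in miscabled(SAMPLE, EXPECTED):
        print(f"{mac}: learned on {actual}, expected {want}")
```

Matching on the MAC suffix keeps the sketch usable with the redacted address shown here; with full addresses, an exact comparison would be safer.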

Cookbook cookbooks.sre.hosts.reimage was started by jclark@cumin1002 for host cloudcephosd1025.eqiad.wmnet with OS bookworm

Jclark-ctr claimed this task.

Server completed Reimage by andrew