
Installation issues on PowerEdge R440 Kafka main eqiad servers with buster / firmware update needed
Closed, Resolved (Public)

Description

Hi folks!

This is basically a clone of T297422, but for the kafka-main100[1-3] nodes. We need to upgrade BIOS, NIC, etc. to allow a clean reimage of the nodes to Buster (already tested in codfw, where it worked nicely).
Let me know a time that works for you. I need about 10 minutes beforehand to stop the daemons, and we cannot have more than one node down at a time (10-15 minutes need to pass between one node and the next to let them recover).

Thanks!

Hosts to upgrade:

kafka-main1001
kafka-main1002
kafka-main1003
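
For planning purposes, the timing constraints above (10 minutes of prep per node, one node down at a time, about 15 minutes of recovery between nodes) can be sketched as a small schedule calculator. This is only a rough illustration: the `rolling_schedule` helper, the `Window` type, and the 30-minute duration for the firmware work are assumptions made for the example, not figures from this task.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone


@dataclass(frozen=True)
class Window:
    host: str
    stop_daemons_at: datetime  # start stopping kafka daemons (prep time)
    maintenance_at: datetime   # host is handed over for firmware upgrades
    back_by: datetime          # assumed end of the firmware work


def rolling_schedule(hosts, first_maintenance_at,
                     prep_min=10, work_min=30, recovery_min=15):
    """Plan one-node-at-a-time windows.

    prep_min and recovery_min follow the task description;
    work_min is a hypothetical placeholder for the upgrade duration.
    """
    windows = []
    maintenance_at = first_maintenance_at
    for host in hosts:
        back_by = maintenance_at + timedelta(minutes=work_min)
        windows.append(Window(
            host=host,
            stop_daemons_at=maintenance_at - timedelta(minutes=prep_min),
            maintenance_at=maintenance_at,
            back_by=back_by,
        ))
        # The next host's daemons are stopped only after the previous
        # host has had recovery_min minutes to recover.
        maintenance_at = back_by + timedelta(minutes=recovery_min + prep_min)
    return windows


if __name__ == "__main__":
    hosts = ["kafka-main1001", "kafka-main1002", "kafka-main1003"]
    # Example start time only; adjust to whatever is agreed with dcops.
    start = datetime(2022, 1, 12, 15, 0, tzinfo=timezone.utc)
    for w in rolling_schedule(hosts, start):
        print(w.host, w.stop_daemons_at.isoformat(), w.back_by.isoformat())
```

With these assumed durations, no two hosts are ever down at once, and each host's daemons are stopped exactly `recovery_min` minutes after the previous host is expected back.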

Event Timeline

@elukey Can we plan to do this tomorrow (12 Jan), starting around 15:00 UTC?

Mentioned in SAL (#wikimedia-operations) [2022-01-12T15:14:09Z] <elukey> stop kafka* on kafka-main1001 to allow dcops maintenance (nic/bios upgrades) - T298867

Mentioned in SAL (#wikimedia-operations) [2022-01-12T16:02:11Z] <elukey> stop kafka* on kafka-main1002 to allow dcops maintenance (nic/bios upgrades) - T298867

Mentioned in SAL (#wikimedia-operations) [2022-01-12T16:25:24Z] <elukey> stop kafka* on kafka-main1003 to allow dcops maintenance (nic/bios upgrades) - T298867

All 3 servers have been updated.