Now that we have moved a majority of jobs to k8s, we can take some hardware back from the jobrunner cluster. Based on the graphs it seems we can reclaim at least 50% of the jobrunner hardware while refreshLinks and cirrusSearchLinksUpdate remain on metal, but we should be careful to account for spikes and avoid letting the remaining hosts be overwhelmed, so a good start is 25%.
As of time of writing, there are 30 jobrunners in codfw, and (minus parse1* hosts), 23 in eqiad. Take care not to repurpose videoscaler-only hosts.
Initial candidates for ~25% migration with rack placement:
- mw1460 (A8)
- mw1461 (A8)
- mw1469 (B6)
- mw1439 (D8)
- mw1486 (C5)
- mw1495 (F)
- mw2395 (A3)
- mw2427 (A6)
- mw2267 (B3)
- mw2430 (B6)
- mw2357 (C6)
- mw2282 (D4) (blocked T355333)
- mw2446 (D5)
Further migrations:
- mw2260 B3
- mw2355 C6
- mw2381 A3
- mw2429 B6
- mw2445 D5
- mw1440 D8
- mw1457 A8
- mw1466 B6
- mw1482 C5
- mw1459 A8
Final servers (excluding videoscalers and hardware that is out of warranty):
- mw1437.eqiad.wmnet (canary)
- mw1438.eqiad.wmnet (canary)
- mw1458.eqiad.wmnet
- mw1467.eqiad.wmnet
- mw1468.eqiad.wmnet
- mw1483.eqiad.wmnet
- mw1484.eqiad.wmnet
- mw1485.eqiad.wmnet
- mw1494.eqiad.wmnet
- mw2351.codfw.wmnet
- mw2353.codfw.wmnet
- mw2382.codfw.wmnet
- mw2394.codfw.wmnet
- mw2419.codfw.wmnet
- mw2426.codfw.wmnet
- mw2428.codfw.wmnet
- mw2444.codfw.wmnet