Page MenuHomePhabricator
Paste P15670

search team cirrussearch elasticsearch 10gb networking status
ActivePublic

Authored by RKemper on Fri, Apr 30, 9:50 PM.
ryankemper@cumin1001:~$ sudo -E cumin 'P{elastic*}' 'lshw -class network -short'
72 hosts will be targeted:
elastic[2025-2060].codfw.wmnet,elastic[1032-1067].eqiad.wmnet
Confirm to continue [y/n]? y
===== NODE GROUP =====
(1) elastic2033.codfw.wmnet
----- OUTPUT of 'lshw -class network -short' -----
ssh: connect to host elastic2033.codfw.wmnet port 22: Connection timed out
===== NODE GROUP =====
(6) elastic[2055-2060].codfw.wmnet
----- OUTPUT of 'lshw -class network -short' -----
H/W path Device Class Description
========================================================
/0/101/0 enp59s0f0 network BCM57412 NetXtreme-E 10Gb RDMA Ethernet Controller
/0/101/0.1 enp59s0f1d1 network BCM57412 NetXtreme-E 10Gb RDMA Ethernet Controller
===== NODE GROUP =====
(33) elastic[2037-2054].codfw.wmnet,elastic[1053-1067].eqiad.wmnet
----- OUTPUT of 'lshw -class network -short' -----
H/W path Device Class Description
========================================================
/0/100/1c.5/0 eno1 network NetXtreme BCM5720 Gigabit Ethernet PCIe
/0/100/1c.5/0.1 eno2 network NetXtreme BCM5720 Gigabit Ethernet PCIe
/0/101/0 enp59s0f0 network BCM57412 NetXtreme-E 10Gb RDMA Ethernet Controller
/0/101/0.1 enp59s0f1d1 network BCM57412 NetXtreme-E 10Gb RDMA Ethernet Controller
===== NODE GROUP =====
(32) elastic[2025-2032,2034-2036].codfw.wmnet,elastic[1032-1052].eqiad.wmnet
----- OUTPUT of 'lshw -class network -short' -----
H/W path Device Class Description
====================================================
/0/100/1c.4/0 eno1 network NetXtreme BCM5719 Gigabit Ethernet PCIe
/0/100/1c.4/0.1 eno2 network NetXtreme BCM5719 Gigabit Ethernet PCIe
/0/100/1c.4/0.2 eno3 network NetXtreme BCM5719 Gigabit Ethernet PCIe
/0/100/1c.4/0.3 eno4 network NetXtreme BCM5719 Gigabit Ethernet PCIe
================
PASS |███████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████▊ | 99% (71/72) [00:10<00:00, 6.96hosts/s]
FAIL |████▏ | 1% (1/72) [00:10<12:02, 10.18s/hosts]
1.4% (1/72) of nodes failed to execute command 'lshw -class network -short': elastic2033.codfw.wmnet
98.6% (71/72) success ratio (< 100.0% threshold) for command: 'lshw -class network -short'. Aborting.: elastic[2025-2032,2034-2060].codfw.wmnet,elastic[1032-1067].eqiad.wmnet
98.6% (71/72) success ratio (< 100.0% threshold) of nodes successfully executed all commands. Aborting.: elastic[2025-2032,2034-2060].codfw.wmnet,elastic[1032-1067].eqiad.wmnet