
New analytic hosts with BBU learning cycle enabled
Closed, Resolved | Public | 3 Estimated Story Points

Description

Hello,

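Today analytics1062 and analytics1063 (and possibly analytics1067) started a learning cycle on their BBU (battery backup unit). During the cycle the RAID controller cache policy goes from WB (write-back) to WT (write-through), which will probably affect performance.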

root@analytics1062:~# megacli -AdpBbuCmd -aAll -nolog | tail -n9

BBU Properties for Adapter: 0

  Auto Learn Period: 90 Days
  Next Learn time: Mon Sep 11 15:27:48 2017
  Learn Delay Interval:0 Hours
  Auto-Learn Mode: Transparent
root@analytics1063:~# megacli -AdpBbuCmd -aAll -nolog

BBU status for Adapter: 0

BatteryType: BBU
Voltage: 3766 mV
Current: -95 mA
Temperature: 51 C
Battery State: Optimal
BBU Firmware Status:

  Charging Status              : Discharging
  Voltage                                 : OK
  Temperature                             : OK
  Learn Cycle Requested	                  : Yes
  Learn Cycle Active                      : Yes
  Learn Cycle Status                      : OK
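
For checking this on more than one host, the "Learn Cycle Active" field can be extracted from the same output. The following is only a minimal sketch: `learn_cycle_state` is a hypothetical helper name, it reads the megacli output on stdin, and it assumes the field layout shown above.

```shell
#!/bin/sh
# Hypothetical helper (not part of megacli): reads the output of
# `megacli -AdpBbuCmd -aAll -nolog` on stdin and prints
#   active  - at least one adapter reports "Learn Cycle Active : Yes"
#   idle    - the field was found, but no adapter reports Yes
#   unknown - the field was not found at all
learn_cycle_state() {
    awk -F':' '
        /Learn Cycle Active/ {
            seen = 1
            gsub(/[ \t\r]/, "", $2)   # strip padding around the value
            if ($2 == "Yes") active = 1
        }
        END {
            if (!seen)       print "unknown"
            else if (active) print "active"
            else             print "idle"
        }
    '
}
```

Possible usage on a host: `megacli -AdpBbuCmd -aAll -nolog | learn_cycle_state`.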

analytics1067 reported an error (T167797). It might have been a temporary error during the battery discharge and charge process, but the host does have auto-learn enabled like the other ones:

root@analytics1067:~# megacli -AdpBbuCmd -aAll -nolog | tail -n9

BBU Properties for Adapter: 0

  Auto Learn Period: 90 Days
  Next Learn time: Mon Sep 11 13:19:54 2017
  Learn Delay Interval:0 Hours
  Auto-Learn Mode: Transparent

The main goal of this task is to let Analytics know about this situation; it might be okay for them. If so, feel free to close the task!

Event Timeline

Nuria subscribed.

Putting on kanban for @elukey to look at

Mentioned in SAL (#wikimedia-operations) [2017-06-19T12:04:42Z] <elukey> run 'echo "autoLearnMode=1" > /tmp/disable_learn && megacli -AdpBbuCmd -SetBbuProperties -f /tmp/disable_learn -a0' on all the analytics workers to disable BBU Auto learn - T167809
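
The one-liner from the SAL entry can be wrapped so that the properties file does not linger in /tmp. This is a sketch only: `disable_auto_learn` and the `MEGACLI` override variable are hypothetical names, and the `autoLearnMode=1` setting and megacli flags are copied from the SAL entry rather than checked against vendor documentation.

```shell
#!/bin/sh
# Hypothetical wrapper around the command logged in SAL.
# MEGACLI is an assumed override for the binary name or path.
disable_auto_learn() {
    props_file=$(mktemp) || return 1
    # Per the SAL entry, autoLearnMode=1 disables BBU auto learn.
    echo "autoLearnMode=1" > "$props_file"
    "${MEGACLI:-megacli}" -AdpBbuCmd -SetBbuProperties -f "$props_file" -a0
    status=$?
    rm -f "$props_file"     # clean up regardless of the outcome
    return "$status"
}
```

Like the original command, it targets adapter 0 only (`-a0`).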

elukey set the point value for this task to 3. (Jun 19 2017, 12:17 PM)
elukey moved this task from Next Up to Done on the Analytics-Kanban board.
elukey@neodymium:~$ sudo cumin 'R:class = role::analytics_cluster::hadoop::worker' 'megacli -AdpBbuCmd -GetBbuProperties -aALL -nolog | grep "Auto-Learn Mode"'

===== NODE GROUP =====
(42) analytics[1028-1069].eqiad.wmnet
----- OUTPUT of 'megacli -AdpBbuC...Auto-Learn Mode"' -----
  Auto-Learn Mode: Disabled
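
If the check needs to be repeated later, the grep output can be turned into a pass/fail result. A minimal sketch, assuming the "Auto-Learn Mode: ..." line format shown above; `all_auto_learn_disabled` is a hypothetical helper name, not a cumin or megacli feature.

```shell
#!/bin/sh
# Hypothetical helper: reads megacli (or cumin-wrapped) output on stdin
# and succeeds only if at least one "Auto-Learn Mode:" line was seen
# and every such line reports "Disabled".
all_auto_learn_disabled() {
    awk -F': *' '
        # Anchor on the field itself so that the cumin header line,
        # which echoes the grep pattern, is not mistaken for a result.
        /^[ \t]*Auto-Learn Mode:/ {
            seen = 1
            sub(/[ \t\r]+$/, "", $2)
            if ($2 != "Disabled") bad = 1
        }
        END {
            if (seen && !bad) exit 0
            exit 1
        }
    '
}
```

Possible usage: `megacli -AdpBbuCmd -GetBbuProperties -aALL -nolog | all_auto_learn_disabled && echo OK`.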