restbase2009 has been down since 2019-07-07 8:40, on power off / on a message appeared on console about battery failure from the controller (to be investigated)
Description
Event Timeline
Indeed the server is not showing the Smart Storage Battery status. Lets try to upgrade the server firmware since the last upgrade was from 2015.
@fgiunchedi Let me know when we can depool this server for firmware upgrade.
Thanks
After Firmware upgrade, we still have the Smart storage battery problem since the server is out of warranty we can not have the part replaced.
I'm not positive I understand the implications of that.
As far as I know, the host went down, and was rebooted through the management console. Later it was taken down again for a firmware update to address the smart storage battery fault (which did not clear the fault). As of this moment, the machine is up and running. What would marking it inactive do?
@Eevans I was under the impression we have more work to be done on the server. Shall we mark this task as resolved?
I was under that impression too. @PPaul's last comment indicated that there is a problem with the Smart storage battery, but that the machine is out of warranty. What do we do in such a situation?
@jijiki I will talking to @wiki_willy to see what are our options on this.
@wiki_willy this system is out if warranty since April 2019 and we do have a problem with the Smart storage battery. The option I have here is, We do have 5 HP servers that were decom I can look and see if those servers have the same Smart storage battery with this system. In case this is not the case, can you please advice.
Thank you.
@Papaul - if you can't find a spare from any of those decom servers, we can order it, since it's still a while before the 5yr mark.
Thanks
Willy
Mentioned in SAL (#wikimedia-operations) [2019-08-05T14:06:16Z] <jijiki> Depool and restart restbase2009 for maint - T227408
@jijiki please repool the server when you have a minute. We will have to order a new Storage battery for the server since all the decom HP servers are GEN8 and this one is a GEN9 so different storage battery.
Thanks.
Mentioned in SAL (#wikimedia-operations) [2019-08-05T17:33:42Z] <jijiki> Pool restbase2009 - T227408
Re-open as this isn't really complete yet, the battery came in and replacement is proceeding. Since @jijiki did this before and claims it's just a depool command, we'll go with that again :)
Mentioned in SAL (#wikimedia-operations) [2019-09-11T17:05:11Z] <bblack> restbase2009 - depool for hardware work - T227408
Mentioned in SAL (#wikimedia-operations) [2019-09-11T17:07:20Z] <bblack> restbase2009 - shutdown for hardware work - T227408
Smart storage replacement complete.
Embedded HPE Smart Storage Battery 875241-B21 878643-001 6WQXL0BB2BQ4H8 01 0.60 OK
Mentioned in SAL (#wikimedia-operations) [2019-09-11T17:27:32Z] <bblack> restbase2009 - re-pool - T227408