Page MenuHomePhabricator

mc1027.eqiad.wmnet is down, not powering back up
Closed, ResolvedPublic

Description

Icinga reported that mc1027 went down around 2021-03-03 23:23:44 and attemping to power it on via management has been unsuccessful.

</>hpiLO-> power on

status=0
status_tag=COMMAND COMPLETED
Thu Mar  4 06:48:19 2021



Server powering on .......



</>hpiLO-> power

status=0
status_tag=COMMAND COMPLETED
Thu Mar  4 06:58:34 2021



power: server power is currently: Off

Event Timeline

jijiki triaged this task as Unbreak Now! priority.Mar 4 2021, 9:59 AM

@Jclark-ctr or @Cmjohnson please take a look if it is possible to power up this machine (we can coordinate on irc too), If the server is resting in peace, we can close this task since replacements are expected to arrive in eqiad soon. Thank you!

@jijiki I stopped by cage briefly this morning and looked at server. I could not get it to boot could be a bad cpu or main board.

@Jclark-ctr thank you very much, I am closing this task since we have replacements on the way

Mentioned in SAL (#wikimedia-operations) [2021-04-30T08:11:03Z] <moritzm> remove mc1027 from debmonitor, server is broken and won't return (T276415)