Page MenuHomePhabricator

cloudvirt1019: hpssacli not found
Open, Stalled, Needs TriagePublic

Description

See T306354: hpssacli and hpssaducli not available on Debian Bullseye. This machine is due to be refreshed {T311863} soon.

Jul 27 20:01:16 cloudvirt1019 systemd[1]: Started Collect SMART information from all physical disks and report as Prometheus metrics.
Jul 27 20:01:16 cloudvirt1019 smart-data-dump[252566]: Command '['/usr/bin/timeout', '60', '/usr/sbin/hpssacli', 'controller', 'all', 'show', 'config', 'detail']' returned non-zero exit status 127.
                                                       Traceback (most recent call last):
                                                         File "/usr/local/sbin/smart-data-dump", line 123, in _check_output
                                                           return subprocess.check_output(cmd, stderr=stderr) \
                                                         File "/usr/lib/python3.9/subprocess.py", line 424, in check_output
                                                           return run(*popenargs, stdout=PIPE, timeout=timeout, check=True,
                                                         File "/usr/lib/python3.9/subprocess.py", line 528, in run
                                                           raise CalledProcessError(retcode, process.args,
                                                       subprocess.CalledProcessError: Command '['/usr/bin/timeout', '60', '/usr/sbin/hpssacli', 'controller', 'all', 'show', 'config', 'detail']' returned non-zero exit status 127.
Jul 27 20:01:16 cloudvirt1019 smart-data-dump[252566]: Failed to scan for hpsa physical disks
                                                       Traceback (most recent call last):
                                                         File "/usr/local/sbin/smart-data-dump", line 169, in hpsa_list_pd
                                                           raw_output = _check_output('/usr/sbin/hpssacli controller all show config detail')
                                                         File "/usr/local/sbin/smart-data-dump", line 123, in _check_output
                                                           return subprocess.check_output(cmd, stderr=stderr) \
                                                         File "/usr/lib/python3.9/subprocess.py", line 424, in check_output
                                                           return run(*popenargs, stdout=PIPE, timeout=timeout, check=True,
                                                         File "/usr/lib/python3.9/subprocess.py", line 528, in run
                                                           raise CalledProcessError(retcode, process.args,
                                                       subprocess.CalledProcessError: Command '['/usr/bin/timeout', '60', '/usr/sbin/hpssacli', 'controller', 'all', 'show', 'config', 'detail']' returned non-zero exit status 127.
Jul 27 20:01:16 cloudvirt1019 smart-data-dump[252566]: Traceback (most recent call last):
Jul 27 20:01:16 cloudvirt1019 smart-data-dump[252566]:   File "/usr/local/sbin/smart-data-dump", line 459, in <module>
Jul 27 20:01:16 cloudvirt1019 smart-data-dump[252566]:     sys.exit(main())
Jul 27 20:01:16 cloudvirt1019 smart-data-dump[252566]:   File "/usr/local/sbin/smart-data-dump", line 437, in main
Jul 27 20:01:16 cloudvirt1019 smart-data-dump[252566]:     for pd in handler():
Jul 27 20:01:16 cloudvirt1019 smart-data-dump[252566]: TypeError: 'NoneType' object is not iterable
Jul 27 20:01:16 cloudvirt1019 systemd[1]: export_smart_data_dump.service: Main process exited, code=exited, status=1/FAILURE
Jul 27 20:01:16 cloudvirt1019 systemd[1]: export_smart_data_dump.service: Failed with result 'exit-code'.

Related Objects

Event Timeline

cloudvirt1020 is a similar machine of the same age, and doesn't exhibit this behavior.

nskaggs changed the task status from Open to Stalled.Sep 8 2022, 4:50 PM

As this is up for decommissioning, not planning on resolving unless necessary.

Change 906554 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[operations/puppet@production] smart: Disable smart-dump for servers with hpsa

https://gerrit.wikimedia.org/r/906554