Fix cronspam from /usr/local/sbin/pdns_gmetric
Closed, Resolved, Public

Description

Getting cronspam on hosts: achernar, chromium, maerlant, hydrogen, nescio:

Traceback (most recent call last):
  File "/usr/local/sbin/pdns_gmetric", line 220, in <module>
    main()
  File "/usr/local/sbin/pdns_gmetric", line 216, in main
    submit_metrics(curmetrics)
  File "/usr/local/sbin/pdns_gmetric", line 166, in submit_metrics
    gmetricver = subprocess.Popen(['/usr/bin/gmetric', '-V'], shell=False, stdout=subprocess.PIPE)
  File "/usr/lib/python2.7/subprocess.py", line 710, in __init__
    errread, errwrite)
  File "/usr/lib/python2.7/subprocess.py", line 1335, in _execute_child
    raise child_exception
OSError: [Errno 2] No such file or directory

Do we still need this check? Isn't it Ganglia-only, and Ganglia is gone? Do we need it on these hosts? Depending on the answers, let's do one of the following: install the gmetric binary, add a check so the script only calls gmetric if the binary exists, or remove the cron job.
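For the "check if the binary exists" option, a minimal sketch might look like the following. It is not the actual pdns_gmetric code: the path and the `-V` flag are taken from the traceback above, while `gmetric_version` and `submit_metrics_if_available` are hypothetical names used only for illustration, and the real submission logic is left as a placeholder.

```python
import os
import subprocess

# Path as it appears in the traceback above.
GMETRIC = '/usr/bin/gmetric'


def gmetric_version(binary=GMETRIC):
    """Return the output of `<binary> -V`, or None if the binary is missing.

    Hypothetical helper: checking for the file first avoids the
    OSError (ENOENT) that Popen raises when the binary is absent.
    """
    if not (os.path.isfile(binary) and os.access(binary, os.X_OK)):
        return None
    proc = subprocess.Popen(
        [binary, '-V'],
        shell=False,
        stdout=subprocess.PIPE,
        stderr=subprocess.STDOUT,
    )
    out, _ = proc.communicate()
    return out.decode('utf-8', 'replace').strip()


def submit_metrics_if_available(metrics, binary=GMETRIC):
    """Hypothetical guard: return False without printing anything when
    gmetric is not installed, so cron has nothing to mail."""
    if gmetric_version(binary) is None:
        return False
    # Placeholder: the script's existing submission logic would run here.
    return True
```

With a guard like this, hosts without gmetric would exit silently instead of mailing a traceback every minute. If the check is Ganglia-only, removing the cron job is still the cleaner fix.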

Event Timeline

Note this is pretty noisy, as it's generating output once a minute.

@ArielGlenn Yes, it's Ganglia-only. I started the spam when I removed the Ganglia parts in https://gerrit.wikimedia.org/r/#/c/382929/ as part of the general effort to remove all of Ganglia (T177225).

I did sudo rm /usr/lib/ganglia/python_modules/gdnsd.py ; sudo rm /etc/ganglia/conf.d/gdnsd.pyconf but didn't think about the cron.

Then Faidon saw the cron spam and manually deleted the cron on those hosts.

So this ticket should already be resolved now.

Dzahn claimed this task.

I see now that "labtestcontrol2003" and "labservices100x" were also affected, but the mails stopped 3 days ago and the crons are gone. I see nothing else new. Please reopen if you still get anything.