Hi!
Several times the week the webservice on tools.giftbot becomes not available due to different errors. It doesn't matter whether it's running in the grid, local, on cron-tools, k8s, nothing helps. And I don't know, if it is the webservice at tools.giftbot only or anywhere else.
So I put this question on the labs list: is there a way, to restart the webservice _automatically_, if an error occurs and the webservice not available is? Has it to be monitored, what the webservice makes break its running status? Questions on the freenode channel #wikimedia-labs concerning this problem haven't been replied.
One more trial was it, to check the running status of the webservice by an error return of an interrogation, to restart the webservice automatically, but there is no possibility to restart the webservice by a script. We try to run it on cron-tools now, but without success.
@Magnus asked in his reply: "Is it the webservice that fails, or the bot part of giftbot?" but I think, I have explained it, that the webservice itself does not work without issues or outages. The bot resp. the tool is running properly: https://tools.wmflabs.org/giftbot/weblinksuche.fcgi
My last monitoring shows:
* I check the run status of the webservice every hour. Last OK report was 2017-01-17 09:04:07 UTC
* next OK report should be 2017-01-17 10:04:07 UTC, but was missing
* 2017-01-17 10:43:15 UTC I get a webservice restart report on k8s, but it did not restart, possibly due to connection timeout (I cannot return the error case) or the non-possibility, to start the webservice by a simple script, that does not more as "webservice restart" in bash.
Starting the webservice manually by "webservice restart" in bash works, but because of the many webservice outages the webservice should be started automatically, if an error was detected, by a solution as a script
Thank you very much for an explaining reply ...
@Aklapper: please assign this error report because we need a properly running webservice for a service offered for the entire dewiki community and it's implemented in searching for defective weblinks. Thank you ...