I noticed that every now and then the httpbb_hourly_appserver.service service fails on cumin2002 due to Read timed out. (read timeout=10) for the test on https://meta.wikimedia.org/wiki/List_of_Wikipedias.
Here a full log:
Started Run httpbb appserver/ tests hourly on mw2271.codfw.wmnet. Sending to mw2271.codfw.wmnet... https://meta.wikimedia.org/wiki/List_of_Wikipedias (/srv/deployment/httpbb-tests/appserver/test_main.yaml:212) ERROR: HTTPSConnectionPool(host='mw2271.codfw.wmnet', port=443): Read timed out. (read timeout=10) === ERRORS: 124 requests attempted to mw2271.codfw.wmnet. Errors connecting to 1 host. httpbb_hourly_appserver.service: Main process exited, code=exited, status=1/FAILURE httpbb_hourly_appserver.service: Failed with result 'exit-code'. httpbb_hourly_appserver.service: Consumed 2.410s CPU time.
The occurrences in the current journal:
Nov 13 06:38:40 Nov 13 15:38:40 Nov 17 18:40:42 Nov 18 10:36:59 Nov 19 01:37:41 Nov 19 19:37:41 Nov 20 19:38:41 Nov 23 09:03:41 Nov 23 14:03:41
It seems transient, and it seems to happen only on codfw, the same unit on cumin1001 doesn't have any of those read timeout failures.
If the time to generate that page from codfw is correct then maybe the timeout should be increased a bit.