please depool and let it catch up
https://grafana.wikimedia.org/d/000000489/wikidata-query-service?orgId=1&viewPanel=8&from=now-30m&to=now&refresh=1d
Description
Description
Event Timeline
Comment Actions
I was looking at Icinga for other reasons and noticed:
wdqs1004 - "..Query Service HTTP Port on wdqs1004 is CRITICAL: HTTP CRITICAL: HTTP/1.1 503 Service Unavailable ".
(unhandled CRIT since about 18 hours, does it have notifications?)
I did a systemctl restart wdqs-blazegraph and that caused:
RECOVERY - Query Service HTTP Port on wdqs1004 is OK: HTTP OK: HTTP/1.1 200 OK
but in turn also a new:
<+icinga-wm> PROBLEM - WDQS high update lag on wdqs1004 is CRITICAL: 1.224e+05 ge....
https://wikitech.wikimedia.org/wiki/Wikidata_query_service/Runbook%23Update_lag told me to also restart the wdqs-updater service, so I did that.
When that did not seem to immediately resolve it I also depooled the server as the docs above say to do until it catches up.
Reusing this ticket.
2021-09-20
22:14 mutante: wdqs1004 - depool
22:10 mutante: wdqs1004 - service wdqs-updater restart
22:06 mutante: wdqs1004 - HTTP/1.1 503 Service Unavailable - systemctl restart wdqs-blazegraph