Page MenuHomePhabricator

Investigate WDQS categories update failures on Bullseye hosts
Closed, ResolvedPublic

Description

Today, @RKemper noticed that all new WDQS Bullseye hosts have Icinga alerts for load-categories-daily.service and load-dcatap-weekly.service .

Creating this ticket to investigate why these services are failing and fix the problem.

Event Timeline

bking renamed this task from Investigate categories updates failures to Investigate WDQS categories update failures on Bullseye hosts.Jul 17 2023, 9:55 PM
bking updated the task description. (Show Details)
bking added subscribers: RKemper, dcausse.

Investigation so far:
load-categories-daily.service calls

/usr/local/bin/loadCategoriesDaily.sh wdqs

/usr/local/bin/loadCategoriesDaily.sh calls /usr/local/bin/cronUtils.sh which sources variables from either /etc/wdqs/gui_vars.sh or /etc/wdqs/vars.sh.

/etc/wdqs/vars.sh is a broken symlink to ../../srv/deployment/wdqs/wdqs-cache/revs/${HASH}/.git/config-files/etc/query_service/vars.sh

/etc/wdqs/ldf-config.json is also a broken symlink. It doesn't look scap is uploading them during the deployment process. Will investigate further tomorrow.

Per Tuesday's pairing session, if we use the --force flag when we scap deploy , we can fix the issues with incomplete deployment and finally get these things production-ready.

Gehel claimed this task.
Gehel moved this task from Incoming to Done on the Data-Platform-SRE board.