As an administrator of WDQS, I want reboots to be fast so that I can run cluster-wide operations in a reasonable amount of time.
During a full cluster restart of WDQS for a kernel upgrade, multiple servers took at least 30 minutes to reboot. From the console, the shutdown appears to be waiting to unmount disks. Stopping Blazegraph (both wdqs and categories) and the wdqs-updater before the reboot does not have a significant impact on shutdown time.
Possibly related logs (wdqs1007:/var/log/syslog):
Feb 9 16:21:50 wdqs1007 blkdeactivate[16486]: [SKIP]: unmount of vg0-swap (dm-1) mounted on [SWAP]
Feb 9 16:21:51 wdqs1007 blkdeactivate[16486]: [UMOUNT]: unmounting vg0-srv (dm-2) mounted on /srv... skipping
Feb 9 16:21:51 wdqs1007 blkdeactivate[16486]: [SKIP]: unmount of vg0-root (dm-0) mounted on /
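To compare this behaviour across hosts, the blkdeactivate lines could be summarised with a small script. The following is only a sketch: the regular expression is inferred from the three lines quoted above and may not match other blkdeactivate messages, and the names `LINE_RE` and `parse_blkdeactivate` are hypothetical helpers, not part of any existing tooling.

```python
import re

# Pattern inferred from the three syslog lines quoted in this task only;
# other blkdeactivate message shapes are not handled (assumption).
LINE_RE = re.compile(
    r"blkdeactivate\[\d+\]: \[(?P<action>\w+)\]: .*? "
    r"(?P<device>\S+) \((?P<dm>dm-\d+)\) mounted on "
    r"(?P<mountpoint>\S+?)(?:\.\.\. (?P<result>\w+))?$"
)


def parse_blkdeactivate(line):
    """Return a dict of fields for a matching line, or None if it does not match."""
    match = LINE_RE.search(line.strip())
    return match.groupdict() if match else None


# The three lines from wdqs1007:/var/log/syslog quoted in this task.
SAMPLE = [
    "Feb 9 16:21:50 wdqs1007 blkdeactivate[16486]: [SKIP]: unmount of vg0-swap (dm-1) mounted on [SWAP]",
    "Feb 9 16:21:51 wdqs1007 blkdeactivate[16486]: [UMOUNT]: unmounting vg0-srv (dm-2) mounted on /srv... skipping",
    "Feb 9 16:21:51 wdqs1007 blkdeactivate[16486]: [SKIP]: unmount of vg0-root (dm-0) mounted on /",
]

if __name__ == "__main__":
    for line in SAMPLE:
        print(parse_blkdeactivate(line))
```

Run against the syslog of several hosts, this would show whether the unmount of /srv is skipped everywhere or only on the slow ones.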
AC:
- WDQS servers can be rebooted in < 5 minutes