Page MenuHomePhabricator

Write an icinga check to ensure that toollabs instances are appropriately distributed across labvirt** hosts
Closed, ResolvedPublic

Description

We have a lot of redundancies in place now, but that's useless if the instances are in the same underlying host. While OpenStack rules can be a solution for this later, put an icinga check in place now to ensure this doesn't happen much.

Event Timeline

yuvipanda claimed this task.
yuvipanda raised the priority of this task from to High.
yuvipanda updated the task description. (Show Details)
yuvipanda moved this task from To Do to Doing on the Labs-Sprint-100 board.
yuvipanda added subscribers: hashar, mark, Aklapper and 5 others.

Change 216661 had a related patch set uploaded (by Yuvipanda):
openstack: Add a check to see if Tool Labs instances are spread enough

https://gerrit.wikimedia.org/r/216661

Change 216661 merged by Yuvipanda:
openstack: Add a check to see if Tool Labs instances are spread enough

https://gerrit.wikimedia.org/r/216661

PROBLEM - Tool Labs instance distribution on virt1000 is CRITICAL master class instances not spread out enough done!

Better metric at T101725, and actually fixing the alert tracked at T101636