Productionising the IDM:
- Create Ganeti instances and apply the Puppet roles
- Setup monitoring for basic availability of the end point
Productionising the IDM:
Cookbook cookbooks.sre.ganeti.reimage was started by slyngshede@cumin1001 for host idm1001.wikimedia.org with OS bullseye
Cookbook cookbooks.sre.ganeti.reimage started by slyngshede@cumin1001 for host idm1001.wikimedia.org with OS bullseye completed:
Cookbook cookbooks.sre.ganeti.reimage was started by slyngshede@cumin1001 for host idm2001.wikimedia.org with OS bullseye
Cookbook cookbooks.sre.ganeti.reimage started by slyngshede@cumin1001 for host idm2001.wikimedia.org with OS bullseye completed:
Change 890801 had a related patch set uploaded (by Slyngshede; author: Slyngshede):
[operations/puppet@production] C:idm::deployment use Redis password
Change 890801 merged by Slyngshede:
[operations/puppet@production] C:idm::deployment use Redis password
Change 890815 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):
[operations/puppet@production] Set ferm access for redis
Change 890815 merged by Slyngshede:
[operations/puppet@production] Set ferm access for redis
Change 891318 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):
[operations/puppet@production] idm::jobs: Adapt auto restart to only run of idm-rq is active/present
Change 891318 merged by Muehlenhoff:
[operations/puppet@production] idm::jobs: Adapt auto restart to only run of idm-rq is active/present
Mentioned in SAL (#wikimedia-operations) [2023-02-27T14:33:02Z] <jbond@cumin2002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on idm2001.wikimedia.org with reason: host still been configuered - T320797
Mentioned in SAL (#wikimedia-operations) [2023-02-27T14:33:06Z] <jbond@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on idm2001.wikimedia.org with reason: host still been configuered - T320797
Mentioned in SAL (#wikimedia-operations) [2023-02-27T14:33:13Z] <jbond@cumin2002> START - Cookbook sre.hosts.downtime for 7 days, 0:00:00 on idm2001.wikimedia.org with reason: host still been configuered - T320797
Mentioned in SAL (#wikimedia-operations) [2023-02-27T14:33:19Z] <jbond@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 7 days, 0:00:00 on idm2001.wikimedia.org with reason: host still been configuered - T320797
Change 896112 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):
[operations/puppet@production] Add Cumin aliases for IDM
Change 896112 merged by Muehlenhoff:
[operations/puppet@production] Add Cumin aliases for IDM