We need a salt master in codfw, for redundancy if for no other reason. Our tools all assume a single salt master. At a minimum, we need a script that accepts or deletes keys on both masters. There is probably also code work needed for the minion to behave properly at connection time if one master is down (retrying at periodic intervals, for example). What else? We need to get a list together.
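A minimal sketch of the key script described above, assuming the change is applied by running `salt-key` over ssh on each master. The ssh transport, the function names, and the host names in the usage note are illustrative assumptions, not existing tooling. The point it shows is that every master is attempted even if one is down, and the failures are reported so they can be retried.

```python
import subprocess

# salt-key flags: -a accepts a pending key, -d deletes a key, -y skips the prompt.
ACTIONS = {"accept": "-a", "delete": "-d"}


def build_commands(action, minion_id, masters):
    """Return one ssh command (as an argv list) per master."""
    flag = ACTIONS[action]  # raises KeyError for anything but accept/delete
    return [["ssh", master, "salt-key", "-y", flag, minion_id] for master in masters]


def apply_to_all_masters(action, minion_id, masters, runner=subprocess.call):
    """Run the key change on every master and return the masters that failed.

    `runner` takes an argv list and returns an exit code; it is a parameter
    so the logic can be exercised without touching real hosts.
    """
    failed = []
    for master, cmd in zip(masters, build_commands(action, minion_id, masters)):
        # Keep going after a failure so one unreachable master
        # does not block the change on the other.
        if runner(cmd) != 0:
            failed.append(master)
    return failed
```

With two hypothetical masters, `apply_to_all_masters("accept", "host1.example", ["master-a.example", "master-b.example"])` returns an empty list when both accept the key, or the names of the masters that still need the change.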
Description
Status | Subtype | Assigned | Task
---|---|---|---
Resolved | | ArielGlenn | T125752 setup/deploy sarin(WMF5851) as a salt master in codfw
Declined | | ArielGlenn | T126000 determine all fixes to scripts, tools needed for salt multimaster deployment
Event Timeline
Should find all references to refresh_pillar and sync_all and verify they will Do The Right Thing.
modules/salt/templates/reactors/auth.sls.erb may be ok, but double-check.
grain-ensure.py is probably ok, but double-check.
modules/puppetmaster/templates/certcleanup.py is almost certainly wrong and needs to be fixed.
modules/puppetmaster/files/wmf-reimage needs fixup; there is a ticket for it, T124761.
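One way to gather the references to refresh_pillar and sync_all for review is a small tree search. This is only a sketch: the function name and the choice to scan every file under a checkout root are assumptions, and a plain recursive grep would do the same job.

```python
import os
import re

# Match either function name as a whole word, e.g. "saltutil.sync_all".
PATTERN = re.compile(r"\b(refresh_pillar|sync_all)\b")


def find_references(root):
    """Yield (path, line_number, line) for each line mentioning either name."""
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in sorted(filenames):
            path = os.path.join(dirpath, name)
            try:
                # errors="replace" keeps binary or oddly encoded files from aborting the scan.
                with open(path, encoding="utf-8", errors="replace") as handle:
                    for number, line in enumerate(handle, start=1):
                        if PATTERN.search(line):
                            yield path, number, line.rstrip("\n")
            except OSError:
                continue  # unreadable file: skip it
```

Each hit still has to be read by hand to decide whether the call behaves correctly with two masters.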