Run everything in a tmux session named switchover
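A minimal sketch of how the shared session could be opened. The helper name switchover_session is hypothetical and not part of any existing tooling; it only wraps tmux's new-session command, whose -A flag attaches to the session if it already exists instead of failing.

```shell
# Hypothetical helper: attach to the tmux session "switchover",
# creating it first if it does not exist yet.
# -A makes new-session behave like attach-session for an existing session.
# -s sets the session name (defaults to "switchover" when no argument is given).
switchover_session() {
    tmux new-session -A -s "${1:-switchover}"
}
```

Calling switchover_session on the host you are working from drops you into the session; if the connection is lost, calling it again reattaches, so long-running commands keep running.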
Services
- scap lock --all "Datacenter Switchover: Services & Traffic - T346330" on deploy1002
- sudo cookbook sre.discovery.datacenter depool eqiad --all --reason "Datacenter Switchover: Services" --task-id T346330 on cumin1001
Traffic
- merge https://gerrit.wikimedia.org/r/c/operations/dns/+/958920
- run authdns-update
Deployment server
- log SAL: !log Switch deployment server - T346330
- sudo cumin 'R:class = role::deployment_server' 'disable-puppet "Switchover of the deployment server"'
- merge https://gerrit.wikimedia.org/r/c/operations/dns/+/957734
- merge https://gerrit.wikimedia.org/r/c/operations/puppet/+/957736
- Run puppet on deploy2002.codfw.wmnet: sudo cumin deploy2002.codfw.wmnet 'run-puppet-agent --enable "Switchover of the deployment server"'
- Run puppet on all other deployment servers: sudo cumin 'R:class = role::deployment_server' 'run-puppet-agent --enable "Switchover of the deployment server"'
- Run puppet on alert*: sudo cumin 'A:icinga' 'run-puppet-agent -q'
- Cronjob check: sudo cumin deploy2002.codfw.wmnet 'systemctl list-units | grep -A1 sync_deployment_dir' (out of date docs, TODO fix)
- remove scap lock
- Test scap deployment: cd /srv/mediawiki-staging; scap sync-world "check the deployment server after switchover"
- Test scap3 deployments work (restbase?) => nope, due to T346354
- email ops@ and wikitech-l about the switch