Page MenuHomePhabricator

Gitlab-replica switchover (gitlab1004 → gitlab1003)
Closed, ResolvedPublic

Description

Docs: https://wikitech.wikimedia.org/wiki/GitLab/Failover

Checklist:

Preparations before downtime:

  • prepare the required Puppet changes (patch)
  • Prepare the required DNS changes (patch)
  • apply gitlab-settings to gitlab1003 and gitlab1004 (merge request)
  • announce downtime some days ahead on ops/releng list/broadcast message not needed for replicas
  • run a failover backup on the source host one day in advance sudo /srv/gitlab-backup/gitlab-backup.sh failover - we should double check what is the purpose of this

Scheduled downtime:

  • Announce downtime in #wikimedia-gitlab
  • Start gitlab failover cookbook on the cumin host with cookbook sre.gitlab.failover --switch-from gitlab1004 --switch-to gitlab1003 -t T400121
  • When prompted, merge the puppet change prepared above
  • When prompted, merge the DNS change prepared above
  • run authdns-update on the DNS master, following the DNS update instructions

Event Timeline

ABran-WMF changed the task status from Open to In Progress.Jul 22 2025, 7:34 AM
ABran-WMF triaged this task as High priority.
ABran-WMF moved this task from Incoming to Work in Progress on the collaboration-services board.
ABran-WMF updated the task description. (Show Details)

Change #1171537 had a related patch set uploaded (by Arnaudb; author: Arnaudb):

[operations/dns@master] Gitlab: switchover between gitlab-replica-a and gitlab-replica-b

https://gerrit.wikimedia.org/r/1171537

Change #1171539 had a related patch set uploaded (by Arnaudb; author: Arnaudb):

[operations/puppet@production] Gitlab: switchover between gitlab-replica-a and gitlab-replica-b

https://gerrit.wikimedia.org/r/1171539

ABran-WMF updated the task description. (Show Details)

Cookbook cookbooks.sre.gitlab.failover (Failover of gitlab from gitlab1004.wikimedia.org to gitlab1003.wikimedia.org) started

Change #1171539 merged by Arnaudb:

[operations/puppet@production] Gitlab: switchover between gitlab-replica-a and gitlab-replica-b

https://gerrit.wikimedia.org/r/1171539

Change #1171537 merged by Arnaudb:

[operations/dns@master] Gitlab: switchover between gitlab-replica-a and gitlab-replica-b

https://gerrit.wikimedia.org/r/1171537

Cookbook cookbooks.sre.gitlab.failover (Failover of gitlab from gitlab1004.wikimedia.org to gitlab1003.wikimedia.org) finished

Change #1172015 had a related patch set uploaded (by Jelto; author: Jelto):

[operations/cookbooks@master] gitlab failover: improve message for API token

https://gerrit.wikimedia.org/r/1172015

Change #1172015 merged by jenkins-bot:

[operations/cookbooks@master] gitlab failover: improve message for API token

https://gerrit.wikimedia.org/r/1172015