
Gerrit failover process
Open, Needs Triage, Public

Description

In T387833: Gerrit switchover process we created a switchover process as highlighted in T411583: Gerrit backups are growing.
The parent task has been renamed to match its scope and this will help us track and review the failover process.

Event Timeline

ABran-WMF moved this task from Incoming to Backlog on the collaboration-services board.
ABran-WMF added a project: Gerrit.

Now that Gerrit is (more or less) behind the CDN (T411895), actually changing backends in the future will come down to the following step.

common/profile/trafficserver/backend.yaml configures:

target: http://gerrit.wikimedia.org
replacement: https://gerrit.discovery.wmnet

and the DNS repo, in 'dns/templates/wmnet' under the DISCOVERY section, configures:

gerrit 300 IN CNAME gerrit1003.wikimedia.org.

So changing that DNS discovery record is the actual switch.
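To illustrate what that edit amounts to, here is a minimal sketch that rewrites the CNAME target for one record in a zone snippet. It assumes the record is a single plain line in the form quoted above. The function name `switch_cname` and the target `gerrit-standby.example.org` are hypothetical placeholders, not the real standby host or any existing tooling. The actual change would still go through the usual DNS repo review and deploy.

```python
import re


def switch_cname(zone_text: str, name: str, new_target: str) -> str:
    """Return zone_text with the CNAME target of `name` replaced by new_target.

    Hypothetical helper: assumes one record per line, e.g.
    'gerrit 300 IN CNAME gerrit1003.wikimedia.org.'
    """
    if not new_target.endswith("."):
        new_target += "."  # keep the target fully qualified

    # Match only the exact record name, so 'gerrit' does not touch 'gerrit-replica'.
    pattern = re.compile(
        rf"^(?P<head>{re.escape(name)}[ \t]+\d+[ \t]+IN[ \t]+CNAME[ \t]+)\S+[ \t]*$",
        re.MULTILINE,
    )
    new_text, count = pattern.subn(lambda m: m.group("head") + new_target, zone_text)

    # Refuse to guess if the record is missing or duplicated.
    if count != 1:
        raise ValueError(f"expected exactly one CNAME record for {name!r}, found {count}")
    return new_text


if __name__ == "__main__":
    zone = "gerrit 300 IN CNAME gerrit1003.wikimedia.org.\n"
    # Placeholder target, for illustration only.
    print(switch_cname(zone, "gerrit", "gerrit-standby.example.org"), end="")
```

Failing loudly when the record is not found exactly once is deliberate: in an emergency it is better to stop than to silently leave the discovery record unchanged.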

Note that gerrit-replica.wikimedia.org also has its own record.

neat, that will simplify things!

I've added an item with a draft document to our team meeting so we can address the missing bits and make a Gerrit failover manageable in an emergency.