⚓ T330847 Switchover m5 master (db1183 -> db1176)

	Subject	Repo	Branch	Lines +/-
	mariadb: Promote db1176 to m5 master	operations/puppet	production	+7 -8
	m5-proxies: Add db1176 for testing	operations/puppet	production	+4 -4

Marostegui created this task.Mar 1 2023, 10:31 AM

Restricted Application added a subscriber: Aklapper. · View Herald TranscriptMar 1 2023, 10:31 AM

This needs to happen after 7th March once T329073 is finished

Marostegui updated the task description. (Show Details)Mar 1 2023, 10:32 AM

Marostegui moved this task from Triage to Blocked on the DBA board.

Marostegui mentioned this in T329478: Move db1176 to m5.

Marostegui added a project: Wikimedia-Mailing-lists.Mar 1 2023, 10:45 AM

Maintenance_bot added a project: SRE.Mar 1 2023, 10:45 AM

@bd808 @Andrew @Legoktm would Thursday 9th at 16:00 UTC work for you all?

Marostegui updated the task description. (Show Details)Mar 1 2023, 10:47 AM

In T330847#8656124, @Marostegui wrote:

would Thursday 9th at 16:00 UTC work for you all?

That date and time work for me.

cc. @Raymond_Ndibe in case you want to try maintaindbusers at that time (uses labsdbaccounts)

I am going to schedule this on Thursday 9th at 16:00 UTC - if someone has objections, please let me know!

Marostegui updated the task description. (Show Details)Mar 2 2023, 7:26 AM

Marostegui moved this task from Blocked to Ready on the DBA board.Mar 2 2023, 8:02 AM

Marostegui updated the task description. (Show Details)

Marostegui mentioned this in T330977: Move db1183 to m1.Mar 6 2023, 12:43 PM

Marostegui mentioned this in T330165: eqiad row B switches upgrade.Mar 7 2023, 8:59 AM

Marostegui changed the task status from Stalled to Open.Mar 8 2023, 8:43 AM

Marostegui moved this task from Ready to In progress on the DBA board.

Marostegui added a parent task: T330165: eqiad row B switches upgrade.Mar 8 2023, 9:00 AM

Change 895908 had a related patch set uploaded (by Marostegui; author: Marostegui):

[operations/puppet@production] m5-proxies: Add db1176 for testing

https://gerrit.wikimedia.org/r/895908

Change 895908 merged by Marostegui:

[operations/puppet@production] m5-proxies: Add db1176 for testing

https://gerrit.wikimedia.org/r/895908

Checked that haproxy sees db1176 just fine

Change 895910 had a related patch set uploaded (by Marostegui; author: Marostegui):

[operations/puppet@production] mariadb: Promote db1176 to m5 master

https://gerrit.wikimedia.org/r/895910

Marostegui updated the task description. (Show Details)Mar 9 2023, 7:45 AM

Marostegui updated the task description. (Show Details)

Marostegui updated the task description. (Show Details)Mar 9 2023, 7:48 AM

Marostegui updated the task description. (Show Details)Mar 9 2023, 1:27 PM

Mentioned in SAL (#wikimedia-operations) [2023-03-09T15:02:00Z] <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 1:30:00 on db[2135,2160].codfw.wmnet,db[1117,1176,1183].eqiad.wmnet with reason: m5 master switch T330847

Mentioned in SAL (#wikimedia-operations) [2023-03-09T15:02:16Z] <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:30:00 on db[2135,2160].codfw.wmnet,db[1117,1176,1183].eqiad.wmnet with reason: m5 master switch T330847

Marostegui updated the task description. (Show Details)Mar 9 2023, 3:02 PM

Marostegui updated the task description. (Show Details)

Change 895910 merged by Marostegui:

[operations/puppet@production] mariadb: Promote db1176 to m5 master

https://gerrit.wikimedia.org/r/895910

Marostegui updated the task description. (Show Details)Mar 9 2023, 3:05 PM

Maintenance_bot removed a project: Patch-For-Review.Mar 9 2023, 3:10 PM

All the pre-failover steps are done. Waiting for 16:00 UTC to perform the actual switch.

Mentioned in SAL (#wikimedia-operations) [2023-03-09T16:00:09Z] <marostegui> Failover m5 from db1183 to db1176 - T330847

Marostegui updated the task description. (Show Details)Mar 9 2023, 4:02 PM

This was done, the RO time was around 15 seconds.
Thanks @bd808 for the support!

Marostegui updated the task description. (Show Details)Mar 9 2023, 4:09 PM

Maintenance_bot moved this task from In progress to Done on the DBA board.Mar 9 2023, 4:15 PM

Status	Assigned	Task
Open	None	T253824 planned upstream deprecation of the ssh-rsa signing algorithm (RSA with SHA-1)
Resolved	ayounsi	T254013 all network devices must run OpenSSH >= 7.2p1 but != 7.4p1
Resolved	ayounsi	T317175 Junos: resolve DNS through mgmt_junos
Resolved	ayounsi	T327862 Use mgmt_junos on all network devices
		Restricted Task
Open	None	T316539 Upgrade network devices to Junos 20+
Resolved	ayounsi	T327248 eqiad/codfw virtual-chassis upgrades
		Restricted Task
		Restricted Task
Resolved	Clement_Goubert	T327920 March 2023 Datacenter Switchover
Resolved	ayounsi	T330165 eqiad row B switches upgrade
Resolved	Marostegui	T330847 Switchover m5 master (db1183 -> db1176)

Switchover m5 master (db1183 -> db1176)
Closed, ResolvedPublic
Actions

Description

Details

Related Objects
Search...

Event Timeline

Switchover m5 master (db1183 -> db1176)Closed, ResolvedPublicActions

Description

Details

Related ObjectsSearch...

Event Timeline

Switchover m5 master (db1183 -> db1176)
Closed, ResolvedPublic
Actions

Related Objects
Search...