⚓ T329259 Switchover m1 master (db1176 -> db1164)

Subject	Repo	Branch	Lines +/-
dbbackups: Replace m1 master	operations/puppet	production	+2 -2
mariadb: Promote db1164 to m1 master	operations/puppet	production	+9 -10
mariadb: Test db1164 in m1	operations/puppet	production	+4 -4

Marostegui created this task.Feb 9 2023, 8:29 AM

Restricted Application added a subscriber: Aklapper. · View Herald TranscriptFeb 9 2023, 8:29 AM

Marostegui triaged this task as Medium priority.Feb 9 2023, 8:30 AM

Marostegui added a project: DBA.

Marostegui updated the task description. (Show Details)

Marostegui moved this task from Triage to Ready on the DBA board.

Marostegui updated the task description. (Show Details)

Marostegui updated the task description. (Show Details)Feb 9 2023, 8:39 AM

Marostegui mentioned this in T329143: Move db1164 to m1.Feb 9 2023, 8:48 AM

Change 887885 had a related patch set uploaded (by Marostegui; author: Marostegui):

[operations/puppet@production] monitoring.yaml: Replace m1 master

https://gerrit.wikimedia.org/r/887885

gerritbot added a project: Patch-For-Review.Feb 9 2023, 8:58 AM

Marostegui updated the task description. (Show Details)Feb 9 2023, 8:59 AM

Marostegui moved this task from Ready to In progress on the DBA board.Feb 9 2023, 9:51 AM

Change 888359 had a related patch set uploaded (by Marostegui; author: Marostegui):

[operations/puppet@production] mariadb: Promote db1164 to m1 master

https://gerrit.wikimedia.org/r/888359

Marostegui updated the task description. (Show Details)Feb 13 2023, 6:22 AM

Change 888395 had a related patch set uploaded (by Marostegui; author: Marostegui):

[operations/puppet@production] mariadb: Test db1164 in m1

https://gerrit.wikimedia.org/r/888395

Change 888395 merged by Marostegui:

[operations/puppet@production] mariadb: Test db1164 in m1

https://gerrit.wikimedia.org/r/888395

Mentioned in SAL (#wikimedia-operations) [2023-02-13T06:59:25Z] <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 1:00:00 on db[2132,2160].codfw.wmnet,db[1117,1164,1176].eqiad.wmnet with reason: Primary switchover m1 T329259

Mentioned in SAL (#wikimedia-operations) [2023-02-13T06:59:41Z] <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db[2132,2160].codfw.wmnet,db[1117,1164,1176].eqiad.wmnet with reason: Primary switchover m1 T329259

Marostegui updated the task description. (Show Details)Feb 13 2023, 7:00 AM

Marostegui updated the task description. (Show Details)Feb 13 2023, 7:51 AM

Marostegui mentioned this in T329478: Move db1176 to m5.Feb 13 2023, 7:53 AM

Change 888359 merged by Marostegui:

[operations/puppet@production] mariadb: Promote db1164 to m1 master

https://gerrit.wikimedia.org/r/888359

Mentioned in SAL (#wikimedia-operations) [2023-02-13T10:06:40Z] <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 1:00:00 on db[2132,2160].codfw.wmnet,db[1117,1164,1176].eqiad.wmnet with reason: Primary switchover m1 T329259

Mentioned in SAL (#wikimedia-operations) [2023-02-13T10:06:56Z] <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db[2132,2160].codfw.wmnet,db[1117,1164,1176].eqiad.wmnet with reason: Primary switchover m1 T329259

Marostegui updated the task description. (Show Details)Feb 13 2023, 10:07 AM

Mentioned in SAL (#wikimedia-operations) [2023-02-13T10:16:28Z] <jynus> stopping bacula and disabling puppet at backup1001 for m1 switchover T329259

Marostegui updated the task description. (Show Details)Feb 13 2023, 10:29 AM

Mentioned in SAL (#wikimedia-operations) [2023-02-13T11:00:01Z] <marostegui> Failover m1 from db1176 to db1164 - T329259

Marostegui updated the task description. (Show Details)Feb 13 2023, 11:02 AM

Marostegui updated the task description. (Show Details)

Change 887885 merged by Marostegui:

[operations/puppet@production] dbbackups: Replace m1 master

https://gerrit.wikimedia.org/r/887885

This was done

Marostegui updated the task description. (Show Details)Feb 13 2023, 11:06 AM

Marostegui mentioned this in T329073: eqiad row A switches upgrade.

Maintenance_bot moved this task from In progress to Done on the DBA board.Feb 13 2023, 11:15 AM

Maintenance_bot removed a project: Patch-For-Review.Feb 13 2023, 11:31 AM

Status	Assigned	Task
Open	None	T253824 planned upstream deprecation of the ssh-rsa signing algorithm (RSA with SHA-1)
Resolved	• ayounsi	T254013 all network devices must run OpenSSH >= 7.2p1 but != 7.4p1
Resolved	• ayounsi	T317175 Junos: resolve DNS through mgmt_junos
Resolved	• ayounsi	T327862 Use mgmt_junos on all network devices
		Restricted Task
Open	None	T316539 Upgrade network devices to Junos 20+
Resolved	• ayounsi	T327248 eqiad/codfw virtual-chassis upgrades
Resolved	Clement_Goubert	T327920 March 2023 Datacenter Switchover
Resolved	cmooney	T329073 eqiad row A switches upgrade
Resolved	Marostegui	T329259 Switchover m1 master (db1176 -> db1164)

Switchover m1 master (db1176 -> db1164)
Closed, ResolvedPublic
Actions

Description

Details

Related Objects
Search...

Event Timeline

Switchover m1 master (db1176 -> db1164)Closed, ResolvedPublicActions

Description

Details

Related ObjectsSearch...

Event Timeline

Switchover m1 master (db1176 -> db1164)
Closed, ResolvedPublic
Actions

Related Objects
Search...