Page MenuHomePhabricator

Move one host temporarily to m2
Closed, ResolvedPublic

Description

m2 master needs to be replaced.
Let's place a temporary host there to take over.

Event Timeline

Marostegui added parent tasks: Restricted Task, Restricted Task.

Mentioned in SAL (#wikimedia-operations) [2024-06-25T09:32:22Z] <marostegui@cumin1002> dbctl commit (dc=all): 'Depool db1228 T368374', diff saved to https://phabricator.wikimedia.org/P65408 and previous config saved to /var/cache/conftool/dbconfig/20240625-093221-root.json

Change #1049485 had a related patch set uploaded (by Marostegui; author: Marostegui):

[operations/puppet@production] instances.yaml: Remove db1228 from dbctl

https://gerrit.wikimedia.org/r/1049485

Change #1049485 merged by Marostegui:

[operations/puppet@production] instances.yaml: Remove db1228 from dbctl

https://gerrit.wikimedia.org/r/1049485

Mentioned in SAL (#wikimedia-operations) [2024-06-25T09:34:55Z] <marostegui@cumin1002> dbctl commit (dc=all): 'Remove db1228 from dbctl T368374', diff saved to https://phabricator.wikimedia.org/P65409 and previous config saved to /var/cache/conftool/dbconfig/20240625-093454-marostegui.json

Icinga downtime and Alertmanager silence (ID=d0a02871-1560-4aff-b541-0cb93b9c0ed0) set by marostegui@cumin1002 for 2 days, 0:00:00 on 2 host(s) and their services with reason: Cloning

db[1217,1228].eqiad.wmnet

1*************************** 1. row ***************************
2 Slave_IO_State:
3 Master_Host: db1195.eqiad.wmnet
4 Master_User: repl2024
5 Master_Port: 3306
6 Connect_Retry: 60
7 Master_Log_File: db1195-bin.000533
8 Read_Master_Log_Pos: 451132773
9 Relay_Log_File: db1217-relay-bin.000097
10 Relay_Log_Pos: 451133073
11 Relay_Master_Log_File: db1195-bin.000533
12 Slave_IO_Running: No
13 Slave_SQL_Running: No
14 Replicate_Do_DB:
15 Replicate_Ignore_DB:
16 Replicate_Do_Table:
17 Replicate_Ignore_Table:
18 Replicate_Wild_Do_Table:
19 Replicate_Wild_Ignore_Table:
20 Last_Errno: 0
21 Last_Error:
22 Skip_Counter: 0
23 Exec_Master_Log_Pos: 451132773
24 Relay_Log_Space: 451133431
25 Until_Condition: None
26 Until_Log_File:
27 Until_Log_Pos: 0
28 Master_SSL_Allowed: Yes
29 Master_SSL_CA_File:
30 Master_SSL_CA_Path:
31 Master_SSL_Cert:
32 Master_SSL_Cipher:
33 Master_SSL_Key:
34 Seconds_Behind_Master: NULL
35 Master_SSL_Verify_Server_Cert: No
36 Last_IO_Errno: 0
37 Last_IO_Error:
38 Last_SQL_Errno: 0
39 Last_SQL_Error:
40 Replicate_Ignore_Server_Ids:
41 Master_Server_Id: 172001292
42 Master_SSL_Crl:
43 Master_SSL_Crlpath:
44 Using_Gtid: Slave_Pos
45 Gtid_IO_Pos: 171970595-171970595-143176442,0-171970569-1006906062,171970573-171970573-2371267,171970746-171970746-135654591,171978772-171978772-139568865,171966512-171966512-126314222,171970569-171970569-156638323,171966678-171966678-232525439,171970778-171970778-108155596,171970636-171970636-23122305,172001292-172001292-344164559
46 Replicate_Do_Domain_Ids:
47 Replicate_Ignore_Domain_Ids:
48 Parallel_Mode: optimistic
49 SQL_Delay: 0
50 SQL_Remaining_Delay: NULL
51 Slave_SQL_Running_State:
52 Slave_DDL_Groups: 0
53Slave_Non_Transactional_Groups: 0
54 Slave_Transactional_Groups: 48583481

Change #1049497 had a related patch set uploaded (by Marostegui; author: Marostegui):

[operations/puppet@production] mariadb: Move db1228 to m2

https://gerrit.wikimedia.org/r/1049497

Change #1049497 merged by Marostegui:

[operations/puppet@production] mariadb: Move db1228 to m2

https://gerrit.wikimedia.org/r/1049497

Mentioned in SAL (#wikimedia-operations) [2024-06-25T10:40:06Z] <marostegui> m2 dbmaint eqiad Stop db1217:3322 to clone db1228 T368374

This is done, I will track the master switchover in a different track