T206450 Reorganize our redis rdb1/rdb2 clusters

Subject	Repo	Branch	Lines +/-
install_server: reimage rdb2001, rdb2002 to stretch	operations/puppet	production	+3 -15
role::codfw::scb: switch rdb2001:6382 with rdb2003:6379	operations/puppet	production	+2 -2
Reimage rdb2003/rdb2004 to stretch	operations/puppet	production	+0 -4
Reimage rdb2003/rdb2004, switch rdb100[123478] to spare::system	operations/puppet	production	+7 -7
role::codfw::scb: switch rdb2003:6382 with rdb2005:6379	operations/puppet	production	+2 -2
role::eqiad::scb: switch rdb1003:6382 with rdb1005:6379	operations/puppet	production	+2 -2
Reimage rdb2005/rdb2006	operations/puppet	production	+8 -7
Change rdb1005 and rdb1006 to redis::misc master/slave	operations/puppet	production	+3 -9
Change rdb1005 to spare:system	operations/puppet	production	+6 -1
install_server: Reimage rdb1005 to stretch	operations/puppet	production	+0 -2
role::eqiad::scb: switch rdb1001:6382 with rdb1009:6379	operations/puppet	production	+2 -2
prometheus: fix redis misc role	operations/puppet	production	+2 -2
prometheus: fix for redis_misc metrics	operations/puppet	production	+2 -2
prometheus: added redis_misc metrics	operations/puppet	production	+20 -0
deployment-prep: fixed suffix for deployment-rd3-cptest-master01	operations/puppet	production	+1 -1
deployment-prep: added hieradata for deployment-rd3 host	operations/puppet	production	+12 -0
redis::misc Fixed typos	operations/puppet	production	+2 -2
Added new role::redis::misc for general purposes redis servers	operations/puppet	production	+60 -3
Added dummy pass for role redis::misc::master	labs/private	master	+1 -0

Status	Assigned	Task
Resolved	None	T206450 Reorganize our redis rdb1/rdb2 clusters
Resolved	None	T196685 rack/setup/install rdb10[09|10].eqiad.wmnet
Resolved	Jclark-ctr	T209181 Decommission rdb1001, rdb1002, rdb1003, rdb1004, rdb1007, rdb1008
Declined	None	T209890 Memory consumption in Redis 3.2 vs Redis 2.8
Resolved	jijiki	T209064 Changeprop: Error during deduplication
Resolved	Papaul	T209425 Decommission rdb2001, rdb2002

Joe created this task. Oct 8 2018, 9:30 AM

Restricted Application added a subscriber: Aklapper. Oct 8 2018, 9:30 AM

Joe added a subtask: T196685: rack/setup/install rdb10[09|10].eqiad.wmnet. Oct 8 2018, 9:32 AM

jijiki moved this task from Inbox 🐅 to In Progress 🏋️‍♀️ on the User-jijiki board. Oct 12 2018, 9:35 AM

MoritzMuehlenhoff subscribed. Oct 12 2018, 12:54 PM

Change 467734 had a related patch set uploaded (by Effie Mouzeli; owner: Effie Mouzeli):
[operations/puppet@production] WIP: Added new role::redis::misc for general purposes redis servers

https://gerrit.wikimedia.org/r/467734

gerritbot added a project: Patch-For-Review. Oct 16 2018, 5:19 PM

Change 468310 had a related patch set uploaded (by Effie Mouzeli; owner: Effie Mouzeli):
[labs/private@master] Added dummy pass for role redis::misc::master

https://gerrit.wikimedia.org/r/468310

Change 468310 merged by Effie Mouzeli:
[labs/private@master] Added dummy pass for role redis::misc::master

https://gerrit.wikimedia.org/r/468310

Change 467734 merged by Effie Mouzeli:
[operations/puppet@production] Added new role::redis::misc for general purposes redis servers

https://gerrit.wikimedia.org/r/467734

Change 468586 had a related patch set uploaded (by Effie Mouzeli; owner: Effie Mouzeli):
[operations/puppet@production] redis::misc Fixed typos

https://gerrit.wikimedia.org/r/468586

Change 468586 merged by Effie Mouzeli:
[operations/puppet@production] redis::misc Fixed typos

https://gerrit.wikimedia.org/r/468586

jijiki updated the task description. Oct 19 2018, 3:02 PM

jijiki triaged this task as Medium priority. Oct 22 2018, 3:55 PM

Change 470615 had a related patch set uploaded (by Effie Mouzeli; owner: Effie Mouzeli):
[operations/puppet@production] deployment-prep: added hieradata for deployment-rd3 host

https://gerrit.wikimedia.org/r/470615

Change 470615 merged by Effie Mouzeli:
[operations/puppet@production] deployment-prep: added hieradata for deployment-rd3 host

https://gerrit.wikimedia.org/r/470615

Change 470623 had a related patch set uploaded (by Effie Mouzeli; owner: Effie Mouzeli):
[operations/puppet@production] deployment-prep: fixed suffix for deployment-rd3-cptest-master01

https://gerrit.wikimedia.org/r/470623

Change 470623 merged by Effie Mouzeli:
[operations/puppet@production] deployment-prep: fixed suffix for deployment-rd3-cptest-master01

https://gerrit.wikimedia.org/r/470623

Change 471745 had a related patch set uploaded (by Effie Mouzeli; owner: Effie Mouzeli):
[operations/puppet@production] prometheus: added redis_misc metrics

https://gerrit.wikimedia.org/r/471745

Change 471745 merged by Effie Mouzeli:
[operations/puppet@production] prometheus: added redis_misc metrics

https://gerrit.wikimedia.org/r/471745

Change 471757 had a related patch set uploaded (by Effie Mouzeli; owner: Effie Mouzeli):
[operations/puppet@production] prometheus: fix for redis_misc metrics

https://gerrit.wikimedia.org/r/471757

Change 471757 merged by Effie Mouzeli:
[operations/puppet@production] prometheus: fix for redis_misc metrics

https://gerrit.wikimedia.org/r/471757

Change 471929 had a related patch set uploaded (by Effie Mouzeli; owner: Effie Mouzeli):
[operations/puppet@production] prometheus: fix redis misc role

https://gerrit.wikimedia.org/r/471929

Change 471929 merged by Effie Mouzeli:
[operations/puppet@production] prometheus: fix redis misc role

https://gerrit.wikimedia.org/r/471929

Change 471959 had a related patch set uploaded (by Effie Mouzeli; owner: Effie Mouzeli):
[operations/puppet@production] role::eqiad::scb: switch rdb1001:6382 with rdb1009:6379

https://gerrit.wikimedia.org/r/471959

Change 471959 merged by Effie Mouzeli:
[operations/puppet@production] role::eqiad::scb: switch rdb1001:6382 with rdb1009:6379

https://gerrit.wikimedia.org/r/471959

Change 472240 had a related patch set uploaded (by Effie Mouzeli; owner: Effie Mouzeli):
[operations/puppet@production] install_server: Reimage rdb1005 to stretch

https://gerrit.wikimedia.org/r/472240

Change 472240 merged by Effie Mouzeli:
[operations/puppet@production] install_server: Reimage rdb1005 to stretch

https://gerrit.wikimedia.org/r/472240

Change 472251 had a related patch set uploaded (by Effie Mouzeli; owner: Effie Mouzeli):
[operations/puppet@production] Change rdb1005 to spare:system

https://gerrit.wikimedia.org/r/472251

Change 472251 merged by Effie Mouzeli:
[operations/puppet@production] Change rdb1005 to spare:system

https://gerrit.wikimedia.org/r/472251

Script wmf-auto-reimage was launched by jiji on cumin1001.eqiad.wmnet for hosts:

rdb1005.eqiad.wmnet

The log can be found in /var/log/wmf-auto-reimage/201811072242_jiji_158394_rdb1005_eqiad_wmnet.log.

Completed auto-reimage of hosts:

['rdb1005.eqiad.wmnet']

and were ALL successful.

Mentioned in SAL (#wikimedia-operations) [2018-11-07T23:21:21Z] <jiji> Disabled nagios checks on rdb1006 and rdb2005 due to rdb1005 reimaging - T206450

Mentioned in SAL (#wikimedia-operations) [2018-11-08T10:52:36Z] <jiji> Reimaging rdb1006 to stretch - T206450

Change 472412 had a related patch set uploaded (by Effie Mouzeli; owner: Effie Mouzeli):
[operations/puppet@production] Change rdb1005 and rdb1006 to redis::misc master/slave

https://gerrit.wikimedia.org/r/472412

Change 472412 merged by Effie Mouzeli:
[operations/puppet@production] Change rdb1005 and rdb1006 to redis::misc master/slave

https://gerrit.wikimedia.org/r/472412

Script wmf-auto-reimage was launched by jiji on cumin1001.eqiad.wmnet for hosts:

rdb1006.eqiad.wmnet

The log can be found in /var/log/wmf-auto-reimage/201811081152_jiji_140192_rdb1006_eqiad_wmnet.log.

Completed auto-reimage of hosts:

['rdb1006.eqiad.wmnet']

and were ALL successful.

Mentioned in SAL (#wikimedia-operations) [2018-11-08T12:41:23Z] <jiji> Shutdown and reimage rdb200[56] - T206450

Mentioned in SAL (#wikimedia-operations) [2018-11-08T13:38:40Z] <jiji> Done reimaging rdb1006 - T206450

Change 472449 had a related patch set uploaded (by Effie Mouzeli; owner: Effie Mouzeli):
[operations/puppet@production] Reimage rdb2005/rdb2006

https://gerrit.wikimedia.org/r/472449

Change 472454 had a related patch set uploaded (by Effie Mouzeli; owner: Effie Mouzeli):
[operations/puppet@production] role::eqiad::scb: switch rdb1003:6382 with rdb1005:6379

https://gerrit.wikimedia.org/r/472454

jijiki mentioned this in T209064: Changeprop: Error during deduplication. Nov 8 2018, 3:14 PM

Change 472449 merged by Effie Mouzeli:
[operations/puppet@production] Reimage rdb2005/rdb2006

https://gerrit.wikimedia.org/r/472449

jijiki updated the task description. Nov 8 2018, 6:13 PM

Script wmf-auto-reimage was launched by jiji on cumin1001.eqiad.wmnet for hosts:

['rdb2005.codfw.wmnet']

The log can be found in /var/log/wmf-auto-reimage/201811081821_jiji_1886.log.

Script wmf-auto-reimage was launched by jiji on cumin1001.eqiad.wmnet for hosts:

['rdb2006.codfw.wmnet']

The log can be found in /var/log/wmf-auto-reimage/201811081823_jiji_2571.log.

Completed auto-reimage of hosts:

['rdb2005.codfw.wmnet']

and were ALL successful.

Completed auto-reimage of hosts:

['rdb2006.codfw.wmnet']

and were ALL successful.

Change 472454 merged by Effie Mouzeli:
[operations/puppet@production] role::eqiad::scb: switch rdb1003:6382 with rdb1005:6379

https://gerrit.wikimedia.org/r/472454

Change 472669 had a related patch set uploaded (by Effie Mouzeli; owner: Effie Mouzeli):
[operations/puppet@production] role::codfw::scb: switch rdb2003:6382 with rdb2005:6379

https://gerrit.wikimedia.org/r/472669

Change 472669 merged by Effie Mouzeli:
[operations/puppet@production] role::codfw::scb: switch rdb2003:6382 with rdb2005:6379

https://gerrit.wikimedia.org/r/472669

jijiki updated the task description. Nov 9 2018, 7:07 PM

jijiki updated the task description.

Change 472714 had a related patch set uploaded (by Effie Mouzeli; owner: Effie Mouzeli):
[operations/puppet@production] Reimage rdb2003/rdb2004

https://gerrit.wikimedia.org/r/472714

jijiki closed subtask T196685: rack/setup/install rdb10[09|10].eqiad.wmnet as Resolved. Nov 9 2018, 8:25 PM

Change 472714 merged by Effie Mouzeli:
[operations/puppet@production] Reimage rdb2003/rdb2004, switch rdb100[123478] to spare::system

https://gerrit.wikimedia.org/r/472714

Change 472729 had a related patch set uploaded (by Effie Mouzeli; owner: Effie Mouzeli):
[operations/puppet@production] Reimage rdb2003/rdb2004 to stretch

https://gerrit.wikimedia.org/r/472729

Change 472729 merged by Effie Mouzeli:
[operations/puppet@production] Reimage rdb2003/rdb2004 to stretch

https://gerrit.wikimedia.org/r/472729

Script wmf-auto-reimage was launched by jiji on cumin1001.eqiad.wmnet for hosts:

['rdb2003.codfw.wmnet']

The log can be found in /var/log/wmf-auto-reimage/201811092114_jiji_244778.log.

Mentioned in SAL (#wikimedia-operations) [2018-11-09T21:16:24Z] <jiji> Reimaging rdb2003, rdb2004 - T206450

Script wmf-auto-reimage was launched by jiji on cumin1001.eqiad.wmnet for hosts:

['rdb2004.codfw.wmnet']

The log can be found in /var/log/wmf-auto-reimage/201811092116_jiji_245283.log.

Completed auto-reimage of hosts:

['rdb2004.codfw.wmnet']

and were ALL successful.

Completed auto-reimage of hosts:

['rdb2003.codfw.wmnet']

and were ALL successful.

Change 472964 had a related patch set uploaded (by Effie Mouzeli; owner: Effie Mouzeli):
[operations/puppet@production] role::codfw::scb: switch rdb2001:6382 with rdb2003:6379

https://gerrit.wikimedia.org/r/472964

jijiki updated the task description. Nov 12 2018, 11:39 AM

Change 472964 merged by Effie Mouzeli:
[operations/puppet@production] role::codfw::scb: switch rdb2001:6382 with rdb2003:6379

https://gerrit.wikimedia.org/r/472964

Mentioned in SAL (#wikimedia-operations) [2018-11-12T12:15:16Z] <jiji> Restarting nutcracker on scb200[1-6] - T206450

Change 472970 had a related patch set uploaded (by Effie Mouzeli; owner: Effie Mouzeli):
[operations/puppet@production] install_server: reimage rdb2001, rdb2002 to stretch

https://gerrit.wikimedia.org/r/472970

Change 472970 merged by Effie Mouzeli:
[operations/puppet@production] install_server: reimage rdb2001, rdb2002 to stretch

https://gerrit.wikimedia.org/r/472970

jijiki moved this task from In Progress 🏋️‍♀️ to Misc on the User-jijiki board. Nov 19 2018, 9:10 AM

jijiki closed this task as Resolved. Nov 19 2018, 6:10 PM

jijiki updated the task description.

jijiki added a subtask: T209890: Memory consumption in Redis 3.2 vs Redis 2.8. Nov 19 2018, 10:38 PM

jijiki added a subtask: T209064: Changeprop: Error during deduplication.

jijiki added a subtask: T209425: Decommission rdb2001, rdb2002. Nov 20 2018, 12:21 AM

Papaul closed subtask T209425: Decommission rdb2001, rdb2002 as Resolved. Jun 5 2019, 2:34 PM

Cmjohnson closed subtask T209181: Decommission rdb1001, rdb1002, rdb1003, rdb1004, rdb1007, rdb1008 as Resolved. May 13 2020, 6:38 PM

Maintenance_bot removed a project: Patch-For-Review. May 13 2020, 7:11 PM