Page MenuHomePhabricator

Decommission db1026
Closed, ResolvedPublic

Description

db1026 is ready to be decommissioned after its data was copied to db1096

  • - all system services confirmed offline from production use: Removed from mediawiki-config: https://gerrit.wikimedia.org/r/#/c/375364/
  • - set all icinga checks to maint mode/disabled while reclaim/decommmission takes place.
  • - remove system from all lvs/pybal active configuration
  • - any service group puppet/heira/dsh config removed
  • - remove site.pp (replace with role::spare if system isn't shut down immediately during this process.):
  • Set it to spare until @Cmjohnson starts his steps: https://gerrit.wikimedia.org/r/#/c/375365/

START NON-INTERRUPPTABLE STEPS

  • - disable puppet on host
  • - remove all remaining puppet references (include role::spare)
  • - power down host
  • - disable switch port & change switch port label to asset tag
  • - remove production dns entries & remove hostname entries in mgmt dns
  • - puppet node clean, puppet node deactivate, salt key removed

END NON-INTERRUPPTABLE STEPS

  • - system disks wiped (by onsite)
  • - remove hostname label, remove hostname from visible label field in racktables (by onsite)
  • - system added back to spares tracking (by onsite)

Event Timeline

Restricted Application added a subscriber: Aklapper. · View Herald Transcript
Marostegui triaged this task as Medium priority.Sep 1 2017, 6:55 AM
Marostegui moved this task from Triage to In progress on the DBA board.

Change 375364 had a related patch set uploaded (by Marostegui; owner: Marostegui):
[operations/mediawiki-config@master] db-codfw,db-eqiad.php: Remove db1026

https://gerrit.wikimedia.org/r/375364

Change 375364 merged by jenkins-bot:
[operations/mediawiki-config@master] db-codfw,db-eqiad.php: Remove db1026

https://gerrit.wikimedia.org/r/375364

Mentioned in SAL (#wikimedia-operations) [2017-09-01T12:17:42Z] <marostegui@tin> Synchronized wmf-config/db-codfw.php: Remove db1026 as it will be decommissioned - T174763 (duration: 00m 43s)

Mentioned in SAL (#wikimedia-operations) [2017-09-01T12:19:51Z] <marostegui@tin> Synchronized wmf-config/db-eqiad.php: Remove db1026 as it will be decommissioned - T174763 (duration: 00m 43s)

Change 375365 had a related patch set uploaded (by Marostegui; owner: Marostegui):
[operations/puppet@production] mariadb: Decommission db1026

https://gerrit.wikimedia.org/r/375365

Change 375365 merged by Marostegui:
[operations/puppet@production] mariadb: Decommission db1026

https://gerrit.wikimedia.org/r/375365

Change 375370 had a related patch set uploaded (by Marostegui; owner: Marostegui):
[operations/software@master] s5.hosts: Remove db1026

https://gerrit.wikimedia.org/r/375370

Change 375370 merged by jenkins-bot:
[operations/software@master] s5.hosts: Remove db1026

https://gerrit.wikimedia.org/r/375370

Mentioned in SAL (#wikimedia-operations) [2017-09-01T13:10:40Z] <marostegui> Stop MySQL on db1026 as it will be decommissioned - T174763

Marostegui moved this task from In progress to Done on the DBA board.

db1026 is now ready to be decommissioned and all the pending steps are DC Ops ones, so I am handing this over to @Cmjohnson

Change 375950 had a related patch set uploaded (by Marostegui; owner: Marostegui):
[operations/puppet@production] db1026.yaml: Remove file

https://gerrit.wikimedia.org/r/375950

Change 375950 merged by Marostegui:
[operations/puppet@production] db1026.yaml: Remove file

https://gerrit.wikimedia.org/r/375950

This host still shows up in puppetdb, i.e. misses the deactivate step (e.g. visible in https://servermon.wikimedia.org/hosts/)