Decommission db2061.codfw.wmnet
Closed, Resolved · Public

Description

This task will track the decommission of server db2061.codfw.wmnet

With the launch of updates to the decom cookbook, the majority of these steps can be handled by the service owners directly. The DC Ops team only gets involved once the system has been fully removed from service and powered down by the decommission cookbook.

db2061

Steps for service owner:

End service owner steps / Begin DC-Ops team steps:

  • Disable switch port; set to asset tag if host isn't being unracked, or remove from switch if being unracked.
  • System disks wiped (by onsite).
  • Determine system age: systems under 5 years are reclaimed to spare, systems over 5 years are decommissioned. If uncertain, ask @wiki_willy.
  • IF DECOM: system unracked and decommissioned (by onsite); update Netbox with the result and set state to Offline.
  • IF DECOM: switch port configuration removed from switch once system is unracked.
  • IF DECOM: add system to decommission tracking Google sheet.
  • IF DECOM: mgmt DNS entries removed.
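The age rule in the checklist above can be sketched as a small helper. This is only an illustration: the function name, the use of a purchase date as the age reference, and the choice to treat a host that is exactly 5 years old as a decommission candidate are assumptions, since the checklist only says "under 5 years" and "over 5 years".

```python
from datetime import date

# Threshold taken from the checklist; everything else here is hypothetical.
RECLAIM_AGE_LIMIT_YEARS = 5


def disposition(purchase_date: date, today: date) -> str:
    """Return 'spare' for hosts under the age limit, 'decommission' otherwise."""
    # Whole years elapsed, minus one if this year's anniversary has not passed yet.
    anniversary_pending = (today.month, today.day) < (
        purchase_date.month,
        purchase_date.day,
    )
    age_years = today.year - purchase_date.year - int(anniversary_pending)
    # Assumption: a host that is exactly at the limit counts as "over".
    if age_years < RECLAIM_AGE_LIMIT_YEARS:
        return "spare"
    return "decommission"
```

The dates passed in would come from whatever asset record holds the purchase date; when that record is missing or unclear, the checklist's fallback of asking @wiki_willy still applies.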

Details

Related Gerrit Patches:
[operations/dns@master] DNS: Remove mgmt DNS for db2048 and db2061
[operations/dns@master] DNS: Remove mgnt DNS for db2048 and db2061
[operations/dns@master] wmnet: Remove production DNS entries for db2061
[operations/puppet@production] mariadb: Remove db2061 from puppet
[operations/puppet@production] mariadb: Set db2061 to spare
[operations/mediawiki-config@master] db-eqiad,db-codfw.php. Remove db2061 from config

Event Timeline

Restricted Application added a subscriber: Aklapper. · Mon, Nov 18, 10:21 AM
Marostegui triaged this task as Medium priority. · Mon, Nov 18, 10:21 AM
Marostegui moved this task from Triage to Next on the DBA board.
Marostegui updated the task description.

Change 551728 had a related patch set uploaded (by Marostegui; owner: Marostegui):
[operations/mediawiki-config@master] db-eqiad,db-codfw.php. Remove db2061 from config

https://gerrit.wikimedia.org/r/551728

Marostegui updated the task description. · Tue, Nov 19, 6:36 AM

Change 551728 merged by jenkins-bot:
[operations/mediawiki-config@master] db-eqiad,db-codfw.php. Remove db2061 from config

https://gerrit.wikimedia.org/r/551728

Mentioned in SAL (#wikimedia-operations) [2019-11-19T06:38:12Z] <marostegui@deploy1001> Synchronized wmf-config/db-codfw.php: Remove db2061 from config T238526 (duration: 00m 53s)

Mentioned in SAL (#wikimedia-operations) [2019-11-19T06:39:09Z] <marostegui@deploy1001> Synchronized wmf-config/db-eqiad.php: Remove db2061 from config T238526 (duration: 00m 52s)

Change 551729 had a related patch set uploaded (by Marostegui; owner: Marostegui):
[operations/puppet@production] mariadb: Set db2061 to spare

https://gerrit.wikimedia.org/r/551729

Change 551729 merged by Marostegui:
[operations/puppet@production] mariadb: Set db2061 to spare

https://gerrit.wikimedia.org/r/551729

Marostegui updated the task description. · Tue, Nov 19, 6:44 AM

Mentioned in SAL (#wikimedia-operations) [2019-11-19T06:44:34Z] <marostegui> Remove db2061 from tendril and zarcillo T238526

Mentioned in SAL (#wikimedia-operations) [2019-11-19T06:45:06Z] <marostegui> Stop MySQL on db2061 T238526

Marostegui moved this task from Next to In progress on the DBA board. · Tue, Nov 19, 1:41 PM

cookbooks.sre.hosts.decommission executed by marostegui@cumin1001 for hosts: db2061.codfw.wmnet

  • db2061.codfw.wmnet (PASS)
    • Downtimed host on Icinga
    • Downtimed management interface on Icinga
    • Wiped bootloaders
    • Powered off
    • Set Netbox status to Decommissioning
    • Removed from DebMonitor
    • Removed from Puppet master and PuppetDB
Marostegui updated the task description. · Wed, Nov 20, 8:05 AM

Change 551968 had a related patch set uploaded (by Marostegui; owner: Marostegui):
[operations/puppet@production] mariadb: Remove db2061 from puppet

https://gerrit.wikimedia.org/r/551968

Change 551969 had a related patch set uploaded (by Marostegui; owner: Marostegui):
[operations/dns@master] wmnet: Remove production DNS entries for db2061

https://gerrit.wikimedia.org/r/551969

Change 551968 merged by Marostegui:
[operations/puppet@production] mariadb: Remove db2061 from puppet

https://gerrit.wikimedia.org/r/551968

Change 551969 merged by Marostegui:
[operations/dns@master] wmnet: Remove production DNS entries for db2061

https://gerrit.wikimedia.org/r/551969

Marostegui reassigned this task from Marostegui to Papaul. · Wed, Nov 20, 8:11 AM
Marostegui edited projects, added decommission, ops-codfw; removed DBA.
Marostegui updated the task description.
Restricted Application added a project: Operations. · Wed, Nov 20, 8:11 AM

Host ready for @Papaul to take over

papaul@asw-d-codfw# show | compare 
[edit interfaces interface-range vlan-private1-d-codfw]
-    member ge-6/0/9;
[edit interfaces interface-range disabled]
     member ge-6/0/1 { ... }
+    member ge-6/0/9;
[edit interfaces]
-   ge-6/0/9 {
-       description db2061;
-       enable;
-   }
Papaul updated the task description. · Fri, Nov 22, 1:58 AM

Change 552539 had a related patch set uploaded (by Papaul; owner: Papaul):
[operations/dns@master] DNS: Remove mgnt DNS for db2048 and db2061

https://gerrit.wikimedia.org/r/552539

Change 552539 abandoned by Papaul:
DNS: Remove mgnt DNS for db2048 and db2061

Reason:
Mistakenly removed a lot of DNS entries for other servers in the wmnet file. Have to redo.

https://gerrit.wikimedia.org/r/552539

Change 552542 had a related patch set uploaded (by Papaul; owner: Papaul):
[operations/dns@master] DNS: Remove mgmt DNS for db2048 and db2061

https://gerrit.wikimedia.org/r/552542

Change 552542 merged by Papaul:
[operations/dns@master] DNS: Remove mgmt DNS for db2048 and db2061

https://gerrit.wikimedia.org/r/552542

Papaul closed this task as Resolved. · Fri, Nov 22, 11:18 PM
Papaul updated the task description.

Complete

Volans reopened this task as Open. · Sat, Nov 30, 6:29 PM
Volans added a subscriber: Volans.

Netbox status is currently Decommissioning; if the host has been unracked, it should be Offline.
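The check described in this comment could be expressed roughly as follows. This is a sketch only: both function names are hypothetical, the lowercase status strings simply mirror the two states named above, and nothing here talks to Netbox itself.

```python
def expected_netbox_status(unracked: bool) -> str:
    """Status a decommissioned host should carry, per the comment above."""
    # Unracked hosts should be Offline; hosts still in the rack stay Decommissioning.
    return "offline" if unracked else "decommissioning"


def status_mismatch(current_status: str, unracked: bool) -> bool:
    """True when the recorded status disagrees with the physical state."""
    return current_status.strip().lower() != expected_netbox_status(unracked)
```

For this task, a recorded status of Decommissioning on an unracked host would be reported as a mismatch.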

Papaul closed this task as Resolved.Sat, Nov 30, 6:40 PM