Page MenuHomePhabricator

decom cp40(09|1[078])
Closed, ResolvedPublic

Description

This task will track the decomission of old cp systems in ulsfo, cp40(09|1[078])

cp4009:

  • - all system services confirmed offline from production use
  • - set all icinga checks to maint mode/disabled while reclaim/decommmission takes place.
  • - remove system from all lvs/pybal active configuration
  • - any service group puppet/hiera/dsh config removed
  • - remove site.pp (replace with role::spare if system isn't shut down immediately during this process.)

START NON-INTERRUPPTABLE STEPS

  • - disable puppet on host
  • - remove all remaining puppet references (include role::spare)
  • - power down host
  • - disable switch port
  • - switch port assignment noted on this task (for later removal)
  • - remove production dns entries
  • - puppet node clean, puppet node deactivate, salt key removed

END NON-INTERRUPPTABLE STEPS

  • - system disks wiped (by onsite)
  • - switch port configration removed
  • - mgmt dns entries removed.

cp4010:

  • - all system services confirmed offline from production use
  • - set all icinga checks to maint mode/disabled while reclaim/decommmission takes place.
  • - remove system from all lvs/pybal active configuration
  • - any service group puppet/hiera/dsh config removed
  • - remove site.pp (replace with role::spare if system isn't shut down immediately during this process.)

START NON-INTERRUPPTABLE STEPS

  • - disable puppet on host
  • - remove all remaining puppet references (include role::spare)
  • - power down host
  • - disable switch port
  • - switch port assignment noted on this task (for later removal)
  • - remove production dns entries
  • - puppet node clean, puppet node deactivate, salt key removed

END NON-INTERRUPPTABLE STEPS

  • - system disks wiped (by onsite)
  • - system disks wiped (by onsite)
  • - switch port configration removed
  • - mgmt dns entries removed.

cp4017:

  • - all system services confirmed offline from production use
  • - set all icinga checks to maint mode/disabled while reclaim/decommmission takes place.
  • - remove system from all lvs/pybal active configuration
  • - any service group puppet/hiera/dsh config removed
  • - remove site.pp (replace with role::spare if system isn't shut down immediately during this process.)

START NON-INTERRUPPTABLE STEPS

  • - disable puppet on host
  • - remove all remaining puppet references (include role::spare)
  • - power down host
  • - disable switch port
  • - switch port assignment noted on this task (for later removal)
  • - remove production dns entries
  • - puppet node clean, puppet node deactivate, salt key removed

END NON-INTERRUPPTABLE STEPS

  • - system disks wiped (by onsite)
  • - system disks wiped (by onsite)
  • - switch port configration removed
  • - mgmt dns entries removed.

cp4018:

  • - all system services confirmed offline from production use
  • - set all icinga checks to maint mode/disabled while reclaim/decommmission takes place.
  • - remove system from all lvs/pybal active configuration
  • - any service group puppet/hiera/dsh config removed
  • - remove site.pp (replace with role::spare if system isn't shut down immediately during this process.)

START NON-INTERRUPPTABLE STEPS

  • - disable puppet on host
  • - remove all remaining puppet references (include role::spare)
  • - power down host
  • - disable switch port
  • - switch port assignment noted on this task (for later removal)
  • - remove production dns entries
  • - puppet node clean, puppet node deactivate, salt key removed

END NON-INTERRUPPTABLE STEPS

  • - system disks wiped (by onsite)
  • - system disks wiped (by onsite)
  • - switch port configration removed
  • - mgmt dns entries removed.

Related Objects

Event Timeline

RobH updated the task description. (Show Details)
RobH updated the task description. (Show Details)
RobH moved this task from Backlog to Decommission on the ops-ulsfo board.
RobH changed the task status from Open to Stalled.Nov 17 2017, 6:19 PM
RobH removed RobH as the assignee of this task.Dec 14 2017, 7:27 PM

cp4009 and cp4018 (also also cp4013) are marked as removed from puppet, but still show in https://servermon.wikimedia.org/hosts/, that usually means that "puppet deactivate" was forgotten.

I've run clean + deactivate for cp4018 as part of cleanup of stale puppet certs.

Change 454241 had a related patch set uploaded (by Muehlenhoff; owner: Muehlenhoff):
[operations/puppet@production] Remove stray cp4 hosts from site.pp

https://gerrit.wikimedia.org/r/454241

Change 454241 merged by Muehlenhoff:
[operations/puppet@production] Remove stray cp4 hosts from site.pp

https://gerrit.wikimedia.org/r/454241

Change 478176 had a related patch set uploaded (by BBlack; owner: BBlack):
[operations/dns@master] Remove various dead cp4005-20 DNS entries

https://gerrit.wikimedia.org/r/478176

Change 478176 merged by BBlack:
[operations/dns@master] Remove various dead cp4005-20 DNS entries

https://gerrit.wikimedia.org/r/478176

Change 493094 had a related patch set uploaded (by BBlack; owner: BBlack):
[operations/dns@master] Remove dead ulsfo cp servers

https://gerrit.wikimedia.org/r/493094

Change 493094 merged by BBlack:
[operations/dns@master] Remove dead ulsfo cp servers

https://gerrit.wikimedia.org/r/493094

RobH mentioned this in Unknown Object (Task).Jun 25 2019, 4:20 PM
RobH removed a project: Patch-For-Review.
RobH updated the task description. (Show Details)

Change 570659 had a related patch set uploaded (by Vgutierrez; owner: Vgutierrez):
[operations/puppet@production] install_server: Remove already decommissioned cp40[09,10,17,18]

https://gerrit.wikimedia.org/r/570659

Change 570659 merged by Vgutierrez:
[operations/puppet@production] install_server: Remove already decommissioned cp40[09,10,17,18]

https://gerrit.wikimedia.org/r/570659