Page MenuHomePhabricator

(Need By: 2021-03-31) rack/setup/install ms-backup200[12]
Closed, ResolvedPublic

Description

This task will track the racking, setup, and OS installation of ms-backup200[12]

Hostname / Racking / Installation Details

Hostnames: ms-backup2001, ms-backup2002
Racking Proposal: Redundant to each other, preferentially different rows, it not, different racks, as to have power and network redundancy.
Networking/Subnet/VLAN/IP: 10G, single production link, single mgmt link.
Partitioning/Raid: Software RAID1 between disks is enough.
OS Distro: Buster

Per host setup checklist

Each host should have its own setup checklist copied and pasted into the list below.

ms-backup2001: C4: U6 xe-4/0/3

  • - receive in system on procurement task T272029 & in coupa
  • - rack system with proposed racking plan (see above) & update netbox (include all system info plus location, state of planned)
  • - bios/drac/serial setup/testing
  • - add mgmt dns (asset tag and hostname) and production dns entries in netbox, run cookbook sre.dns.netbox.
  • - network port setup via netbox, run homer to commit
    • end on-site specific steps
  • - operations/puppet update - this should include updates to install_server dhcp and netboot, and site.pp role(insetup) or cp systems use role(insetup::nofirm).
  • - OS installation & initital puppet run via wmf-auto-reimage or wmf-auto-reimage-host
  • - host state in netbox set to staged

ms-backup2002: D2; U4 xe-2/0/2

  • - receive in system on procurement task T272029 & in coupa
  • - rack system with proposed racking plan (see above) & update netbox (include all system info plus location, state of planned)
  • - bios/drac/serial setup/testing
  • - add mgmt dns (asset tag and hostname) and production dns entries in netbox, run cookbook sre.dns.netbox.
  • - network port setup via netbox, run homer to commit
    • end on-site specific steps
  • - operations/puppet update - this should include updates to install_server dhcp and netboot, and site.pp role(insetup) or cp systems use role(insetup::nofirm).
  • - OS installation & initital puppet run via wmf-auto-reimage or wmf-auto-reimage-host
  • - host state in netbox set to staged

Once the system(s) above have had all checkbox steps completed, this task can be resolved.

Event Timeline

RobH mentioned this in Unknown Object (Task).
RobH added a parent task: Unknown Object (Task).Feb 8 2021, 10:02 PM
RobH removed a subscriber: RobH.
RobH renamed this task from (Need By: TBD) rack/setup/install ms-backup200[12] to (Need By: 2021-03-31) rack/setup/install ms-backup200[12].Feb 8 2021, 10:11 PM
RobH added subscribers: Jclark-ctr, jcrespo, RobH, Papaul.

@jcrespo: Conversation between you and Arzhel on T272018 seems to indicate some kind of discussion is still pending for these to determine where they can be racked? They just need to be in different 10G racks than one another as far as I can tell, please advise on this task and assign to @Jclark-ctr once you have done so, as this is now pending order.

Same thing here but please comment and assign to Papaul

Internal production vlan, same as ms-fe hosts.

Change 668145 had a related patch set uploaded (by Papaul; owner: Papaul):
[operations/puppet@production] Add ms-backup200[1-2] MAC address and partman recipe

https://gerrit.wikimedia.org/r/668145

Change 668145 merged by Papaul:
[operations/puppet@production] Add ms-backup200[1-2] MAC address and partman recipe

https://gerrit.wikimedia.org/r/668145

Change 668171 had a related patch set uploaded (by Papaul; owner: Papaul):
[operations/puppet@production] Add ms-backup200[1-2] to site.pp

https://gerrit.wikimedia.org/r/668171

Change 668171 merged by Papaul:
[operations/puppet@production] Add ms-backup200[1-2] to site.pp

https://gerrit.wikimedia.org/r/668171

Script wmf-auto-reimage was launched by pt1979 on cumin2001.codfw.wmnet for hosts:

ms-backup2001.codfw.wmnet

The log can be found in /var/log/wmf-auto-reimage/202103031830_pt1979_20811_ms-backup2001_codfw_wmnet.log.

Completed auto-reimage of hosts:

['ms-backup2001.codfw.wmnet']

and were ALL successful.

Script wmf-auto-reimage was launched by pt1979 on cumin2001.codfw.wmnet for hosts:

ms-backup2002.codfw.wmnet

The log can be found in /var/log/wmf-auto-reimage/202103031917_pt1979_31148_ms-backup2002_codfw_wmnet.log.

Completed auto-reimage of hosts:

['ms-backup2002.codfw.wmnet']

and were ALL successful.

Papaul updated the task description. (Show Details)

@jcrespo this is complete