Page MenuHomePhabricator

(Need By: 2021-04-30) rack/setup/install wcqs200[123]
Closed, ResolvedPublic

Description

This task will track the racking, setup, and OS installation of wcqs200[123]

Hostname / Racking / Installation Details

Hostnames: wcqs200[123]
Racking Proposal: the 3 servers should be in different rows
Networking/Subnet/VLAN/IP: 1G, single port, production VLAN
Partitioning/Raid: S/W RAID10 - partman: raid10-4dev.cfg
OS Distro: Buster

Per host setup checklist

wcqs2001: B1 U21 ge-1/0/26

  • - receive in system on procurement task T274642 & in coupa
  • - rack system with proposed racking plan (see above) & update netbox (include all system info plus location, state of planned)
  • - bios/drac/serial setup/testing
  • - add mgmt dns (asset tag and hostname) and production dns entries in netbox, run cookbook sre.dns.netbox.
  • - network port setup via netbox, run homer to commit
    • end on-site specific steps
  • - update bios and idrac firmware
  • - operations/puppet update - this should include updates to install_server dhcp and netboot, and site.pp role(insetup) or cp systems use role(insetup::nofirm).
  • - OS installation & initital puppet run via wmf-auto-reimage or wmf-auto-reimage-host
  • - host state in netbox set to staged

    wcqs2002: C1U3 ge-1/0/15
  • - receive in system on procurement task T274642 & in coupa
  • - rack system with proposed racking plan (see above) & update netbox (include all system info plus location, state of planned)
  • - bios/drac/serial setup/testing
  • - add mgmt dns (asset tag and hostname) and production dns entries in netbox, run cookbook sre.dns.netbox.
  • - network port setup via netbox, run homer to commit
    • end on-site specific steps
  • - update bios and idrac firmware
  • - operations/puppet update - this should include updates to install_server dhcp and netboot, and site.pp role(insetup) or cp systems use role(insetup::nofirm).
  • - OS installation & initital puppet run via wmf-auto-reimage or wmf-auto-reimage-host
  • - host state in netbox set to staged

    wcqs2003: D1U6 ge-1/0/5
  • - receive in system on procurement task T274642 & in coupa
  • - rack system with proposed racking plan (see above) & update netbox (include all system info plus location, state of planned)
  • - bios/drac/serial setup/testing
  • - add mgmt dns (asset tag and hostname) and production dns entries in netbox, run cookbook sre.dns.netbox.
  • - network port setup via netbox, run homer to commit
    • end on-site specific steps
  • - update bios and idrac firmware
  • - operations/puppet update - this should include updates to install_server dhcp and netboot, and site.pp role(insetup) or cp systems use role(insetup::nofirm).
  • - OS installation & initital puppet run via wmf-auto-reimage or wmf-auto-reimage-host
  • - host state in netbox set to staged

Once the system(s) above have had all checkbox steps completed, this task can be resolved.

Event Timeline

RobH added a parent task: Unknown Object (Task).
RobH moved this task from Backlog to Racking Tasks on the ops-codfw board.
RobH unsubscribed.
RobH renamed this task from (Need By: TBD) rack/setup/install wcqs200[123] to (Need By: 2021-04-30) rack/setup/install wcqs200[123].Mar 6 2021, 12:19 AM

@Gehel each server has 4x1.9TB disks. I want to make sure we are doing both HW RAID 10 and SW RAID 10 .
Thanks

@Gehel each server has 4x1.9TB disks. I want to make sure we are doing both HW RAID 10 and SW RAID 10 .

Nope, we should do just S/W RAID 10, using raid10-4dev.cfg.

Papaul updated the task description. (Show Details)

Change 675929 had a related patch set uploaded (by Papaul; author: Papaul):

[operations/puppet@production] Add MAC Address and partman recipe for wcqs200[1-3]

https://gerrit.wikimedia.org/r/675929

Change 675929 merged by Papaul:

[operations/puppet@production] Add MAC Address and partman recipe for wcqs200[1-3]

https://gerrit.wikimedia.org/r/675929

Change 675931 had a related patch set uploaded (by Papaul; author: Papaul):

[operations/puppet@production] Add wcqd200[1-3] to site.pp

https://gerrit.wikimedia.org/r/675931

Change 675931 merged by Papaul:

[operations/puppet@production] Add wcqs200[1-3] to site.pp

https://gerrit.wikimedia.org/r/675931

Script wmf-auto-reimage was launched by pt1979 on cumin2001.codfw.wmnet for hosts:

wcqs2001.codfw.wmnet

The log can be found in /var/log/wmf-auto-reimage/202103310048_pt1979_11998_wcqs2001_codfw_wmnet.log.

Completed auto-reimage of hosts:

['wcqs2001.codfw.wmnet']

and were ALL successful.

Script wmf-auto-reimage was launched by pt1979 on cumin2001.codfw.wmnet for hosts:

wcqs2002.codfw.wmnet

The log can be found in /var/log/wmf-auto-reimage/202103310117_pt1979_19062_wcqs2002_codfw_wmnet.log.

Completed auto-reimage of hosts:

['wcqs2002.codfw.wmnet']

and were ALL successful.

Script wmf-auto-reimage was launched by pt1979 on cumin2001.codfw.wmnet for hosts:

wcqs2003.codfw.wmnet

The log can be found in /var/log/wmf-auto-reimage/202103310146_pt1979_25329_wcqs2003_codfw_wmnet.log.

Completed auto-reimage of hosts:

['wcqs2003.codfw.wmnet']

and were ALL successful.

Papaul updated the task description. (Show Details)

@Gehel This is ready