
Q2: (Need By: TBD) rack/setup/install kubestage100[34].eqiad.wmnet
Closed, Resolved (Public)

Description

This task will track the racking, setup, and OS installation of kubestage100[34].eqiad.wmnet

Hostname / Racking / Installation Details

Hostnames: kubestage100[34]
Racking Proposal: These replace kubestage100[12], so place each host in a different row from the other, in 1G racks.
Networking/Subnet/VLAN/IP: single 1G prod connection to private1-vlan
Partitioning/Raid: sw raid, standard raid1-2dev
OS Distro: Stretch (cannot run Buster at this time)
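
The bracket notation `kubestage100[34]` used throughout this task stands for two hosts. A minimal sketch of expanding it into fully qualified names follows; the helper name `expand_hosts` is hypothetical, and it only handles a single plain set of characters (no ranges such as `[1-4]`).

```python
import re


def expand_hosts(pattern: str, domain: str = "eqiad.wmnet") -> list[str]:
    """Expand one bracket set, e.g. 'kubestage100[34]', into FQDNs.

    Hypothetical helper for illustration; not part of any WMF tooling.
    """
    match = re.fullmatch(
        r"(?P<prefix>[^\[\]]*)\[(?P<chars>[^\[\]]+)\](?P<suffix>[^\[\]]*)",
        pattern,
    )
    if match is None:
        # No bracket set: treat the pattern as one literal hostname.
        return [f"{pattern}.{domain}"]
    # Each character inside the brackets yields one hostname.
    return [
        f"{match['prefix']}{char}{match['suffix']}.{domain}"
        for char in match["chars"]
    ]


print(expand_hosts("kubestage100[34]"))
```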

Per host setup checklist

Each host should have its own setup checklist copied and pasted into the list below.

kubestage1003:

  • - receive in system on procurement task T290199 & in coupa
  • - rack system with proposed racking plan (see above) & update netbox (include all system info plus location, state of planned)
  • - bios/drac/serial setup/testing
  • - add mgmt dns (asset tag and hostname) and production dns entries in netbox, run cookbook sre.dns.netbox.
  • - network port setup via netbox, run homer to commit
  • - firmware update (idrac, bios, network, raid controller)
  • - operations/puppet update - this should include updates to install_server dhcp and netboot, and a site.pp entry using role(insetup), or role(insetup::nofirm) for cp systems.
  • - OS installation & initial puppet run via wmf-auto-reimage or wmf-auto-reimage-host
  • - host state in netbox set to staged

kubestage1004:

  • - receive in system on procurement task T290199 & in coupa
  • - rack system with proposed racking plan (see above) & update netbox (include all system info plus location, state of planned)
  • - bios/drac/serial setup/testing
  • - add mgmt dns (asset tag and hostname) and production dns entries in netbox, run cookbook sre.dns.netbox.
  • - network port setup via netbox, run homer to commit
  • - firmware update (idrac, bios, network, raid controller)
  • - operations/puppet update - this should include updates to install_server dhcp and netboot, and a site.pp entry using role(insetup), or role(insetup::nofirm) for cp systems.
  • - OS installation & initial puppet run via wmf-auto-reimage or wmf-auto-reimage-host
  • - host state in netbox set to staged

Once the system(s) above have had all checkbox steps completed, this task can be resolved.
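
The resolution rule above can be sketched as a small check: the task is resolvable only when every host has every checklist step marked done. The step labels and the `can_resolve` helper below are hypothetical shorthand for the nine checklist items, not an existing tool.

```python
# Short labels for the nine per-host checklist steps (hypothetical names).
STEPS = [
    "receive",
    "rack",
    "bios_drac_serial",
    "dns",
    "network_port",
    "firmware",
    "puppet",
    "os_install",
    "netbox_staged",
]


def can_resolve(checklists: dict[str, dict[str, bool]]) -> bool:
    """Return True only if every host has completed every step."""
    return all(
        all(done.get(step, False) for step in STEPS)
        for done in checklists.values()
    )


# Example: kubestage1004 has not yet been set to staged in netbox.
checklists = {
    "kubestage1003": {step: True for step in STEPS},
    "kubestage1004": {step: step != "netbox_staged" for step in STEPS},
}
print(can_resolve(checklists))

# Once the last step is ticked, the task can be resolved.
checklists["kubestage1004"]["netbox_staged"] = True
print(can_resolve(checklists))
```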

Event Timeline

RobH moved this task from Backlog to Racking Tasks on the ops-eqiad board.
RobH added a parent task: Unknown Object (Task).
RobH mentioned this in Unknown Object (Task).
RobH renamed this task from Q1: (Need By: TBD) rack/setup/install kubestage100[34].eqiad.wmnet to Q2: (Need By: TBD) rack/setup/install kubestage100[34].eqiad.wmnet. (Sep 13 2021, 4:40 PM)
RobH raised the priority of this task from Medium to High.
Jclark-ctr added a subscriber: Jclark-ctr.

kubestage1003 B1 U21 Port 29 Cableid# 201333910847
kubestage1004 D3 U37 Port 32 Cableid# 1943
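
As a quick check against the racking proposal (each host in a different row), the placements above can be compared by row. This is a minimal sketch that assumes the first character of the rack name (B1, D3) is the row letter; `rows_are_distinct` is a hypothetical helper.

```python
# Rack names as recorded in this task.
placements = {"kubestage1003": "B1", "kubestage1004": "D3"}


def rows_are_distinct(placements: dict[str, str]) -> bool:
    """Return True if no two hosts share a row.

    Assumption: the row is the first character of the rack name.
    """
    rows = [rack[0] for rack in placements.values()]
    return len(rows) == len(set(rows))


print(rows_are_distinct(placements))
```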

Change 728630 had a related patch set uploaded (by Cmjohnson; author: Cmjohnson):

[operations/puppet@production] Adding dhcpd and netboot.cfg for kubestage servers

https://gerrit.wikimedia.org/r/728630

Change 728630 merged by Cmjohnson:

[operations/puppet@production] Adding site.pp, dhcpd and netboot.cfg for kubestage servers

https://gerrit.wikimedia.org/r/728630

Script wmf-auto-reimage was launched by cmjohnson on cumin1001.eqiad.wmnet for hosts:

kubestage1003.eqiad.wmnet

The log can be found in /var/log/wmf-auto-reimage/202110081954_cmjohnson_26603_kubestage1003_eqiad_wmnet.log.

Script wmf-auto-reimage was launched by cmjohnson on cumin1001.eqiad.wmnet for hosts:

kubestage1004.eqiad.wmnet

The log can be found in /var/log/wmf-auto-reimage/202110081954_cmjohnson_26657_kubestage1004_eqiad_wmnet.log.

Completed auto-reimage of hosts:

['kubestage1003.eqiad.wmnet']

and were ALL successful.

Completed auto-reimage of hosts:

['kubestage1004.eqiad.wmnet']

and were ALL successful.

Cmjohnson updated the task description.