Page MenuHomePhabricator

(Need By: 2021-04-30) rack/setup/install wcqs100[123]
Closed, ResolvedPublic

Description

This task will track the racking, setup, and OS installation of wcqs100[123]

Hostname / Racking / Installation Details

Hostnames: wcqs1001, wcqs1002, wcqs1003
Racking Proposal: the 3 servers should be in different rows
Networking/Subnet/VLAN/IP: 1G, single port, production VLAN
Partitioning/Raid: S/W RAID10 - partman: raid10-4dev.cfg
OS Distro: Buster

Per host setup checklist

wcqs1001:

  • - receive in system on procurement task T274389 & in coupa
  • - rack system with proposed racking plan (see above) & update netbox (include all system info plus location, state of planned)
  • - bios/drac/serial setup/testing
  • - add mgmt dns (asset tag and hostname) and production dns entries in netbox, run cookbook sre.dns.netbox.
  • - network port setup via netbox, run homer to commit
  • - bios and idrac firmware updated
  • - operations/puppet update - this should include updates to install_server dhcp and netboot, and site.pp role(insetup) or cp systems use role(insetup::nofirm). - https://gerrit.wikimedia.org/r/682704
  • - OS installation & initital puppet run via wmf-auto-reimage or wmf-auto-reimage-host
  • - host state in netbox set to staged

wcqs1002:

  • - receive in system on procurement task T274389 & in coupa
  • - rack system with proposed racking plan (see above) & update netbox (include all system info plus location, state of planned)
  • - bios/drac/serial setup/testing
  • - add mgmt dns (asset tag and hostname) and production dns entries in netbox, run cookbook sre.dns.netbox.
  • - network port setup via netbox, run homer to commit
  • - bios and idrac firmware updated
  • - operations/puppet update - this should include updates to install_server dhcp and netboot, and site.pp role(insetup) or cp systems use role(insetup::nofirm). - https://gerrit.wikimedia.org/r/682704
  • - OS installation & initital puppet run via wmf-auto-reimage or wmf-auto-reimage-host
  • - host state in netbox set to staged

wcqs1003:

  • - receive in system on procurement task T274389 & in coupa
  • - rack system with proposed racking plan (see above) & update netbox (include all system info plus location, state of planned)
  • - bios/drac/serial setup/testing
  • - add mgmt dns (asset tag and hostname) and production dns entries in netbox, run cookbook sre.dns.netbox.
  • - network port setup via netbox, run homer to commit
  • - bios and idrac firmware updated
  • - operations/puppet update - this should include updates to install_server dhcp and netboot, and site.pp role(insetup) or cp systems use role(insetup::nofirm). - https://gerrit.wikimedia.org/r/682704
  • - OS installation & initital puppet run via wmf-auto-reimage or wmf-auto-reimage-host
  • - host state in netbox set to staged

Once the system(s) above have had all checkbox steps completed, this task can be resolved.

Details

Related Changes in Gerrit:

Event Timeline

RobH assigned this task to Jclark-ctr.
RobH mentioned this in Unknown Object (Task).
RobH added a parent task: Unknown Object (Task).
RobH moved this task from Backlog to Racking Tasks on the ops-eqiad board.
RobH unsubscribed.
RobH renamed this task from (Need By: TBD) rack/setup/install wcqs100[123] to (Need By: 2021-04-30) rack/setup/install wcqs100[123].Mar 6 2021, 12:19 AM

wcqs1001. A3. U37 Port 26 ID3962
wcqs1002 B1. U34 Port 19 ID2950
wcqs1003. C4. U17 Port 15 ID1844

Cmjohnson subscribed.

@Jclark-ctr netbox script ran for wcqs1001 and 1002. I'm not sure why 1003 is in C4, that's a 10G rack. If it is can you please move to a standard rack please.

@Cmjohnson asw2-a-eqiad and asw2-b-eqiad have outstanding changes, please make sure to commit them.

Changes for 1 devices: ['asw2-a-eqiad.mgmt.eqiad.wmnet']

[edit interfaces interface-range disabled]
-    member ge-3/0/26;
[edit interfaces interface-range vlan-private1-a-eqiad]
     member ge-3/0/24 { ... }
+    member ge-3/0/26;
     member ge-3/0/29 { ... }
[edit interfaces]
+   ge-3/0/26 {
+       description "wcqs1001 {#3962}";
+   }

Changes for 1 devices: ['asw2-b-eqiad.mgmt.eqiad.wmnet']

[edit interfaces interface-range disabled]
-    member ge-1/0/19;
[edit interfaces interface-range vlan-private1-b-eqiad]
     member ge-1/0/16 { ... }
+    member ge-1/0/19;
     member ge-1/0/22 { ... }
[edit interfaces]
+   ge-1/0/19 {
+       description "wcqs1002 {#2950}";
+   }

@Cmjohnson my mistake it was C5 for 1003. netbox was correct though

Cmjohnson updated the task description. (Show Details)
Cmjohnson added a subscriber: RobH.

Assigning this to @RobH to complete install

Change 682704 had a related patch set uploaded (by RobH; author: RobH):

[operations/puppet@production] wcqs100[123] setup info

https://gerrit.wikimedia.org/r/682704

Change 682704 merged by RobH:

[operations/puppet@production] wcqs100[123] setup info

https://gerrit.wikimedia.org/r/682704

Script wmf-auto-reimage was launched by robh on cumin1001.eqiad.wmnet for hosts:

['wcqs1001.eqiad.wmnet', 'wcqs1002.eqiad.wmnet', 'wcqs1003.eqiad.wmnet']

The log can be found in /var/log/wmf-auto-reimage/202104261830_robh_12667.log.

Completed auto-reimage of hosts:

['wcqs1001.eqiad.wmnet', 'wcqs1002.eqiad.wmnet', 'wcqs1003.eqiad.wmnet']

and were ALL successful.

RobH removed a project: Patch-For-Review.
RobH updated the task description. (Show Details)