Page MenuHomePhabricator

(Need By: 2021-04-30) rack/setup/install wcqs100[123]
Closed, ResolvedPublic

Description

This task will track the racking, setup, and OS installation of wcqs100[123]

Hostname / Racking / Installation Details

Hostnames: wcqs1001, wcqs1002, wcqs1003
Racking Proposal: the 3 servers should be in different rows
Networking/Subnet/VLAN/IP: 1G, single port, production VLAN
Partitioning/Raid: S/W RAID10 - partman: raid10-4dev.cfg
OS Distro: Buster

Per host setup checklist

wcqs1001:

  • - receive in system on procurement task T274389 & in coupa
  • - rack system with proposed racking plan (see above) & update netbox (include all system info plus location, state of planned)
  • - bios/drac/serial setup/testing
  • - add mgmt dns (asset tag and hostname) and production dns entries in netbox, run cookbook sre.dns.netbox.
  • - network port setup via netbox, run homer to commit
  • - bios and idrac firmware updated
  • - operations/puppet update - this should include updates to install_server dhcp and netboot, and site.pp role(insetup) or cp systems use role(insetup::nofirm). - https://gerrit.wikimedia.org/r/682704
  • - OS installation & initital puppet run via wmf-auto-reimage or wmf-auto-reimage-host
  • - host state in netbox set to staged

wcqs1002:

  • - receive in system on procurement task T274389 & in coupa
  • - rack system with proposed racking plan (see above) & update netbox (include all system info plus location, state of planned)
  • - bios/drac/serial setup/testing
  • - add mgmt dns (asset tag and hostname) and production dns entries in netbox, run cookbook sre.dns.netbox.
  • - network port setup via netbox, run homer to commit
  • - bios and idrac firmware updated
  • - operations/puppet update - this should include updates to install_server dhcp and netboot, and site.pp role(insetup) or cp systems use role(insetup::nofirm). - https://gerrit.wikimedia.org/r/682704
  • - OS installation & initital puppet run via wmf-auto-reimage or wmf-auto-reimage-host
  • - host state in netbox set to staged

wcqs1003:

  • - receive in system on procurement task T274389 & in coupa
  • - rack system with proposed racking plan (see above) & update netbox (include all system info plus location, state of planned)
  • - bios/drac/serial setup/testing
  • - add mgmt dns (asset tag and hostname) and production dns entries in netbox, run cookbook sre.dns.netbox.
  • - network port setup via netbox, run homer to commit
  • - bios and idrac firmware updated
  • - operations/puppet update - this should include updates to install_server dhcp and netboot, and site.pp role(insetup) or cp systems use role(insetup::nofirm). - https://gerrit.wikimedia.org/r/682704
  • - OS installation & initital puppet run via wmf-auto-reimage or wmf-auto-reimage-host
  • - host state in netbox set to staged

Once the system(s) above have had all checkbox steps completed, this task can be resolved.

Event Timeline

RobH created this task.
RobH mentioned this in Unknown Object (Task).
RobH added a parent task: Unknown Object (Task).
RobH moved this task from Backlog to Racking Tasks on the ops-eqiad board.
RobH unsubscribed.
RobH renamed this task from (Need By: TBD) rack/setup/install wcqs100[123] to (Need By: 2021-04-30) rack/setup/install wcqs100[123].Mar 6 2021, 12:19 AM

wcqs1001. A3. U37 Port 26 ID3962
wcqs1002 B1. U34 Port 19 ID2950
wcqs1003. C4. U17 Port 15 ID1844

Cmjohnson subscribed.

@Jclark-ctr netbox script ran for wcqs1001 and 1002. I'm not sure why 1003 is in C4, that's a 10G rack. If it is can you please move to a standard rack please.

@Cmjohnson asw2-a-eqiad and asw2-b-eqiad have outstanding changes, please make sure to commit them.

Changes for 1 devices: ['asw2-a-eqiad.mgmt.eqiad.wmnet']

[edit interfaces interface-range disabled]
-    member ge-3/0/26;
[edit interfaces interface-range vlan-private1-a-eqiad]
     member ge-3/0/24 { ... }
+    member ge-3/0/26;
     member ge-3/0/29 { ... }
[edit interfaces]
+   ge-3/0/26 {
+       description "wcqs1001 {#3962}";
+   }

Changes for 1 devices: ['asw2-b-eqiad.mgmt.eqiad.wmnet']

[edit interfaces interface-range disabled]
-    member ge-1/0/19;
[edit interfaces interface-range vlan-private1-b-eqiad]
     member ge-1/0/16 { ... }
+    member ge-1/0/19;
     member ge-1/0/22 { ... }
[edit interfaces]
+   ge-1/0/19 {
+       description "wcqs1002 {#2950}";
+   }

@Cmjohnson my mistake it was C5 for 1003. netbox was correct though

Cmjohnson updated the task description. (Show Details)
Cmjohnson added a subscriber: RobH.

Assigning this to @RobH to complete install

Change 682704 had a related patch set uploaded (by RobH; author: RobH):

[operations/puppet@production] wcqs100[123] setup info

https://gerrit.wikimedia.org/r/682704

Change 682704 merged by RobH:

[operations/puppet@production] wcqs100[123] setup info

https://gerrit.wikimedia.org/r/682704

Script wmf-auto-reimage was launched by robh on cumin1001.eqiad.wmnet for hosts:

['wcqs1001.eqiad.wmnet', 'wcqs1002.eqiad.wmnet', 'wcqs1003.eqiad.wmnet']

The log can be found in /var/log/wmf-auto-reimage/202104261830_robh_12667.log.

Completed auto-reimage of hosts:

['wcqs1001.eqiad.wmnet', 'wcqs1002.eqiad.wmnet', 'wcqs1003.eqiad.wmnet']

and were ALL successful.

RobH removed a project: Patch-For-Review.
RobH updated the task description. (Show Details)