Page MenuHomePhabricator

rack/setup/install labnet100[34]
Closed, ResolvedPublic

Description

This task will track the receiving, setup, and installation of two new labnet servers, labnet1003 and labnet1004, ordered on T163822.

Racking Proposal: These have to be in the labs-hosts subnet to be with the other labs hosts, similar to where labnet1001 and labnet1002 are racked. Both of those older systems are out of warranty, and my understanding is these two systems will replace them. The old ones will have to stay online until these have completed service implementation. So rack in any free space/power/network 1Gbps rack.

We may want to have someone in cloud services confirm that these MUST be in the row B, sharing the vlan/subnet with the virt hosts. I imagine it does, but if it doesn't, we should spread these to other rows.

labnet1003:

  • - receive in system on T163822
  • - rack system with proposed racking plan (see above) & update racktables (include all system info plus location)
  • - bios/drac/serial setup/testing
  • - mgmt dns entries added for both asset tag and hostname
  • - production dns entries added (labs-hosts1-b-eqiad subnet)
  • - network port setup (description, enable, labs-hosts1-b-eqiad vlan)
  • - operations/puppet update (install_server at minimum, other files if possible) https://gerrit.wikimedia.org/r/#/c/365270/
  • - OS installation
  • - puppet/salt accept/initial run
  • - handoff to @chasemp for service implementation

labnet1004:

  • - receive in system on T163822
  • - rack system with proposed racking plan (see above) & update racktables (include all system info plus location)
  • - bios/drac/serial setup/testing
  • - mgmt dns entries added for both asset tag and hostname
  • - production dns entries added (labs-hosts1-b-eqiad subnet)
  • - network port setup (description, enable, labs-hosts1-b-eqiad vlan)
  • - operations/puppet update (install_server at minimum, other files if possible) https://gerrit.wikimedia.org/r/#/c/365270/
  • - OS installation
  • - puppet/salt accept/initial run
  • - handoff to @chasemp for service implementation

Event Timeline

Change 364795 had a related patch set uploaded (by Cmjohnson; owner: Cmjohnson):
[operations/dns@master] Adding production and mgmt ip address for labnet1003/1004 T165779

https://gerrit.wikimedia.org/r/364795

Change 364795 merged by Cmjohnson:
[operations/dns@master] Adding production and mgmt ip address for labnet1003/1004 T165779

https://gerrit.wikimedia.org/r/364795

@RobH can you take this from here please.

update from irc chat with @chasemp: please install these hosts with jessie.

This comment was removed by RobH.

Change 365270 had a related patch set uploaded (by RobH; owner: RobH):
[operations/puppet@production] setting labnet100[34] install params

https://gerrit.wikimedia.org/r/365270

Change 365270 merged by RobH:
[operations/puppet@production] setting labnet100[34] install params

https://gerrit.wikimedia.org/r/365270

The switch shows:

ge-7/0/9        up    up   labnet1003
ge-8/0/12       up    up   labnet1004

But when attempting to PXE boot both labnet100[34] I get the following on their serial console:

QLogic UNDI PXE-2.1 v7.14.5
Copyright (C) 2016 QLogic Corporation
Copyright (C) 1997-2000 Intel Corporation
All rights reserved.
PXE-E61: Media test failure, check cable                                       
PXE-M0F: Exiting QLogic PXE ROM.

Perhaps these both have their eth1 plugged in, rather than eth0?

Chris, can you check this out? It seems odd that both give the exact same error, but you racked them both at the same time so I'm hoping its just eth1 plugged in instead.

@RobH These came with 10G nics, the nics have been disabled.

RobH removed a project: ops-eqiad.
RobH updated the task description. (Show Details)

These are both all setup and ready for cloud team to take over. Assigned to @chasemp for followup.

Change 397635 had a related patch set uploaded (by Dzahn; owner: Dzahn):
[operations/puppet@production] labnet100[34]: use role::spare until used

https://gerrit.wikimedia.org/r/397635

Change 397635 merged by Dzahn:
[operations/puppet@production] labnet100[34]: use role::spare until used

https://gerrit.wikimedia.org/r/397635

Note T193196 is related for next phases here but this is racked/stack/imaged