Page MenuHomePhabricator

rack/setup/install an-master100[12].eqiad.wmnet
Closed, ResolvedPublic

Description

This task will track the racking, setup, and installation of two new hosts, ordered on T196079 to replace analytics100[12].

Racking Proposal: The master/standby should be located in different rows for maximum horizontal redundancy. Place them in different rows from one another.

an-master1001.eqiad.wmnet:

  • - receive in system on procurement task T196079
  • - hostname determined and hostname label applied (non-blocker to other onsite steps)
  • - rack system with proposed racking plan (see above) & update racktables (include all system info plus location)
  • - bios/drac/serial setup/testing
  • - mgmt dns entries added for both asset tag and hostname (if determined)
  • - network port setup (description hostname or asset tag, enable, vlan same as analytics100[12])
    • end on-site specific steps
  • - production dns entries added
  • - operations/puppet update (install_server at minimum, other files if possible)
  • - OS installation (stretch)
  • - puppet accept/initial run
  • - handoff for service implementation

an-master1002.eqiad.wmnet

  • - receive in system on procurement task T196079
  • - hostname determined and hostname label applied (non-blocker to other onsite steps)
  • - rack system with proposed racking plan (see above) & update racktables (include all system info plus location)
  • - bios/drac/serial setup/testing
  • - mgmt dns entries added for both asset tag and hostname (if determined)
  • - network port setup (description hostname or asset tag, enable, vlan same as analytics100[12])
    • end on-site specific steps
  • - production dns entries added
  • - operations/puppet update (install_server at minimum, other files if possible)
  • - OS installation (stretch)
  • - puppet accept/initial run
  • - handoff for service implementation

Event Timeline

RobH triaged this task as Medium priority.Aug 14 2018, 4:42 PM
RobH created this task.
This comment was removed by RobH.

Hm, tough question! I'd be ok with analytics-master1001 and analytics-master1002. Let's do it!

RobH renamed this task from rack/setup/install 2 new hadoop master/standby systems in eqiad to rack/setup/install analytics-master100[12].eqiad.wmnet.Aug 14 2018, 4:58 PM
RobH reassigned this task from Ottomata to Cmjohnson.
RobH updated the task description. (Show Details)
Cmjohnson updated the task description. (Show Details)

@Ottomata the name is entirely too long for labels and tracking. can we shorten it a bit?

@Ottomata the name is entirely too long for labels and tracking. can we shorten it a bit?

This was discussed in IRC. So the hostname labels that I've purchased for onsite use in the past is a bit too short. The solution is we'll order newer, longer hostname labels for those hostnames on the longer side. For now, the software will use the longer analytics-master1XXX hostnames, but the visible label field, and the actual hostname label, will use an abbreviated version. Racktables will tie them together, as does the asset tag mgmt entry with the long hostname.

Change 453161 had a related patch set uploaded (by Cmjohnson; owner: Cmjohnson):
[operations/dns@master] Adding mgmt dns for analyticsmaster1001-2

https://gerrit.wikimedia.org/r/453161

Change 453161 merged by Cmjohnson:
[operations/dns@master] Adding mgmt dns for analyticsmaster1001-2

https://gerrit.wikimedia.org/r/453161

Cmjohnson renamed this task from rack/setup/install analytics-master100[12].eqiad.wmnet to rack/setup/install analyticsmaster100[12].eqiad.wmnet.Aug 21 2018, 4:37 PM

@Cmjohnson hi! A couple of (probably stupid) questions:

  • are the final node names analytics-master100[12] or analyticsmaster100[12]? I saw the mgmt entries and I am a bit confused..
  • are the two hosts racked in two different rows?

@elukey they are analyticsmaster100x I think during a discussion with ottomata we agreed to remove the hyphen. They are racked in 2 different rows (A5, B8)

@Cmjohnson Rob's latest comment is what I thought we agreed to:

The solution is we'll order newer, longer hostname labels for those hostnames on the longer side. For now, the software will use the longer analytics-master1XXX hostnames

Would much prefer the hyphen for consistency.

I'll handle the dns changes to change the hostname, the label won't change for now since the longer name won't fit.

Chris: Go ahead and finish the onsite steps and task this over to me for installation. I'll handle the hostname stuff then!

@RobH network ports are setup and in analytics vlan

A6 ge-6/0/15 up up analytics-master1001
B8 ge-8/0/21 up up analytics-master1002

Cmjohnson renamed this task from rack/setup/install analyticsmaster100[12].eqiad.wmnet to rack/setup/install analytics-master100[12].eqiad.wmnet.Aug 30 2018, 4:45 PM

@RobH we thought to schedule the maintenance window to swap analytics100[1,2] with analytics-master100[1,2] for Sept 22nd, and I'd like to send a heads up to some mailing list today/tomorrow. As far as I can see there are no more blockers in this task except finalizing the hosts' production status, but please let me know if I am mistaken (so I'll amend the maintenance plans). Thanks :)

@RobH @Cmjohnson any chance that we can get this done this week? I am asking since we have an important maintenance window scheduled for Tuesday, if you guys are busy I'll reschedule for later on.

Ok, hostname update.

We need to change the port descriptions, rack table entries, and physical labels on these to an-master100X, not analytics-master100X.

RobH renamed this task from rack/setup/install analytics-master100[12].eqiad.wmnet to rack/setup/install an-master100[12].eqiad.wmnet.Sep 20 2018, 5:13 PM
RobH claimed this task.
RobH updated the task description. (Show Details)

Change 461692 had a related patch set uploaded (by RobH; owner: RobH):
[operations/dns@master] updating an-master100[12] dns entries

https://gerrit.wikimedia.org/r/461692

Change 461692 merged by RobH:
[operations/dns@master] updating an-master100[12] dns entries

https://gerrit.wikimedia.org/r/461692

Change 461695 had a related patch set uploaded (by RobH; owner: RobH):
[operations/puppet@production] setting base install params for an-master100[12]

https://gerrit.wikimedia.org/r/461695

Change 461695 merged by RobH:
[operations/puppet@production] setting base install params for an-master100[12]

https://gerrit.wikimedia.org/r/461695

Change 461698 had a related patch set uploaded (by RobH; owner: RobH):
[operations/dns@master] correcting an-master100[12] production dns

https://gerrit.wikimedia.org/r/461698

Change 461698 merged by RobH:
[operations/dns@master] correcting an-master100[12] production dns

https://gerrit.wikimedia.org/r/461698

RobH removed RobH as the assignee of this task.Sep 20 2018, 6:38 PM
RobH removed projects: Patch-For-Review, ops-eqiad.
RobH updated the task description. (Show Details)

Change 461729 had a related patch set uploaded (by Elukey; owner: Elukey):
[operations/puppet@production] Add IPv6 interface to an-master100[1,2]

https://gerrit.wikimedia.org/r/461729

Change 461729 merged by Elukey:
[operations/puppet@production] Add IPv6 interface to an-master100[1,2]

https://gerrit.wikimedia.org/r/461729

Change 461969 had a related patch set uploaded (by Elukey; owner: Elukey):
[operations/dns@master] Add IPv6 PTRs for an-master100[1,2]

https://gerrit.wikimedia.org/r/461969

Change 461969 merged by Elukey:
[operations/dns@master] Add IPv6 PTRs for an-master100[1,2]

https://gerrit.wikimedia.org/r/461969

elukey updated the task description. (Show Details)