Page MenuHomePhabricator

rack/setup/instal (4) CI ganeti nodes
Open, NormalPublic0 Story Points

Description

Please note that we ordered 14 total ganeti nodes via procurement task T214088.

The breakdown is as follows: eqiad: (4) ganeti refresh + (4) ganeti nodes for CI (releng) + (6) eqiad: ganeti nodes for expansion

This breakdown makes it seem like 10 of these will go into the general ganeti pool, and 4 will go for CI/release engineering use.

This task will track the racking and setup of the 4 CI/releng ganeti nodes.

Hostname Proposal: <no clue what to call these, but infrastructure naming conventions wikitech page will need updating if its a new hostname>

Racking Proposal: Unclear what the racking/service redundancy will be. Do these need to be in 4 different rows or just 4 different racks?

ganeti1019 checklist:

  • - receive in system on procurement task T214088
  • - rack system with proposed racking plan (see above) & update netbox (include all system info plus location, state of planned) D3/u36
  • - bios/drac/serial setup/testing
  • - mgmt dns entries added for both asset tag and hostname
  • - network port setup (description, enable, vlan)
    • end on-site specific steps
  • - production dns entries added
  • - operations/puppet update (install_server at minimum, other files if possible)
  • - OS installation
  • - puppet accept/initial run (with role:spare)
  • - host state in netbox set to staged
  • - handoff for service implementation
  • - service implementer changes from 'staged' status to 'active' status in netbox'

ganeti1020 checklist:

  • - receive in system on procurement task T214088
  • - rack system with proposed racking plan (see above) & update netbox (include all system info plus location, state of planned) D5/u39
  • - bios/drac/serial setup/testing
  • - mgmt dns entries added for both asset tag and hostname
  • - network port setup (description, enable, vlan)
    • end on-site specific steps
  • - production dns entries added
  • - operations/puppet update (install_server at minimum, other files if possible)
  • - OS installation
  • - puppet accept/initial run (with role:spare)
  • - host state in netbox set to staged
  • - handoff for service implementation
  • - service implementer changes from 'staged' status to 'active' status in netbox'

ganeti1021 checklist:

  • - receive in system on procurement task T214088
  • - rack system with proposed racking plan (see above) & update netbox (include all system info plus location, state of planned)D8/u39
  • - bios/drac/serial setup/testing
  • - mgmt dns entries added for both asset tag and hostname
  • - network port setup (description, enable, vlan)
    • end on-site specific steps
  • - production dns entries added
  • - operations/puppet update (install_server at minimum, other files if possible)
  • - OS installation
  • - puppet accept/initial run (with role:spare)
  • - host state in netbox set to staged
  • - handoff for service implementation
  • - service implementer changes from 'staged' status to 'active' status in netbox'

ganeti1022 checklist:

  • - receive in system on procurement task T214088
  • - rack system with proposed racking plan (see above) & update netbox (include all system info plus location, state of planned)D8/u38
  • - bios/drac/serial setup/testing
  • - mgmt dns entries added for both asset tag and hostname
  • - network port setup (description, enable, vlan)
    • end on-site specific steps
  • - production dns entries added
  • - operations/puppet update (install_server at minimum, other files if possible)
  • - OS installation
  • - puppet accept/initial run (with role:spare)
  • - host state in netbox set to staged
  • - handoff for service implementation
  • - service implementer changes from 'staged' status to 'active' status in netbox'

Event Timeline

RobH triaged this task as Normal priority.Jul 24 2019, 7:03 PM
RobH created this task.

@akosiaris,

Are you involved in this project, and if so would you be the one to provide details for this? Please comment and assign back to me for followup, thanks!

RobH added a parent task: Unknown Object (Task).Jul 24 2019, 7:04 PM
RobH moved this task from Backlog to Racking Tasks on the ops-eqiad board.

@RobH, @Cmjohnson

Despite the designation as CI, we will be treating these uniformly as far as ganeti goes (we will handling the capacity allocations within ganeti) so:

  • Single rack row and one that is not ganeti populated yet. Preferably row D (especially since the nodes in T228924 go to row B) and spread out across the 1G racks.
  • Naming: ganeti1019-ganeti2023

The partman recipe should be partman/ganeti-raid5.cfg

@Jclark-ctr Please rack 4 of the servers from the same ganeti stack in row D and label them as ganeti1019-1022. Please update netbox, and provide access switch port info.

Cmjohnson reassigned this task from akosiaris to Jclark-ctr.Aug 13 2019, 4:37 PM
Cmjohnson added a subscriber: akosiaris.
Cmjohnson updated the task description. (Show Details)Aug 14 2019, 1:44 PM
Jclark-ctr updated the task description. (Show Details)Aug 14 2019, 1:49 PM

@Jclark-ctr

ganeti1019 10.65.5.114
ganeti1020 10.65.5.115
ganeti1021 10.65.5.116
ganeti1022 10.65.5.117

entered ip addresses in IDRAC and set password ganeti10[19...22]