This task will track the racking, setup, and OS installation of dumpsdata100[67].
Please note this is also the first order with the PERC H750 controller, so support testing of this will be required/checked and updated back to T297913 during installation.
== Hostname / Racking / Installation Details ==
**Hostnames:** dumpsdata100[67]
**Racking Proposal:** Prefer E &F, but if those racks aren't available by the time the servers have arrived, try to avoid more than 2 dumpsdata hosts (new and current) in a rack.
**Networking/Subnet/VLAN/IP:** Internal vlan, 10G ports if possible, otherwise split one 10G port between them as was done for dumpsdata1004,5.
**Partitioning/Raid:** Let's see which configuration we order first before I can say which partman recipe we need.
**OS Distro:** Buster
== Per host setup checklist ==
Each host should have its own setup checklist copied and pasted into the list below.
==== dumpsdata1006
[x] - receive in system on #procurement task T297151 & in coupa
[x] - rack system with proposed racking plan (see above) & update netbox (include all system info plus location, state of planned)
[x] - add mgmt dns (asset tag and hostname) and production dns entries in netbox, run cookbook sre.dns.netbox.
[x] - network port setup via netbox, run homer to commit
[x] - bios/drac/serial setup/testing, see [[ https://wikitech.wikimedia.org/wiki/Server_Lifecycle#Requested_-%3E_Planned_additional_steps_&_Spare_-%3E_Planned | Lifecycle Steps ]] & [[ https://wikitech.wikimedia.org/wiki/SRE/Dc-operations/Platform-specific_documentation/Dell_Documentation#Automatic_setup | Automatic BIOS setup details ]]
[x] - firmware update (idrac, bios, network, raid controller)
[x] - operations/puppet update - this should include updates to netboot.pp, and site.pp role(insetup) or cp systems use role(insetup::nofirm).
[] - OS installation & initital puppet run via sre.hosts.reimage cookbook.
[x] - update T297913 with results of pass/fail of PERC H750 controller
==== dumpsdata1007
[x] - receive in system on #procurement task T297151 & in coupa
[x] - rack system with proposed racking plan (see above) & update netbox (include all system info plus location, state of planned)
[x] - add mgmt dns (asset tag and hostname) and production dns entries in netbox, run cookbook sre.dns.netbox.
[x] - network port setup via netbox, run homer to commit
[x] - bios/drac/serial setup/testing, see [[ https://wikitech.wikimedia.org/wiki/Server_Lifecycle#Requested_-%3E_Planned_additional_steps_&_Spare_-%3E_Planned | Lifecycle Steps ]] & [[ https://wikitech.wikimedia.org/wiki/SRE/Dc-operations/Platform-specific_documentation/Dell_Documentation#Automatic_setup | Automatic BIOS setup details ]]
[x] - firmware update (idrac, bios, network, raid controller)
[x] - operations/puppet update - this should include updates to netboot.pp, and site.pp role(insetup) or cp systems use role(insetup::nofirm).
[] - OS installation & initital puppet run via sre.hosts.reimage cookbook. - failing due to puppet failure on megacli monitoring commands, needs update to monitoring
[x] - update T297913 with results of pass/fail of PERC H750 controller
== post install puppet run failure due to raid monitoring ==
T297913#8038261
>>! In T297913#8038261, RobH wrote:
>>>! In T297913#8038091, MoritzMuehlenhoff wrote:
>>>>! In T297913#8038074, RobH wrote:
>>> So post dumpsdata1007 install it fails puppet due to megaraid monitoring items it seems?
>>
>> That's expected, we still need to adapt the "raid" fact in Puppet so that it installs perccli (but for that we needed a running system with Perc controller, so that we can figure out the device names which allow Puppet to detect the controller). Just leave the system in that state and we'll use dumpsdata1007 for that?
>
> Works for me, I'll put this comment reference on the setup task there. Thanks!