This task will track the racking, setup, and OS installation of dumpsdata100[67].
Please note this is also the first order with the PERC H750 controller, so support for it will need to be tested during installation and the results reported back to T297913.
Hostname / Racking / Installation Details
Hostnames: dumpsdata100[67]
Racking Proposal: Prefer E & F, but if those racks aren't available by the time the servers arrive, avoid placing more than 2 dumpsdata hosts (new and current) in any one rack.
Networking/Subnet/VLAN/IP: Internal VLAN, 10G ports if possible, otherwise split one 10G port between them, as was done for dumpsdata1004 and dumpsdata1005.
Partitioning/Raid: The partman recipe depends on which configuration we order, so this will be decided after the order.
OS Distro: Buster
Per host setup checklist
Each host should have its own setup checklist copied and pasted into the list below.
dumpsdata1006
- - receive in system on procurement task T297151 & in coupa
- - rack system with proposed racking plan (see above) & update netbox (include all system info plus location, state of planned)
- - add mgmt dns (asset tag and hostname) and production dns entries in netbox, run cookbook sre.dns.netbox.
- - network port setup via netbox, run homer to commit
- - bios/drac/serial setup/testing, see Lifecycle Steps & Automatic BIOS setup details
- - firmware update (idrac, bios, network, raid controller)
- - operations/puppet update - this should include updates to netboot.pp and site.pp, using role(insetup), or role(insetup::nofirm) for cp systems.
- - OS installation & initial puppet run via sre.hosts.reimage cookbook.
- - update T297913 with results of pass/fail of PERC H750 controller
dumpsdata1007
- - receive in system on procurement task T297151 & in coupa
- - rack system with proposed racking plan (see above) & update netbox (include all system info plus location, state of planned)
- - add mgmt dns (asset tag and hostname) and production dns entries in netbox, run cookbook sre.dns.netbox.
- - network port setup via netbox, run homer to commit
- - bios/drac/serial setup/testing, see Lifecycle Steps & Automatic BIOS setup details
- - firmware update (idrac, bios, network, raid controller)
- - operations/puppet update - this should include updates to netboot.pp and site.pp, using role(insetup), or role(insetup::nofirm) for cp systems.
- - OS installation & initial puppet run via sre.hosts.reimage cookbook. - failing because puppet fails on the megacli monitoring commands; the monitoring needs an update
- - update T297913 with results of pass/fail of PERC H750 controller
== post install puppet run failure due to raid monitoring ==