This task will track the racking, setup, and OS installation of the hosts listed below.
Hostname / Racking / Installation Details
Hostnames: ml-serve1009-1011 (3x), ml-lab1001-1002 (2x), dse-k8s-worker1009 (1x)
Racking Proposal: the ml-serve hosts should have rack diversity with the rest of ml-serve1xxx; the ml-lab hosts can go anywhere, but not both in the same rack; dse-k8s-worker1009 should have rack diversity with the rest of dse-k8s-worker1xxx.
Networking Setup: # of Connections: 1x10G, 1x management
Partitioning/Raid: No HW RAID. ml-serve1xxx should be covered by https://gerrit.wikimedia.org/r/c/operations/puppet/+/1057177; the DSE host will be covered by an upcoming change. For ml-lab we haven't decided yet, so there is no need to install them: having them racked and connected to power (but switched off) is fine.
OS Distro: Bookworm
Sub-team Technical Contact: @klausman
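As a quick sanity check of the racking proposal above, the following is a minimal sketch that flags hosts of the same group sharing a rack. It reads "rack diversity" strictly as "no two hosts with the same hostname prefix in one rack", which is an assumption; the function name `rack_conflicts` and the rack names (`rack-a`, etc.) are made up for illustration and do not reflect real locations or any Netbox API.

```python
from collections import defaultdict


def rack_conflicts(placement, prefixes):
    """Return (prefix, rack, hosts) tuples where hosts sharing a prefix share a rack.

    placement: dict mapping hostname -> rack name
    prefixes:  hostname prefixes that define a diversity group
    """
    by_group_rack = defaultdict(list)
    for host, rack in placement.items():
        for prefix in prefixes:
            if host.startswith(prefix):
                by_group_rack[(prefix, rack)].append(host)
    return [
        (prefix, rack, sorted(hosts))
        for (prefix, rack), hosts in sorted(by_group_rack.items())
        if len(hosts) > 1
    ]


# Hypothetical placement; rack names are placeholders, not real racks.
proposed = {
    "ml-serve1009": "rack-a",
    "ml-serve1010": "rack-b",
    "ml-serve1011": "rack-c",
    "ml-lab1001": "rack-a",
    "ml-lab1002": "rack-a",  # violates "not both in the same rack"
    "dse-k8s-worker1009": "rack-b",
}

conflicts = rack_conflicts(proposed, ["ml-serve1", "ml-lab1", "dse-k8s-worker1"])
for prefix, rack, hosts in conflicts:
    print(f"{prefix}xxx: {', '.join(hosts)} share {rack}")
```

To check against hosts that are already racked, the existing ml-serve1xxx and dse-k8s-worker1xxx placements would need to be added to the `proposed` mapping by hand.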
Per host setup checklist
Each host should have its own setup checklist copied and pasted into the list below.
ml-serve1009
- Receive in system on procurement task T366522 & in Coupa
- Rack system with proposed racking plan (see above) & update Netbox (include all system info plus location, state of planned)
- Run the "Provision a server's network attributes" Netbox script - Note that you must run the DNS and Provision cookbooks after completing this step
- Immediately run the sre.dns.netbox cookbook
- Immediately run the sre.hosts.provision cookbook
- Run the sre.hardware.upgrade-firmware cookbook
- Update the operations/puppet repo - this should include updates to preseed.yaml, and site.pp with roles defined by service group: https://wikitech.wikimedia.org/wiki/SRE/Dc-operations
- Run the sre.hosts.reimage cookbook
ml-serve1010
- Receive in system on procurement task T366522 & in Coupa
- Rack system with proposed racking plan (see above) & update Netbox (include all system info plus location, state of planned)
- Run the "Provision a server's network attributes" Netbox script - Note that you must run the DNS and Provision cookbooks after completing this step
- Immediately run the sre.dns.netbox cookbook
- Immediately run the sre.hosts.provision cookbook
- Run the sre.hardware.upgrade-firmware cookbook
- Update the operations/puppet repo - this should include updates to preseed.yaml, and site.pp with roles defined by service group: https://wikitech.wikimedia.org/wiki/SRE/Dc-operations
- Run the sre.hosts.reimage cookbook
ml-serve1011
- Receive in system on procurement task T366522 & in Coupa
- Rack system with proposed racking plan (see above) & update Netbox (include all system info plus location, state of planned)
- Run the "Provision a server's network attributes" Netbox script - Note that you must run the DNS and Provision cookbooks after completing this step
- Immediately run the sre.dns.netbox cookbook
- Immediately run the sre.hosts.provision cookbook
- Run the sre.hardware.upgrade-firmware cookbook
- Update the operations/puppet repo - this should include updates to preseed.yaml, and site.pp with roles defined by service group: https://wikitech.wikimedia.org/wiki/SRE/Dc-operations
- Run the sre.hosts.reimage cookbook
ml-lab1001
- Receive in system on procurement task T366522 & in Coupa
- Rack system with proposed racking plan (see above) & update Netbox (include all system info plus location, state of planned)
- Run the "Provision a server's network attributes" Netbox script - Note that you must run the DNS and Provision cookbooks after completing this step
- Immediately run the sre.dns.netbox cookbook
- Immediately run the sre.hosts.provision cookbook
- Run the sre.hardware.upgrade-firmware cookbook
- Update the operations/puppet repo - this should include updates to preseed.yaml, and site.pp with roles defined by service group: https://wikitech.wikimedia.org/wiki/SRE/Dc-operations
- Run the sre.hosts.reimage cookbook
ml-lab1002
- Receive in system on procurement task T366522 & in Coupa
- Rack system with proposed racking plan (see above) & update Netbox (include all system info plus location, state of planned)
- Run the "Provision a server's network attributes" Netbox script - Note that you must run the DNS and Provision cookbooks after completing this step
- Immediately run the sre.dns.netbox cookbook
- Immediately run the sre.hosts.provision cookbook
- Run the sre.hardware.upgrade-firmware cookbook
- Update the operations/puppet repo - this should include updates to preseed.yaml, and site.pp with roles defined by service group: https://wikitech.wikimedia.org/wiki/SRE/Dc-operations
- Run the sre.hosts.reimage cookbook
dse-k8s-worker1009
- Receive in system on procurement task T366522 & in Coupa
- Rack system with proposed racking plan (see above) & update Netbox (include all system info plus location, state of planned)
- Run the "Provision a server's network attributes" Netbox script - Note that you must run the DNS and Provision cookbooks after completing this step
- Immediately run the sre.dns.netbox cookbook
- Immediately run the sre.hosts.provision cookbook
- Run the sre.hardware.upgrade-firmware cookbook
- Update the operations/puppet repo - this should include updates to preseed.yaml, and site.pp with roles defined by service group: https://wikitech.wikimedia.org/wiki/SRE/Dc-operations
- Run the sre.hosts.reimage cookbook
