This task will track the racking, setup, and OS installation of an-worker11[42-48].eqiad.wmnet
Hostname / Racking / Installation Details
Hostnames: an-worker11[42-48].eqiad.wmnet
Racking Proposal: Distributed across racks and rows in eqiad please, for resilience.
I've included the current Hadoop worker distribution below, in case it helps to avoid clustering the nodes.
Networking/Subnet/VLAN/IP: A single 10 Gbps connection per host please. Analytics VLAN. No special IP requirements.
Partitioning/Raid: We will use an existing partmen recipe: partman/custom/analytics-flex.cfg
OS Distro: Buster
Additional Information to Support Racking Configuration
Currently the row distribution for hadoop workers is as follows:
23 eqiad A 19 eqiad B 22 eqiad C 20 eqiad D
Taking into account rows and racks, the current distribution is as follows:
1 eqiad A 1 6 eqiad A 2 2 eqiad A 3 6 eqiad A 4 2 eqiad A 5 6 eqiad A 7 5 eqiad B 2 1 eqiad B 3 5 eqiad B 4 5 eqiad B 7 3 eqiad B 8 5 eqiad C 2 4 eqiad C 3 7 eqiad C 4 5 eqiad C 7 1 eqiad C 8 6 eqiad D 2 5 eqiad D 4 2 eqiad D 5 6 eqiad D 7 1 eqiad D 8
Also, in case it's helpful to know, the 12 nodes being refreshed (and therefore to be decommissioned) are in the following racks:
1 eqiad A 1 2 eqiad A 3 3 eqiad B 8 3 eqiad C 3 2 eqiad D 2 1 eqiad D 8
Please let me know if you'd like any further information.
Per host setup checklist
Each host should have its own setup checklist copied and pasted into the list below.
an-worker1142:
- - receive in system on procurement task T292002 & in coupa
- - rack system with proposed racking plan (see above) & update netbox (include all system info plus location, state of planned)
- - bios/drac/serial setup/testing
- - add mgmt dns (asset tag and hostname) and production dns entries in netbox, run cookbook sre.dns.netbox.
- - network port setup via netbox, run homer to commit
- - firmware update (idrac, bios, network, raid controller)
- - operations/puppet update - this should include updates to install_server dhcp and netboot, and site.pp role(insetup) or cp systems use role(insetup::nofirm).
- - OS installation & initital puppet run via wmf-auto-reimage or wmf-auto-reimage-host
- - host state in netbox set to staged
an-worker1143:
- - receive in system on procurement task T292002 & in coupa
- - rack system with proposed racking plan (see above) & update netbox (include all system info plus location, state of planned)
- - bios/drac/serial setup/testing
- - add mgmt dns (asset tag and hostname) and production dns entries in netbox, run cookbook sre.dns.netbox.
- - network port setup via netbox, run homer to commit
- - firmware update (idrac, bios, network, raid controller)
- - operations/puppet update - this should include updates to install_server dhcp and netboot, and site.pp role(insetup) or cp systems use role(insetup::nofirm).
- - OS installation & initital puppet run via wmf-auto-reimage or wmf-auto-reimage-host
- - host state in netbox set to staged
an-worker1144:
- - receive in system on procurement task T292002 & in coupa
- - rack system with proposed racking plan (see above) & update netbox (include all system info plus location, state of planned)
- - bios/drac/serial setup/testing
- - add mgmt dns (asset tag and hostname) and production dns entries in netbox, run cookbook sre.dns.netbox.
- - network port setup via netbox, run homer to commit
- - firmware update (idrac, bios, network, raid controller)
- - operations/puppet update - this should include updates to install_server dhcp and netboot, and site.pp role(insetup) or cp systems use role(insetup::nofirm).
- - OS installation & initital puppet run via wmf-auto-reimage or wmf-auto-reimage-host
- - host state in netbox set to staged
an-worker1145:
- - receive in system on procurement task T292002 & in coupa
- - rack system with proposed racking plan (see above) & update netbox (include all system info plus location, state of planned)
- - bios/drac/serial setup/testing
- - add mgmt dns (asset tag and hostname) and production dns entries in netbox, run cookbook sre.dns.netbox.
- - network port setup via netbox, run homer to commit
- - firmware update (idrac, bios, network, raid controller)
- - operations/puppet update - this should include updates to install_server dhcp and netboot, and site.pp role(insetup) or cp systems use role(insetup::nofirm).
- - OS installation & initital puppet run via wmf-auto-reimage or wmf-auto-reimage-host
- - host state in netbox set to staged
an-worker1146:
- - receive in system on procurement task T292002 & in coupa
- - rack system with proposed racking plan (see above) & update netbox (include all system info plus location, state of planned)
- - bios/drac/serial setup/testing
- - add mgmt dns (asset tag and hostname) and production dns entries in netbox, run cookbook sre.dns.netbox.
- - network port setup via netbox, run homer to commit
- - firmware update (idrac, bios, network, raid controller)
- - operations/puppet update - this should include updates to install_server dhcp and netboot, and site.pp role(insetup) or cp systems use role(insetup::nofirm).
- - OS installation & initital puppet run via wmf-auto-reimage or wmf-auto-reimage-host
- - host state in netbox set to staged
an-worker1147:
- - receive in system on procurement task T292002 & in coupa
- - rack system with proposed racking plan (see above) & update netbox (include all system info plus location, state of planned)
- - bios/drac/serial setup/testing
- - add mgmt dns (asset tag and hostname) and production dns entries in netbox, run cookbook sre.dns.netbox.
- - network port setup via netbox, run homer to commit
- - firmware update (idrac, bios, network, raid controller)
- - operations/puppet update - this should include updates to install_server dhcp and netboot, and site.pp role(insetup) or cp systems use role(insetup::nofirm).
- - OS installation & initital puppet run via wmf-auto-reimage or wmf-auto-reimage-host
- - host state in netbox set to staged
an-worker1148:
- - receive in system on procurement task T292002 & in coupa
- - rack system with proposed racking plan (see above) & update netbox (include all system info plus location, state of planned)
- - bios/drac/serial setup/testing
- - add mgmt dns (asset tag and hostname) and production dns entries in netbox, run cookbook sre.dns.netbox.
- - network port setup via netbox, run homer to commit
- - firmware update (idrac, bios, network, raid controller)
- - operations/puppet update - this should include updates to install_server dhcp and netboot, and site.pp role(insetup) or cp systems use role(insetup::nofirm).
- - OS installation & initital puppet run via wmf-auto-reimage or wmf-auto-reimage-host
- - host state in netbox set to staged
Once the system(s) above have had all checkbox steps completed, this task can be resolved.