Page MenuHomePhabricator

Q3:rack/setup/install bast2003
Closed, ResolvedPublic

Description

This task will track the racking, setup, and OS installation of bast2003

Hostname / Racking / Installation Details

Hostnames: bast2003
Racking Proposal: Replacing bast2002, no restrictions.
Networking Setup: # of Connections:1, Speed:1G, Vlan:Public, AAAA records:Y, Additional IP records (Cassandra)?: N
Partitioning/Raid: HW Raid:N, Partman recipe and/or desired Raid Level: sw raid 1 standard 2 dev
OS Distro: Bullseye
Sub-team Technical Contact: Moritz/JBond

Per host setup checklist

Each host should have its own setup checklist copied and pasted into the list below.

bast2003: RACK: D8-U18 Port: 17

Related Objects

StatusSubtypeAssignedTask
ResolvedPapaul

Event Timeline

RobH added a parent task: Unknown Object (Task).Apr 7 2023, 1:56 PM
RobH moved this task from Backlog to Racking Tasks on the ops-codfw board.

Change 910558 had a related patch set uploaded (by Papaul; author: Papaul):

[operations/puppet@production] Add bast2003 to site.pp

https://gerrit.wikimedia.org/r/910558

Change 910558 merged by Papaul:

[operations/puppet@production] Add bast2003 to site.pp

https://gerrit.wikimedia.org/r/910558

Cookbook cookbooks.sre.hosts.reimage was started by pt1979@cumin2002 for host bast2003.wikimedia.org with OS bullseye

Cookbook cookbooks.sre.hosts.reimage started by pt1979@cumin2002 for host bast2003.wikimedia.org with OS bullseye completed:

  • bast2003 (PASS)
    • Removed from Puppet and PuppetDB if present
    • Deleted any existing Puppet certificate
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Host up (new fresh bullseye OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202304201724_pt1979_4124136_bast2003.out
    • Checked BIOS boot parameters are back to normal
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Icinga status is optimal
    • Icinga downtime removed
    • Updated Netbox data from PuppetDB
    • Updated Netbox status planned -> active
    • The sre.puppet.sync-netbox-hiera cookbook was run successfully
Papaul claimed this task.
Papaul updated the task description. (Show Details)
Papaul added subscribers: MoritzMuehlenhoff, Papaul.

Cookbook cookbooks.sre.hosts.reimage was started by jmm@cumin2002 for host bast2003.wikimedia.org with OS bookworm

Cookbook cookbooks.sre.hosts.reimage started by jmm@cumin2002 for host bast2003.wikimedia.org with OS bookworm completed:

  • bast2003 (WARN)
    • Downtimed on Icinga/Alertmanager
    • Disabled Puppet
    • Removed from Puppet and PuppetDB if present
    • Deleted any existing Puppet certificate
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Host up (new fresh bookworm OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • Removed previous downtime on Alertmanager (old OS)
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202305040633_jmm_2267133_bast2003.out
    • Checked BIOS boot parameters are back to normal
    • Unable to run puppet on puppetmaster2001.codfw.wmnet,puppetmaster1001.eqiad.wmnet to update configmaster.wikimedia.org with the new host SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Icinga status is optimal
    • Icinga downtime removed
    • Updated Netbox data from PuppetDB

Change 915363 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[operations/puppet@production] Assign bastion role to bast2003

https://gerrit.wikimedia.org/r/915363

Change 915363 merged by Muehlenhoff:

[operations/puppet@production] Assign bastion role to bast2003

https://gerrit.wikimedia.org/r/915363

Change 915631 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[operations/puppet@production] Add bast2003 to Bastion hiera settings

https://gerrit.wikimedia.org/r/915631

Change 915639 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[operations/debs/wmf-sre-laptop@master] wmf-laptop-sre: Add bast2003

https://gerrit.wikimedia.org/r/915639

Change 915631 merged by Muehlenhoff:

[operations/puppet@production] Add bast2003 to Bastion hiera settings

https://gerrit.wikimedia.org/r/915631

Change 915639 merged by Muehlenhoff:

[operations/debs/wmf-sre-laptop@master] wmf-laptop-sre: Add bast2003

https://gerrit.wikimedia.org/r/915639

Change 921247 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[operations/puppet@production] Remove bast2002 from bastion hiera entries

https://gerrit.wikimedia.org/r/921247

Change 921247 merged by Muehlenhoff:

[operations/puppet@production] Remove bast2002 from bastion hiera entries

https://gerrit.wikimedia.org/r/921247