Page MenuHomePhabricator

Hardware Automation Workflow - Overall Tracking
Open, MediumPublic

Description

This task will outline the overall steps needed (and link sub-tasks) for actionable items from the operations teams breakout group on Hardare provisioning & automation.

The notes of the session are listed on https://office.wikimedia.org/wiki/Operations/Operations_Meeting_Notes/2015-10-12_Ops_Offsite/hardware-automation-workflow

Action Items:

  • servermon fully in production (big one)
  • easier allocation of IPs for management (in dns.git)
    • this is easier for codfw, since its properly ordered. eqiad is ordered by service group and horrible to work with.
      • we should clean up and renumber eqiad when this is automated.
    • the same concept can be applied to add/remove machines from puppet.git (e.g. dhcp entries)
  • PXE-boot linux image to run administration tasks
  • investigate ssh keys for idrac/ilo
    • related {T113557}
  • lock mgmt vlan from non ops bastions
    • {T79294} OLD rtimport task for this.

Additionally, there has long been discussion about scripting all mgmt tasks.

Event Timeline

RobH claimed this task.
RobH raised the priority of this task from to Medium.
RobH updated the task description. (Show Details)
RobH added a project: acl*sre-team.
RobH added subscribers: RobH, fgiunchedi, Cmjohnson, Papaul.
RobH set Security to None.
ayounsi closed subtask Restricted Task as Resolved.Jul 12 2018, 4:54 PM
RobH removed RobH as the assignee of this task.Mar 18 2020, 5:15 PM
LSobanski subscribed.

Clinic Duty drive-by tagging.