(2020-08-15) rack/setup/install dbprov1003.eqiad.wmnet
Closed, ResolvedPublic

Description

This task will track the racking, setup, and OS installation of dbprov1003.eqiad.wmnet

Hostname / Racking / Installation Details

Hostnames: dbprov1003
Racking Proposal: For redundancy, should not share a row with the existing hosts in A7 & B7. If that is not possible, it should at least not share a rack with them. 10G required.
Networking/Subnet/VLAN/IP: Production network (where production MySQL lives); 10G required for high-throughput transfers and backup restores to MySQL servers.
Partitioning/Raid: RAID 6 across the HDDs (create first so it becomes the first virtual drive, sda); RAID 0 across the SSDs (create second so it becomes the second virtual drive, sdb). Both use the same options as the db hosts: write-back cache with a 256K stripe.
OS Distro: Buster
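
The racking preference above can be read as a two-tier rule. A minimal sketch, assuming rack names such as A7 consist of a row letter followed by a rack number; the function name and the tier labels are hypothetical and not part of any existing tooling:

```python
def placement_tier(candidate, existing=("A7", "B7")):
    """Rank a candidate rack against the racks of the existing hosts.

    Assumption: a rack name is a row letter followed by a rack number,
    e.g. "C3" is row C, rack 3.
    """
    existing_rows = {rack[0] for rack in existing}
    if candidate[0] not in existing_rows:
        return "preferred"   # different row from every existing host
    if candidate not in existing:
        return "acceptable"  # same row as an existing host, different rack
    return "rejected"        # same rack as an existing host
```

Under these assumptions, a rack in row C or D would be preferred, another rack in row A or B would be the fallback, and A7 or B7 themselves would be ruled out.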

Per host setup checklist

dbprov1003.eqiad.wmnet:

  • - receive in system on procurement task T257547 & in coupa
  • - rack system with proposed racking plan (see above) & update netbox (include all system info plus location, state of planned)
  • - bios/drac/serial setup/testing
  • - mgmt dns entries added for both asset tag and hostname
  • - network port setup (description, enable, vlan)
    • end on-site specific steps
  • - production dns entries added
  • - operations/puppet update (install_server at minimum, other files if possible)
  • - OS installation
  • - puppet accept/initial run (with role:spare)
  • - host state in netbox set to staged

Once the system(s) above have had all checkbox steps completed, this task can be resolved.

Event Timeline

RobH renamed this task from (<enter due date here>) rack/setup/install dbprov1003.eqiad.wmnet to (2020-09-30) rack/setup/install dbprov1003.eqiad.wmnet. Jul 23 2020, 8:20 PM
RobH added a parent task: Unknown Object (Task).
RobH unsubscribed.
RobH renamed this task from (2020-09-30) rack/setup/install dbprov1003.eqiad.wmnet to (2020-09-14) rack/setup/install dbprov1003.eqiad.wmnet. Jul 23 2020, 8:22 PM
RobH moved this task from Triage to Blocked external/Not db team on the DBA board.
RobH moved this task from Backlog to Acknowledged on the SRE board.

Change 618555 had a related patch set uploaded (by Jcrespo; owner: Jcrespo):
[operations/puppet@production] mariadb-backups: Add dbprov[12]003 to the db.cfg partman recipe list

https://gerrit.wikimedia.org/r/618555

Change 618555 merged by Jcrespo:
[operations/puppet@production] mariadb-backups: Add dbprov[12]003 to the db.cfg partman recipe list

https://gerrit.wikimedia.org/r/618555

wiki_willy renamed this task from (2020-09-14) rack/setup/install dbprov1003.eqiad.wmnet to (2020-08-15) rack/setup/install dbprov1003.eqiad.wmnet. Aug 10 2020, 4:17 PM
wiki_willy assigned this task to Cmjohnson.

Change 621959 had a related patch set uploaded (by Cmjohnson; owner: Cmjohnson):
[operations/dns@master] Adding production dns for dbprov1003

https://gerrit.wikimedia.org/r/621959

Change 621959 merged by Cmjohnson:
[operations/dns@master] Adding production dns for dbprov1003

https://gerrit.wikimedia.org/r/621959

Change 621960 had a related patch set uploaded (by Cmjohnson; owner: Cmjohnson):
[operations/puppet@production] Adding dbprov1003 mac address to dhcp file

https://gerrit.wikimedia.org/r/621960

Change 621960 merged by Cmjohnson:
[operations/puppet@production] Adding dbprov1003 mac address to dhcp file

https://gerrit.wikimedia.org/r/621960

Change 621961 had a related patch set uploaded (by Cmjohnson; owner: Cmjohnson):
[operations/puppet@production] Updating site.pp file for dbprov1003 switched it to role:insetup to be safe

https://gerrit.wikimedia.org/r/621961

Change 621961 merged by Cmjohnson:
[operations/puppet@production] Updating site.pp file for dbprov1003 switched it to role:insetup to be safe

https://gerrit.wikimedia.org/r/621961

Script wmf-auto-reimage was launched by cmjohnson on cumin1001.eqiad.wmnet for hosts:

dbprov1003.eqiad.wmnet

The log can be found in /var/log/wmf-auto-reimage/202008231944_cmjohnson_3546_dbprov1003_eqiad_wmnet.log.

Completed auto-reimage of hosts:

['dbprov1003.eqiad.wmnet']

Of which those FAILED:

['dbprov1003.eqiad.wmnet']

Script wmf-auto-reimage was launched by cmjohnson on cumin1001.eqiad.wmnet for hosts:

dbprov1003.eqiad.wmnet

The log can be found in /var/log/wmf-auto-reimage/202008232000_cmjohnson_5915_dbprov1003_eqiad_wmnet.log.

Completed auto-reimage of hosts:

['dbprov1003.eqiad.wmnet']

Of which those FAILED:

['dbprov1003.eqiad.wmnet']

Change 621964 had a related patch set uploaded (by Cmjohnson; owner: Cmjohnson):
[operations/puppet@production] Fixing dbprov1003 site.pp

https://gerrit.wikimedia.org/r/621964

Change 621964 merged by Cmjohnson:
[operations/puppet@production] Fixing dbprov1003 site.pp

https://gerrit.wikimedia.org/r/621964

Script wmf-auto-reimage was launched by cmjohnson on cumin1001.eqiad.wmnet for hosts:

dbprov1003.eqiad.wmnet

The log can be found in /var/log/wmf-auto-reimage/202008232022_cmjohnson_10221_dbprov1003_eqiad_wmnet.log.

Completed auto-reimage of hosts:

['dbprov1003.eqiad.wmnet']

and were ALL successful.

Cmjohnson added a subscriber: jcrespo.

@jcrespo This server is ready for you. I updated the site.pp role to insetup, since I didn't want to install it straight into the mariadb role. Resolving this task; the on-site portion has been completed.

Thanks, Chris, this unblocks us a lot! This is exactly what I would have wanted. Thank you again!