Page MenuHomePhabricator

Productionize pc8
Closed, ResolvedPublic

Description

  • pc1018
  • pc2018

Event Timeline

Marostegui triaged this task as Medium priority.May 14 2025, 5:48 AM
Marostegui moved this task from Triage to Blocked on the DBA board.

pc1018 has been installed - waiting for pc2018

[12:53:19] marostegui@cumin1002:~$ sudo cumin 'pc1018*' 'lvextend -L+1000G /dev/mapper/tank-data ; xfs_growfs /srv ; df -hT /srv ; pvs'
1 hosts will be targeted:
pc1018.eqiad.wmnet
OK to proceed on 1 hosts? Enter the number of affected hosts to confirm or "q" to quit: 1
----- OUTPUT of 'lvextend -L+1000...f -hT /srv ; pvs' -----
  Size of logical volume tank/data changed from <7.56 TiB (1981022 extents) to 8.53 TiB (2237022 extents).
  Logical volume tank/data successfully resized.
meta-data=/dev/mapper/tank-data  isize=512    agcount=32, agsize=63392704 blks
         =                       sectsz=512   attr=2, projid32bit=1
         =                       crc=1        finobt=1, sparse=1, rmapbt=0
         =                       reflink=1    bigtime=1 inobtcount=1 nrext64=0
data     =                       bsize=4096   blocks=2028566528, imaxpct=5
         =                       sunit=64     swidth=256 blks
naming   =version 2              bsize=4096   ascii-ci=0, ftype=1
log      =internal log           bsize=4096   blocks=521728, version=2
         =                       sectsz=512   sunit=64 blks, lazy-count=1
realtime =none                   extsz=4096   blocks=0, rtextents=0
data blocks changed from 2028566528 to 2290710528
Filesystem            Type  Size  Used Avail Use% Mounted on
/dev/mapper/tank-data xfs   8.6T   61G  8.5T   1% /srv
  PV         VG   Fmt  Attr PSize  PFree
  /dev/sda3  tank lvm2 a--  <8.69t 156.30g
================
PASS |█████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 100% (1/1) [00:00<00:00,  1.32hosts/s]
FAIL |                                                                                                                                                                                                                                         |   0% (0/1) [00:00<?, ?hosts/s]
100.0% (1/1) success ratio (>= 100.0% threshold) for command: 'lvextend -L+1000...f -hT /srv ; pvs'.
100.0% (1/1) success ratio (>= 100.0% threshold) of nodes successfully executed all commands.

Mentioned in SAL (#wikimedia-operations) [2025-05-15T05:07:24Z] <marostegui@cumin1002> dbctl commit (dc=all): 'Depool pc7 T394260', diff saved to https://phabricator.wikimedia.org/P76162 and previous config saved to /var/cache/conftool/dbconfig/20250515-050724-marostegui.json

Change #1146175 had a related patch set uploaded (by Marostegui; author: Marostegui):

[operations/puppet@production] dbconfig.schema: Add pc8

https://gerrit.wikimedia.org/r/1146175

Change #1146175 merged by Marostegui:

[operations/puppet@production] dbconfig.schema: Add pc8

https://gerrit.wikimedia.org/r/1146175

Change #1146176 had a related patch set uploaded (by Marostegui; author: Marostegui):

[operations/puppet@production] valid_section.pp: Add pc8

https://gerrit.wikimedia.org/r/1146176

Change #1146176 merged by Marostegui:

[operations/puppet@production] valid_section.pp: Add pc8

https://gerrit.wikimedia.org/r/1146176

Change #1146184 had a related patch set uploaded (by Marostegui; author: Marostegui):

[operations/puppet@production] pc1018: Productionize

https://gerrit.wikimedia.org/r/1146184

Change #1146184 merged by Marostegui:

[operations/puppet@production] pc1018: Productionize

https://gerrit.wikimedia.org/r/1146184

Mentioned in SAL (#wikimedia-operations) [2025-05-15T08:52:57Z] <marostegui@cumin1002> dbctl commit (dc=all): 'Repool pc7 T394260', diff saved to https://phabricator.wikimedia.org/P76208 and previous config saved to /var/cache/conftool/dbconfig/20250515-085256-marostegui.json

pc1018 has been recloned and emptied. Host added to zarcillo, pc8 added as a section and pc1018 as eqiad's master.

pc8 now showing up in orchestrator: https://orchestrator.wikimedia.org/web/cluster/alias/pc8
pending pc2018 which isn't yet racked T393110

Change #1146871 had a related patch set uploaded (by Marostegui; author: Marostegui):

[operations/puppet@production] sections.yaml: Add pc8

https://gerrit.wikimedia.org/r/1146871

Change #1146871 merged by Marostegui:

[operations/puppet@production] sections.yaml: Add pc8

https://gerrit.wikimedia.org/r/1146871

Change #1148786 had a related patch set uploaded (by Marostegui; author: Marostegui):

[operations/puppet@production] installserver: Do not reimage pc1018

https://gerrit.wikimedia.org/r/1148786

Change #1148786 merged by Marostegui:

[operations/puppet@production] installserver: Do not reimage pc1018

https://gerrit.wikimedia.org/r/1148786

Change #1148998 had a related patch set uploaded (by Marostegui; author: Marostegui):

[operations/puppet@production] mariadb: Productionize pc2018

https://gerrit.wikimedia.org/r/1148998

Change #1148998 merged by Marostegui:

[operations/puppet@production] mariadb: Productionize pc2018

https://gerrit.wikimedia.org/r/1148998

Change #1148999 had a related patch set uploaded (by Marostegui; author: Marostegui):

[operations/puppet@production] instances.yaml: Add pc1018,pc2018

https://gerrit.wikimedia.org/r/1148999

Change #1148999 merged by Marostegui:

[operations/puppet@production] instances.yaml: Add pc1018,pc2018

https://gerrit.wikimedia.org/r/1148999

Change #1149000 had a related patch set uploaded (by Marostegui; author: Marostegui):

[operations/puppet@production] instance.schema: Add pc8

https://gerrit.wikimedia.org/r/1149000

Change #1149000 merged by Marostegui:

[operations/puppet@production] instance.schema: Add pc8

https://gerrit.wikimedia.org/r/1149000

Mentioned in SAL (#wikimedia-operations) [2025-05-22T05:26:49Z] <marostegui@cumin1002> dbctl commit (dc=all): 'Add pc1018 and pc2018 to dbctl depooled T394260', diff saved to https://phabricator.wikimedia.org/P76372 and previous config saved to /var/cache/conftool/dbconfig/20250522-052649-marostegui.json

Both hosts added to dbctl.
Waiting for confirmation whether I can enable pc8 in dbctl.

Mentioned in SAL (#wikimedia-operations) [2025-05-22T09:50:17Z] <marostegui@cumin1002> dbctl commit (dc=all): 'Add pc8 T394260', diff saved to https://phabricator.wikimedia.org/P76390 and previous config saved to /var/cache/conftool/dbconfig/20250522-095017-marostegui.json

Mentioned in SAL (#wikimedia-operations) [2025-05-22T09:50:35Z] <marostegui> dbmaint codfw eqiad Pool pc8 new section T394260

Change #1149337 had a related patch set uploaded (by Marostegui; author: Marostegui):

[operations/puppet@production] pc1018,pc2018: Enable notifications

https://gerrit.wikimedia.org/r/1149337

Change #1149337 merged by Marostegui:

[operations/puppet@production] pc1018,pc2018: Enable notifications

https://gerrit.wikimedia.org/r/1149337

The section is receiving traffic now. Waiting to make sure grafana works well before resolving this.

Change #1149344 had a related patch set uploaded (by Ladsgroup; author: Amir Sarabadani):

[operations/puppet@production] parsercachepurging: Use for loop

https://gerrit.wikimedia.org/r/1149344

Change #1149344 merged by Ladsgroup:

[operations/puppet@production] parsercachepurging: Use for loop

https://gerrit.wikimedia.org/r/1149344

Change #1149353 had a related patch set uploaded (by Ladsgroup; author: Amir Sarabadani):

[operations/puppet@production] parsercachepurging: Add pc8

https://gerrit.wikimedia.org/r/1149353

Change #1149353 merged by Ladsgroup:

[operations/puppet@production] parsercachepurging: Add pc8

https://gerrit.wikimedia.org/r/1149353

Change #1149539 had a related patch set uploaded (by Marostegui; author: Marostegui):

[operations/dns@master] wmnet: Add pc8-master CNAME

https://gerrit.wikimedia.org/r/1149539

Change #1149539 merged by Marostegui:

[operations/dns@master] wmnet: Add pc8-master CNAME

https://gerrit.wikimedia.org/r/1149539