Page MenuHomePhabricator

Productionize es[2035-2040]
Closed, ResolvedPublic

Description

These will be the next two RW external store.
es4 and es5 will go RO once this ticket is done.

es6

  • es2035 master
  • es2036
  • es2037
  • Pending dbctl configuration
  • etcd.php changes
  • Set candidate masters

es7

  • es2038 master
  • es2039
  • es2040
  • Pending dbctl configuration
  • etcd.php changes
  • Set candidate masters

Event Timeline

There are a very large number of changes, so older changes are hidden. Show Older Changes
Marostegui changed the task status from Open to Stalled.Jan 19 2024, 1:39 PM
Marostegui triaged this task as Medium priority.
Marostegui moved this task from Triage to Blocked on the DBA board.
Marostegui changed the task status from Stalled to Open.Mar 5 2024, 4:55 PM
Marostegui moved this task from Blocked to Ready on the DBA board.
[16:55:08] marostegui@cumin1002:~$ sudo cumin es20[35-40].codfw.wmnet 'lvextend -L+1T /dev/mapper/tank-data ; xfs_growfs /srv/ ; df -hT /srv'
6 hosts will be targeted:
es[2035-2040].codfw.wmnet
OK to proceed on 6 hosts? Enter the number of affected hosts to confirm or "q" to quit: 6
===== NODE GROUP =====
(6) es[2035-2040].codfw.wmnet
----- OUTPUT of 'lvextend -L+1T /...v/ ; df -hT /srv' -----
  Size of logical volume tank/data changed from 9.09 TiB (2384188 extents) to 10.09 TiB (2646332 extents).
  Logical volume tank/data successfully resized.
meta-data=/dev/mapper/tank-data  isize=512    agcount=32, agsize=76294016 blks
         =                       sectsz=512   attr=2, projid32bit=1
         =                       crc=1        finobt=1, sparse=1, rmapbt=0
         =                       reflink=1    bigtime=1 inobtcount=1 nrext64=0
data     =                       bsize=4096   blocks=2441408512, imaxpct=5
         =                       sunit=64     swidth=256 blks
naming   =version 2              bsize=4096   ascii-ci=0, ftype=1
log      =internal log           bsize=4096   blocks=521728, version=2
         =                       sectsz=512   sunit=64 blks, lazy-count=1
realtime =none                   extsz=4096   blocks=0, rtextents=0
data blocks changed from 2441408512 to 2709843968
Filesystem            Type  Size  Used Avail Use% Mounted on
/dev/mapper/tank-data xfs    11T   73G   11T   1% /srv

Change #1025289 had a related patch set uploaded (by Marostegui; author: Marostegui):

[operations/puppet@production] mariadb: Productionize es6

https://gerrit.wikimedia.org/r/1025289

Change #1025289 merged by Marostegui:

[operations/puppet@production] mariadb: Productionize es6

https://gerrit.wikimedia.org/r/1025289

Change #1025291 had a related patch set uploaded (by Marostegui; author: Marostegui):

[operations/puppet@production] valid_section.pp: Add es6 and es7

https://gerrit.wikimedia.org/r/1025291

Change #1025291 merged by Marostegui:

[operations/puppet@production] valid_section.pp: Add es6 and es7

https://gerrit.wikimedia.org/r/1025291

es6 hosts added to zarcillo.
Also added es6 and es7 as valid sections

Change #1025370 had a related patch set uploaded (by Marostegui; author: Marostegui):

[operations/puppet@production] es2035: Make it es6 codfw master

https://gerrit.wikimedia.org/r/1025370

Change #1025370 merged by Marostegui:

[operations/puppet@production] es2035: Make it es6 codfw master

https://gerrit.wikimedia.org/r/1025370

es6 codfw is now showing up in orchestrator.

Change #1025603 had a related patch set uploaded (by Marostegui; author: Marostegui):

[operations/puppet@production] sections.yaml: Add es6 as valid dbctl section

https://gerrit.wikimedia.org/r/1025603

Change #1025603 merged by Marostegui:

[operations/puppet@production] sections.yaml: Add es6 as valid dbctl section

https://gerrit.wikimedia.org/r/1025603

Change #1025668 had a related patch set uploaded (by Marostegui; author: Marostegui):

[operations/puppet@production] instances.yaml: Add es6 hosts

https://gerrit.wikimedia.org/r/1025668

Change #1025668 merged by Marostegui:

[operations/puppet@production] instances.yaml: Add es6 hosts

https://gerrit.wikimedia.org/r/1025668

Change #1025670 had a related patch set uploaded (by Marostegui; author: Marostegui):

[operations/mediawiki-config@master] etcd.php: Add es6

https://gerrit.wikimedia.org/r/1025670

Change #1025672 had a related patch set uploaded (by Marostegui; author: Marostegui):

[operations/puppet@production] mariadb: Remove comments from es2035, es2036

https://gerrit.wikimedia.org/r/1025672

Change #1025672 merged by Marostegui:

[operations/puppet@production] mariadb: Remove comments from es2035, es2036

https://gerrit.wikimedia.org/r/1025672

Change #1025670 merged by jenkins-bot:

[operations/mediawiki-config@master] etcd.php: Add es6

https://gerrit.wikimedia.org/r/1025670

Mentioned in SAL (#wikimedia-operations) [2024-04-30T09:14:59Z] <marostegui@deploy1002> Started scap: Backport for [[gerrit:1025670|etcd.php: Add es6 (T355285 T355424)]]

Mentioned in SAL (#wikimedia-operations) [2024-04-30T09:17:48Z] <marostegui@deploy1002> marostegui: Backport for [[gerrit:1025670|etcd.php: Add es6 (T355285 T355424)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug)

Mentioned in SAL (#wikimedia-operations) [2024-04-30T09:30:01Z] <marostegui@deploy1002> Finished scap: Backport for [[gerrit:1025670|etcd.php: Add es6 (T355285 T355424)]] (duration: 15m 01s)

Change #1025699 had a related patch set uploaded (by Marostegui; author: Marostegui):

[operations/puppet@production] profile: Add es6 to the regex of valid sections

https://gerrit.wikimedia.org/r/1025699

Change #1025699 merged by Marostegui:

[operations/puppet@production] profile: Add es6 to the regex of valid sections

https://gerrit.wikimedia.org/r/1025699

Mentioned in SAL (#wikimedia-operations) [2024-04-30T09:51:19Z] <marostegui@cumin1002> dbctl commit (dc=all): 'Push es6 codfw config T355424', diff saved to https://phabricator.wikimedia.org/P61487 and previous config saved to /var/cache/conftool/dbconfig/20240430-095119-marostegui.json

Change #1025713 had a related patch set uploaded (by Marostegui; author: Marostegui):

[operations/puppet@production] es203[57]: Enable notifications

https://gerrit.wikimedia.org/r/1025713

Change #1025713 merged by Marostegui:

[operations/puppet@production] es203[57]: Enable notifications

https://gerrit.wikimedia.org/r/1025713

Mentioned in SAL (#wikimedia-operations) [2024-05-01T05:33:34Z] <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 7 days, 0:00:00 on 6 hosts with reason: Setting up T355285 T355424

Mentioned in SAL (#wikimedia-operations) [2024-05-01T05:33:51Z] <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 7 days, 0:00:00 on 6 hosts with reason: Setting up T355285 T355424

Mentioned in SAL (#wikimedia-operations) [2024-05-01T05:33:56Z] <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 7 days, 0:00:00 on es[1035,1039-1040].eqiad.wmnet with reason: Setting up T355285 T355424

Mentioned in SAL (#wikimedia-operations) [2024-05-01T05:34:11Z] <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 7 days, 0:00:00 on es[1035,1039-1040].eqiad.wmnet with reason: Setting up T355285 T355424

Change #1025899 had a related patch set uploaded (by Marostegui; author: Marostegui):

[operations/puppet@production] mariadb: Productionize codfw es7 servers

https://gerrit.wikimedia.org/r/1025899

Change #1025899 merged by Marostegui:

[operations/puppet@production] mariadb: Productionize codfw es7 servers

https://gerrit.wikimedia.org/r/1025899

Change #1025900 had a related patch set uploaded (by Marostegui; author: Marostegui):

[operations/puppet@production] es2038: Make it es7 master

https://gerrit.wikimedia.org/r/1025900

Change #1025900 merged by Marostegui:

[operations/puppet@production] es2038: Make it es7 master

https://gerrit.wikimedia.org/r/1025900

Change #1025902 had a related patch set uploaded (by Marostegui; author: Marostegui):

[operations/puppet@production] conftool: Add es7 as valid section

https://gerrit.wikimedia.org/r/1025902

Change #1025902 merged by Marostegui:

[operations/puppet@production] conftool: Add es7 as valid section

https://gerrit.wikimedia.org/r/1025902

dbctl section (not config per host) created for es7

Change #1026092 had a related patch set uploaded (by Marostegui; author: Marostegui):

[operations/puppet@production] instances.yaml: Add es7 codfw hosts

https://gerrit.wikimedia.org/r/1026092

Change #1026092 merged by Marostegui:

[operations/puppet@production] instances.yaml: Add es7 codfw hosts

https://gerrit.wikimedia.org/r/1026092

Mentioned in SAL (#wikimedia-operations) [2024-05-01T09:13:52Z] <marostegui@cumin1002> dbctl commit (dc=all): 'Push es7 codfw config T355424', diff saved to https://phabricator.wikimedia.org/P61561 and previous config saved to /var/cache/conftool/dbconfig/20240501-091352-marostegui.json

Pushed es7 dbctl config - tables aren't created

1--- codfw/externalLoads/es7 live
2+++ codfw/externalLoads/es7 generated
3@@ -1 +1,9 @@
4-{}
5+[
6+ {
7+ "es2038": 10
8+ },
9+ {
10+ "es2039": 100,
11+ "es2040": 100
12+ }
13+]
14--- codfw/hostsByName live
15+++ codfw/hostsByName generated
16@@ -100,5 +100,8 @@
17 "es2034": "10.192.48.164",
18 "es2035": "10.192.14.7",
19 "es2036": "10.192.32.22",
20- "es2037": "10.192.32.25"
21+ "es2037": "10.192.32.25",
22+ "es2038": "10.192.32.42",
23+ "es2039": "10.192.48.126",
24+ "es2040": "10.192.48.132"
25 }

Change #1026096 had a related patch set uploaded (by Marostegui; author: Marostegui):

[operations/mediawiki-config@master] etcd.php: Add es7

https://gerrit.wikimedia.org/r/1026096

Change #1026096 merged by jenkins-bot:

[operations/mediawiki-config@master] etcd.php: Add es7

https://gerrit.wikimedia.org/r/1026096

Mentioned in SAL (#wikimedia-operations) [2024-05-01T09:27:32Z] <marostegui@deploy1002> Started scap: Backport for [[gerrit:1026096|etcd.php: Add es7 (T355285 T355424)]]

Mentioned in SAL (#wikimedia-operations) [2024-05-01T09:30:18Z] <marostegui@deploy1002> marostegui: Backport for [[gerrit:1026096|etcd.php: Add es7 (T355285 T355424)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug)

Mentioned in SAL (#wikimedia-operations) [2024-05-01T09:42:26Z] <marostegui@deploy1002> Finished scap: Backport for [[gerrit:1026096|etcd.php: Add es7 (T355285 T355424)]] (duration: 14m 53s)

es2037 and es2039 set as candidate masters.

Change #1030621 had a related patch set uploaded (by Marostegui; author: Marostegui):

[operations/puppet@production] es2039: Remove "to be setup"

https://gerrit.wikimedia.org/r/1030621

Change #1030621 merged by Marostegui:

[operations/puppet@production] es2039: Remove "to be setup"

https://gerrit.wikimedia.org/r/1030621