
deploy three new snapshots as replacements for snapshot1005,6,7 and set 1005,6,7 as spare
Closed, Resolved (Public)

Description

The new hosts are racked, have an OS installed, and have completed their first puppet run, so they are ready for deployment (see T272509). They are snapshot1011,12,13. After they are deployed, snapshot1005,6,7 can be decommissioned. Two other new hosts will be covered by a separate task.
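The comma shorthand used for host names in this task (for example snapshot1005,6,7) can be expanded mechanically. A minimal sketch, assuming each item after the first replaces the same number of trailing characters of the first name; `expand_hosts` is a hypothetical helper and not part of any dumps or puppet tooling:

```python
from __future__ import annotations


def expand_hosts(shorthand: str) -> list[str]:
    """Expand 'snapshot1005,6,7' into full host names.

    Assumption: every item after the first is a suffix that overwrites
    the last len(suffix) characters of the first (full) host name.
    """
    first, *suffixes = [part.strip() for part in shorthand.split(",")]
    if not first:
        raise ValueError("shorthand must start with a full host name")
    hosts = [first]
    for suffix in suffixes:
        if not suffix or len(suffix) > len(first):
            raise ValueError(f"cannot apply suffix {suffix!r} to {first!r}")
        # Keep the leading part of the first name, swap in the new suffix.
        hosts.append(first[: len(first) - len(suffix)] + suffix)
    return hosts


if __name__ == "__main__":
    for name in expand_hosts("snapshot1005,6,7"):
        print(name)
```

A single name with no commas is returned unchanged as a one-element list.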

Event Timeline

ArielGlenn triaged this task as Medium priority. Apr 28 2021, 6:10 AM
ArielGlenn created this task.

Change 683151 had a related patch set uploaded (by ArielGlenn; author: ArielGlenn):

[operations/puppet@production] bring up snapshot1001,12,13 as dumps testbed hosts

https://gerrit.wikimedia.org/r/683151

Change 683151 merged by ArielGlenn:

[operations/puppet@production] bring up snapshot1001,12,13 as dumps testbed hosts

https://gerrit.wikimedia.org/r/683151

Change 683262 had a related patch set uploaded (by ArielGlenn; author: ArielGlenn):

[labs/private@master] Add fake mcrouter secrets for snapshot1011,12,13

https://gerrit.wikimedia.org/r/683262

Change 683262 merged by ArielGlenn:

[labs/private@master] Add fake mcrouter secrets for snapshot1011,12,13

https://gerrit.wikimedia.org/r/683262

Change 683263 had a related patch set uploaded (by ArielGlenn; author: ArielGlenn):

[operations/dumps/scap@master] Add snapshot1011,12,13 to scap targets for the dumps repo

https://gerrit.wikimedia.org/r/683263

Change 683263 merged by ArielGlenn:

[operations/dumps/scap@master] Add snapshot1011,12,13 to scap targets for the dumps repo

https://gerrit.wikimedia.org/r/683263

I had to manually edit /srv/deployment/dumps/dumps-cache/.config on all three hosts to change the name of the upstream host from deploy1001 to deploy1002; the name is still wrong in the repo on deploy1002. See T197470.

I also needed to add mcrouter secrets for all three hosts, in both the labs/private repo and the private repo on the prod puppetmaster; see https://wikitech.wikimedia.org/wiki/Memcached_for_MediaWiki/mcrouter#Generate_certs_for_a_new_host
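The manual .config edit described above amounts to swapping one host name for another in a text file. A rough sketch of that substitution, treating the file as opaque text because its actual format is not shown in this task; `replace_host` and the sample line are hypothetical, for illustration only:

```python
import re


def replace_host(config_text: str, old: str, new: str) -> str:
    """Replace whole occurrences of the host name `old` with `new`.

    The lookarounds stop a match inside a longer name, so replacing
    'deploy1001' leaves a name such as 'deploy10011' untouched.
    """
    pattern = re.compile(
        r"(?<![A-Za-z0-9-])" + re.escape(old) + r"(?![A-Za-z0-9-])"
    )
    # A function replacement avoids any backslash handling in `new`.
    return pattern.sub(lambda match: new, config_text)


if __name__ == "__main__":
    # Hypothetical file content; the real .config keys are not known here.
    sample = "upstream = deploy1001.example\n"
    print(replace_host(sample, "deploy1001", "deploy1002"), end="")
```

Writing the result back to the file is left out deliberately, so the output can be reviewed before anything on the host is changed.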

Change 683293 had a related patch set uploaded (by ArielGlenn; author: ArielGlenn):

[operations/puppet@production] make snapshot1011 the new wikidata dumper and snapshot1012 the new enwiki dumper

https://gerrit.wikimedia.org/r/683293

Change 683293 merged by ArielGlenn:

[operations/puppet@production] make snapshot1011 the new wikidata dumper and snapshot1012 the new enwiki dumper

https://gerrit.wikimedia.org/r/683293

After merging the above patch, I had to remove the cron jobs from the dumpsgen crontab manually on snapshot1006,7, since switching the role to testbed does not, and can't really, do that. I also tested full dump runs of angwikibooks and skwikibooks with the test config file, which writes output to a test directory. The first wiki had previous runs, so it exercised prefetch; the second did not, so it exercised db fetches of text content. Everything looks ready to go.

The new hosts will be in operation for the dump run beginning on May 1.

Next week I'll convert snapshot1005,6,7 to spare.

Change 685636 had a related patch set uploaded (by ArielGlenn; author: ArielGlenn):

[operations/puppet@production] Remove snapshot1005,6,7 from mediawiki scap targets

https://gerrit.wikimedia.org/r/685636

Change 685636 merged by ArielGlenn:

[operations/puppet@production] Remove snapshot1005,6,7 from mediawiki scap targets

https://gerrit.wikimedia.org/r/685636

Change 685638 had a related patch set uploaded (by ArielGlenn; author: ArielGlenn):

[operations/dumps/scap@master] remove snapshot1005,6,7 from dump scap targets

https://gerrit.wikimedia.org/r/685638

Change 685638 merged by ArielGlenn:

[operations/dumps/scap@master] remove snapshot1005,6,7 from dump scap targets

https://gerrit.wikimedia.org/r/685638

The new hosts are busily running dumps and the old ones have been marked as spare. Closing!