
Ensure that all appserver-related roles can be cleanly applied on bootstrap
Open, In Progress, Low, Public

Description

Currently, we have a few roles that don't apply cleanly on the first puppet run.

We should set up a pontoon project to verify which roles fail to apply cleanly, and to serve as a general sandbox for OS upgrades and other serviceops work.

I would say we need to support:

  • role::configcluster (etcd is a prerequisite for the appservers)
  • role::mediawiki::memcached
  • role::mediawiki::appserver
  • role::mediawiki::jobrunner

I expect the route to be quite bumpy with the appserver roles, so I would suggest we set up memcached first, to get acquainted with pontoon.
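
As a rough illustration of what "applies cleanly on the first puppet run" could mean in practice, here is a minimal sketch that classifies the exit code of a `puppet agent --test` run (`--test` implies `--detailed-exitcodes`). The helper names and the choice to treat codes 0 and 2 as "clean" are assumptions made for this example, not something decided in this task.

```python
# Sketch only: interpret the exit code of `puppet agent --test`.
# The meanings below follow Puppet's documented --detailed-exitcodes values;
# the function names and the "clean" definition are hypothetical.

EXIT_CODE_MEANINGS = {
    0: "run succeeded with no changes or failures",
    1: "run failed, or was not attempted because another run was in progress",
    2: "run succeeded and some resources were changed",
    4: "run succeeded but some resources failed",
    6: "run succeeded with both changes and failures",
}

# Assumption: a first run is "clean" if nothing failed, whether or not
# resources were changed (a bootstrap run is expected to change things).
CLEAN_EXIT_CODES = frozenset({0, 2})


def first_run_is_clean(exit_code: int) -> bool:
    """Return True if the exit code indicates a run without failures."""
    return exit_code in CLEAN_EXIT_CODES


def describe_exit_code(exit_code: int) -> str:
    """Return a human-readable description of a detailed exit code."""
    return EXIT_CODE_MEANINGS.get(exit_code, f"unexpected exit code {exit_code}")


if __name__ == "__main__":
    for code in (0, 1, 2, 4, 6):
        status = "clean" if first_run_is_clean(code) else "NOT clean"
        print(f"{code}: {status} ({describe_exit_code(code)})")
```

A stricter check could also require that a second run exits with 0, which would show that the role converges after bootstrap.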

Event Timeline

Joe triaged this task as Medium priority. Sep 27 2022, 9:16 AM
Clement_Goubert changed the task status from Open to In Progress. Sep 27 2022, 9:47 AM
Clement_Goubert moved this task from Incoming 🐫 to Doing 😎 on the serviceops board.

Starting tests in sre-sandbox while a specific WMCS project gets created for this.

Thank you for kick-starting this! Agreed on the bumpy road (having set up the configcluster role myself for o11y). Please reach out when in doubt or when hitting roadblocks.

I'm starting "easy" with memcached for now. Once that's working OOTB I'll move on to configcluster. Thanks for the offer to help, much appreciated.

Change 836133 had a related patch set uploaded (by Clément Goubert; author: Clément Goubert):

[operations/puppet@production] pontoon: move sops-appservers stack to appservers

https://gerrit.wikimedia.org/r/836133

Change 836133 abandoned by Clément Goubert:

[operations/puppet@production] pontoon: move sops-appservers stack to appservers

Reason:

Keeping as-is to avoid namespace confusion

https://gerrit.wikimedia.org/r/836133

memcached role works fine on pontoon.

On to configcluster. After battling with certificates for the etcd cluster, Joe dug up https://gerrit.wikimedia.org/r/c/operations/puppet/+/668701, which should probably help.

Clement_Goubert lowered the priority of this task from Medium to Low. Oct 19 2022, 3:52 PM
Clement_Goubert moved this task from Doing 😎 to this.quarter 🍕 on the serviceops board.