Page MenuHomePhabricator

apt: improve apt failover orchestration
Open, MediumPublic

Description

Some time ago apt was migrated to use active/passive DNS discovery records. this allows us to easily move the apt.wikimedia.org domain between sites. however there are still some aspects of what we consider the primary server to be controlled by puppet (see this change. We should investigate if we can leverage confd to to also control theses aspects.

A simple solution could be to writ a custom fact that interacts with confd to pull this information and feed it into puppet.

This task should also consider the work to move the apt serveres behind the caches (T317178)

We should also update the dc switch over wiki with any changes

Event Timeline

jbond triaged this task as Medium priority.Mar 1 2023, 10:42 AM
jbond created this task.
Restricted Application added a subscriber: Aklapper. · View Herald Transcript

Change 893468 had a related patch set uploaded (by Jbond; author: jbond):

[operations/puppet@production] profile::confd: add a confd profile

https://gerrit.wikimedia.org/r/893468

Change 893471 had a related patch set uploaded (by Jbond; author: jbond):

[operations/puppet@production] confd::file: drop relative prefix

https://gerrit.wikimedia.org/r/893471

We should find a standard setup for those use cases, I can see Netbox having exactly the same issue/requirement (some puppet-driver resources based on primary status).

Aklapper renamed this task from apt: improve apt failover ochastration to apt: improve apt failover orchestration.Mar 1 2023, 2:01 PM

Change 893496 had a related patch set uploaded (by Jbond; author: jbond):

[operations/puppet@production] P:confd: Add support for discovery facts

https://gerrit.wikimedia.org/r/893496

Change 893522 had a related patch set uploaded (by Jbond; author: John Bond):

[operations/puppet@production] P:aptrepo: use new wmflib::discovery::pooled_sites

https://gerrit.wikimedia.org/r/893522

We should find a standard setup for those use cases, I can see Netbox having exactly the same issue/requirement (some puppet-driver resources based on primary status).

fyi this and the preceding patch is what i have as a proposal, however joe wants to chat about the use of conftool be fore we move forward

Change 893468 abandoned by Jbond:

[operations/puppet@production] profile::confd: add a confd profile

Reason:

this got done recently in a different change

https://gerrit.wikimedia.org/r/893468