Page MenuHomePhabricator

Ensure select dse-k8s-hosted microservices can run active/active
Closed, ResolvedPublic

Description

While we added a second dse-k8s cluster in CODFW, we never completed the necessary steps for our hosted services to run active/active.

There's not much use to having a second cluster if we can't do active/active for services, so I'm creating this ticket to:

  • Complete all steps necessary for select dse-k8s services to run active/active*
  • Verify operation

*Not all services can or should run active/active. For the purposes of this ticket, we're scoping to opensearch-ipoid and opensearch-test.

Event Timeline

bking triaged this task as Medium priority.
bking renamed this task from Deploy active/active for select dse-k8s microservices to Ensure select dse-k8s-hosted microservices can run active/active.Feb 17 2026, 8:19 PM
bking updated the task description. (Show Details)

This morning, we attempted to enable active-active via this maintenance plan and it had to be rolled back.

After some discussion with Traffic, we do not think the previous approach is going to work. One of the previous approaches we discussed is creating a new domain and service, distinct from disc-k8s-ingress-dse, that we can use for active-active services.

I'll begin the discussion with my team/Traffic in Slack and update the ticket as needed.

Change #1248605 had a related patch set uploaded (by Bking; author: Bking):

[operations/puppet@production] WIP: Add new active-active discovery service for dse-k8s

https://gerrit.wikimedia.org/r/1248605

Change #1248625 had a related patch set uploaded (by Bking; author: Bking):

[operations/dns@master] WIP: Add new active-active discovery records for dse-k8s

https://gerrit.wikimedia.org/r/1248625

Change #1250063 had a related patch set uploaded (by Bking; author: Bking):

[operations/dns@master] Add new active/active discovery records for dse-k8s opensearch test ns

https://gerrit.wikimedia.org/r/1250063

Change #1250068 had a related patch set uploaded (by Bking; author: Bking):

[operations/dns@master] Add new active/active discovery records for dse-k8s opensearch prod ns

https://gerrit.wikimedia.org/r/1250068

Change #1248605 merged by Bking:

[operations/puppet@production] Add new active-active discovery service for dse-k8s

https://gerrit.wikimedia.org/r/1248605

Change #1248625 merged by Ssingh:

[operations/dns@master] Add new active-active discovery records for dse-k8s

https://gerrit.wikimedia.org/r/1248625

Change #1250063 merged by Bking:

[operations/dns@master] Add new active/active discovery records for dse-k8s opensearch test ns

https://gerrit.wikimedia.org/r/1250063

Change #1250068 merged by Bking:

[operations/dns@master] Add new active/active discovery records for dse-k8s opensearch prod ns

https://gerrit.wikimedia.org/r/1250068

bking closed this task as Resolved.EditedMon, Mar 23, 3:39 PM

After merging the above changes, I can confirm that opensearch-ipoid.discovery.wmnet is now correctly balanced between EQIAD and CODFW, instead of just going to EQIAD.

This change has improved iPoid's P50 response times as well. Closing...