bking (Brian King)
Senior Site Reliability Engineer, Search Platform Team

User Details

User Since
Dec 15 2021, 9:19 PM (233 w, 3 d)
Availability
Available
LDAP User
Unknown
MediaWiki User
BKing (WMF) [ Global Accounts ]

Recent Activity

Fri, May 29

bking updated subscribers of T324335: Remove logstash from the CirrusSearch servers.

Per this Slack conversation, it sounds like @colewhite is evaluating Datadog's Vector. Cole, I'm not sure how far along you are in your evaluation, but this seems like a potential use case for it. I'm happy to help test if you are in fact working on Vector. CC @brouberol, as he has some history with Datadog products ;)

Fri, May 29, 5:22 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Observability-Logging, Discovery-Search (Current work)

Thu, May 28

bking changed the status of T427348: S3 credentials for the WDQS K8S deployment from In Progress to Open.
Thu, May 28, 10:05 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Wikidata Platform Team
bking moved T427517: Issue 6 month certificates for OpenSearch-on-K8S from Backlog - project to In Progress on the Data-Platform-SRE (2026-04-24 - 2026-05-15) board.
Thu, May 28, 9:57 PM · Patch-For-Review, Data-Platform-SRE (2026-06-05 - 2026-06-26)
bking changed the status of T427517: Issue 6 month certificates for OpenSearch-on-K8S, a subtask of T421757: ☂️ Migrate production OpenSearch clusters from 1.x-2.x ☂️, from Open to In Progress.
Thu, May 28, 9:57 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Discovery-Search (2026.05.04 - 2026.05.29)
bking changed the status of T427517: Issue 6 month certificates for OpenSearch-on-K8S from Open to In Progress.
Thu, May 28, 9:57 PM · Patch-For-Review, Data-Platform-SRE (2026-06-05 - 2026-06-26)
bking placed T427348: S3 credentials for the WDQS K8S deployment up for grabs.
Thu, May 28, 9:55 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Wikidata Platform Team
bking moved T427348: S3 credentials for the WDQS K8S deployment from Backlog - project to In Progress on the Data-Platform-SRE (2026-04-24 - 2026-05-15) board.

I'll take a closer look tomorrow, but from what I can tell, the secret is being used correctly. I'm comparing the Airflow helm chart's handling of secrets against the secret creation in your linked CR, and they look identical.

Thu, May 28, 9:54 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Wikidata Platform Team
bking changed the status of T427319: request: s3 access for wdqs::alternatives from Open to In Progress.
Thu, May 28, 7:59 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Wikidata Platform Team
bking edited projects for T427348: S3 credentials for the WDQS K8S deployment, added: Data-Platform-SRE (2026-04-24 - 2026-05-15); removed Data-Platform-SRE.
Thu, May 28, 7:40 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Wikidata Platform Team
bking changed the status of T427348: S3 credentials for the WDQS K8S deployment from Open to In Progress.
Thu, May 28, 7:40 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Wikidata Platform Team

Wed, May 27

bking updated the task description for T421757: ☂️ Migrate production OpenSearch clusters from 1.x-2.x ☂️.
Wed, May 27, 10:04 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Discovery-Search (2026.05.04 - 2026.05.29)
bking updated subscribers of T427306: Migrate Relforge clusters from OpenSearch 1.x->2x.

Working on relforge, I've noticed the cluster won't form unless `cluster.initial_cluster_manager_nodes` and `discovery.seed_hosts` are set. This matches up with the OpenSearch docs, but it doesn't match up with our current "discovery" (cluster formation) settings in production.

Wed, May 27, 8:55 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Discovery-Search (2026.05.04 - 2026.05.29)
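
The relforge observation above (the cluster only forms when both `cluster.initial_cluster_manager_nodes` and `discovery.seed_hosts` are set) could be turned into a quick pre-flight check. The following is a minimal sketch, not the team's tooling: the function name, the flattened-dict input format, and the example host names are all assumptions made for illustration.

```python
# Setting names taken from the comment above; everything else is hypothetical.
REQUIRED_DISCOVERY_KEYS = (
    "cluster.initial_cluster_manager_nodes",
    "discovery.seed_hosts",
)


def missing_discovery_settings(settings):
    """Return the required discovery keys that are absent or empty.

    `settings` is assumed to be a flat dict of dotted setting names, as one
    might get after flattening a parsed opensearch.yml.
    """
    return [key for key in REQUIRED_DISCOVERY_KEYS if not settings.get(key)]


# Hypothetical settings for an example cluster (host names are made up).
example = {
    "cluster.name": "relforge-example",
    "discovery.seed_hosts": ["node1.example.org", "node2.example.org"],
}

print(missing_discovery_settings(example))
# → ['cluster.initial_cluster_manager_nodes']
```

An empty list would mean both settings are present; it says nothing about whether their values are correct for a given cluster.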
bking updated the task description for T427306: Migrate Relforge clusters from OpenSearch 1.x->2x.
Wed, May 27, 8:42 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Discovery-Search (2026.05.04 - 2026.05.29)
bking updated the task description for T421757: ☂️ Migrate production OpenSearch clusters from 1.x-2.x ☂️.
Wed, May 27, 1:25 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Discovery-Search (2026.05.04 - 2026.05.29)

Tue, May 26

bking closed T424860: Consider managing OpenSearch cluster dynamic settings with terraform as Declined.

Per today's discussion at DPE SRE standup, there are concerns about Terraform introducing too much complexity. We also discussed the possibility of managing these settings with a Kubernetes operator in the future.

Tue, May 26, 7:24 PM · Data-Platform-SRE (2026-04-24 - 2026-05-15), Discovery-Search
bking removed a subtask for T422860: Migrate Cloudelastic to OpenSearch 2.x: T425422: Some discovery alerts are rejected due to overly long headers.
Tue, May 26, 7:15 PM · Data-Platform-SRE (2026-04-24 - 2026-05-15), Patch-For-Review, Discovery-Search (2026.04.06 - 2026.05.01)
bking removed a parent task for T425422: Some discovery alerts are rejected due to overly long headers: T422860: Migrate Cloudelastic to OpenSearch 2.x.
Tue, May 26, 7:15 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Discovery-Search (2026.05.04 - 2026.05.29)
bking changed the status of T427306: Migrate Relforge clusters from OpenSearch 1.x->2x from Open to In Progress.
Tue, May 26, 6:30 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Discovery-Search (2026.05.04 - 2026.05.29)
bking changed the status of T427306: Migrate Relforge clusters from OpenSearch 1.x->2x, a subtask of T421757: ☂️ Migrate production OpenSearch clusters from 1.x-2.x ☂️, from Open to In Progress.
Tue, May 26, 6:30 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Discovery-Search (2026.05.04 - 2026.05.29)
bking updated the task description for T427306: Migrate Relforge clusters from OpenSearch 1.x->2x.
Tue, May 26, 5:33 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Discovery-Search (2026.05.04 - 2026.05.29)
bking created T427306: Migrate Relforge clusters from OpenSearch 1.x->2x.
Tue, May 26, 5:23 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Discovery-Search (2026.05.04 - 2026.05.29)
bking updated the task description for T421757: ☂️ Migrate production OpenSearch clusters from 1.x-2.x ☂️.
Tue, May 26, 5:13 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Discovery-Search (2026.05.04 - 2026.05.29)
bking moved T423993: Upgrade old indices in the CirrusSearch opensearch clusters from Backlog - project to In Progress on the Data-Platform-SRE (2026-04-24 - 2026-05-15) board.
Tue, May 26, 2:03 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Discovery-Search (2026.05.04 - 2026.05.29), Language and Product Localization, MediaWiki-extensions-Translate, Toolhub
bking changed the status of T423993: Upgrade old indices in the CirrusSearch opensearch clusters from Open to In Progress.
Tue, May 26, 2:02 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Discovery-Search (2026.05.04 - 2026.05.29), Language and Product Localization, MediaWiki-extensions-Translate, Toolhub
bking changed the status of T423993: Upgrade old indices in the CirrusSearch opensearch clusters, a subtask of T421757: ☂️ Migrate production OpenSearch clusters from 1.x-2.x ☂️, from Open to In Progress.
Tue, May 26, 2:02 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Discovery-Search (2026.05.04 - 2026.05.29)
bking renamed T425585: Write lightweight OCI-image-based Puppet plans for beta cluster from Write lightweight OCI-image-based Puppet plans for beta cluster/relforge to Write lightweight OCI-image-based Puppet plans for beta cluster.
Tue, May 26, 2:01 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Beta-Cluster-Infrastructure, Discovery-Search (2026.03.03 - 2026.04.03)
bking updated the task description for T421757: ☂️ Migrate production OpenSearch clusters from 1.x-2.x ☂️.
Tue, May 26, 1:13 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Discovery-Search (2026.05.04 - 2026.05.29)
bking closed T424035: Drop the CirrusSearch metastore as Resolved.
Tue, May 26, 1:13 PM · Discovery-Search (2026.05.04 - 2026.05.29), CirrusSearch
bking updated the task description for T421757: ☂️ Migrate production OpenSearch clusters from 1.x-2.x ☂️.
Tue, May 26, 1:12 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Discovery-Search (2026.05.04 - 2026.05.29)

Fri, May 22

bking updated subscribers of T427094: Gitlab: extra spaces in gitlab servers' known_hosts entries are confounding paramiko.
Fri, May 22, 8:14 PM · GitLab, collaboration-services, Release-Engineering-Team
bking updated the task description for T427094: Gitlab: extra spaces in gitlab servers' known_hosts entries are confounding paramiko.
Fri, May 22, 8:01 PM · GitLab, collaboration-services, Release-Engineering-Team
bking created T427094: Gitlab: extra spaces in gitlab servers' known_hosts entries are confounding paramiko.
Fri, May 22, 8:00 PM · GitLab, collaboration-services, Release-Engineering-Team
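
The task title above only says that extra spaces in `known_hosts` entries confuse paramiko; the exact malformed layout is not shown here. As a rough sketch, assuming the problem is runs of whitespace between the host, key type, and key fields, a line could be normalized like this (the function name and sample entry are hypothetical):

```python
def normalize_known_hosts_line(line):
    """Collapse runs of whitespace between fields into single spaces.

    Blank lines and comment lines are returned stripped but otherwise
    untouched. This assumes the only defect is extra spacing between fields.
    """
    stripped = line.strip()
    if not stripped or stripped.startswith("#"):
        return stripped
    return " ".join(stripped.split())


# Made-up entry with doubled spaces between fields.
sample = "gitlab.example.org  ssh-ed25519   AAAAC3NzaExampleKey"
print(normalize_known_hosts_line(sample))
# → gitlab.example.org ssh-ed25519 AAAAC3NzaExampleKey
```

Whether a cleanup like this belongs in the file generation or on the client side is a question for the task itself.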

Thu, May 21

bking moved T426101: Create detailed migration plan for OpenSearch 1.x-2x from Backlog - project to In Progress on the Data-Platform-SRE (2026-04-24 - 2026-05-15) board.
Thu, May 21, 4:35 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Discovery-Search (2026.05.04 - 2026.05.29)
bking placed T423620: OpenSearch on K8s: update docker images and think about monitoring options up for grabs.
Thu, May 21, 4:03 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26)
bking changed the status of T423620: OpenSearch on K8s: update docker images and think about monitoring options from In Progress to Open.

I haven't had time to make progress on this ticket for a few weeks, so I'm moving it back to the backlog.

Thu, May 21, 4:03 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26)

Wed, May 20

bking added a comment to T426862: Ensure cookbooks work with OpenSearch 2.x.

We ran into this bug when running sre.elasticsearch.rolling-operation today. There is a pretty easy workaround posted in the GitHub issue, but we'll need to document that and possibly update our cookbooks to work around it.

Wed, May 20, 5:05 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Discovery-Search (2026.05.04 - 2026.05.29)
bking updated the task description for T426862: Ensure cookbooks work with OpenSearch 2.x.
Wed, May 20, 5:02 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Discovery-Search (2026.05.04 - 2026.05.29)
bking added a comment to T423312: Q4:rack/setup/install dse-k8s-wdqs200[1-4] (formerly wdqs20[28-31]).

NP, thanks for your help on this! Feel free to ping me in IRC (inflatador) if you need anything else.

Wed, May 20, 3:05 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Wikidata Platform Team, SRE, ops-codfw, DC-Ops
bking changed the status of T426101: Create detailed migration plan for OpenSearch 1.x-2x, a subtask of T421757: ☂️ Migrate production OpenSearch clusters from 1.x-2.x ☂️, from Open to In Progress.
Wed, May 20, 2:59 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Discovery-Search (2026.05.04 - 2026.05.29)
bking changed the status of T426101: Create detailed migration plan for OpenSearch 1.x-2x from Open to In Progress.
Wed, May 20, 2:59 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Discovery-Search (2026.05.04 - 2026.05.29)
bking updated subscribers of T426101: Create detailed migration plan for OpenSearch 1.x-2x.
Wed, May 20, 2:59 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Discovery-Search (2026.05.04 - 2026.05.29)
bking added a comment to T426101: Create detailed migration plan for OpenSearch 1.x-2x.

I've created a tentative plan here; feel free to look it over and offer suggestions.

Wed, May 20, 2:58 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Discovery-Search (2026.05.04 - 2026.05.29)
bking added a comment to T423312: Q4:rack/setup/install dse-k8s-wdqs200[1-4] (formerly wdqs20[28-31]).

Hello @Jhancock.wm, per the above I have requested one host in each row. Please avoid row D if possible, since it already has 4 wdqs-main hosts.

Wed, May 20, 2:41 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Wikidata Platform Team, SRE, ops-codfw, DC-Ops
bking changed the status of T426862: Ensure cookbooks work with OpenSearch 2.x from Open to In Progress.
Wed, May 20, 1:54 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Discovery-Search (2026.05.04 - 2026.05.29)
bking changed the status of T426862: Ensure cookbooks work with OpenSearch 2.x, a subtask of T421757: ☂️ Migrate production OpenSearch clusters from 1.x-2.x ☂️, from Open to In Progress.
Wed, May 20, 1:54 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Discovery-Search (2026.05.04 - 2026.05.29)
bking created T426862: Ensure cookbooks work with OpenSearch 2.x.
Wed, May 20, 1:51 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Discovery-Search (2026.05.04 - 2026.05.29)
bking updated the task description for T421757: ☂️ Migrate production OpenSearch clusters from 1.x-2.x ☂️.
Wed, May 20, 1:42 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Discovery-Search (2026.05.04 - 2026.05.29)

Tue, May 19

bking moved T423314: Q4:rack/setup/install dse-k8s-wdqs100[1-3] (formerly wdqs103[6-8]) from Backlog - project to In Progress on the Data-Platform-SRE (2026-04-24 - 2026-05-15) board.

Moving to In Progress per IRC conversation with @Jclark-ctr. These are our first servers with the hardware profile Config F single CPU, but with smaller NVMe drives (2x 2TB). Traffic's cp (CDN) servers use almost the same profile, so we should be able to borrow their partman config. I'm checking now and should have a patch up within the next day or so.

Tue, May 19, 9:56 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Patch-For-Review, Wikidata Platform Team, ops-eqiad, SRE, DC-Ops
bking updated the task description for T421757: ☂️ Migrate production OpenSearch clusters from 1.x-2.x ☂️.
Tue, May 19, 2:38 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Discovery-Search (2026.05.04 - 2026.05.29)

Mon, May 18

bking changed the status of T414345: Consider donating our time to enhance the official OpenSearch Prometheus exporter from Resolved to Declined.
Mon, May 18, 3:21 PM · Data-Platform-SRE (2026-04-24 - 2026-05-15), Discovery-Search (2026.05.04 - 2026.05.29), Essential-Work
bking edited projects for T414345: Consider donating our time to enhance the official OpenSearch Prometheus exporter, added: Data-Platform-SRE (2026-04-24 - 2026-05-15); removed Data-Platform-SRE.
Mon, May 18, 3:21 PM · Data-Platform-SRE (2026-04-24 - 2026-05-15), Discovery-Search (2026.05.04 - 2026.05.29), Essential-Work
bking closed T414345: Consider donating our time to enhance the official OpenSearch Prometheus exporter as Resolved.

Per today's Search Platform standup, we don't have the bandwidth to contribute at the moment. As such, I'll close this out. We can always revisit as time permits.

Mon, May 18, 3:20 PM · Data-Platform-SRE (2026-04-24 - 2026-05-15), Discovery-Search (2026.05.04 - 2026.05.29), Essential-Work

Sat, May 16

bking closed T426228: Create depool hiera keys for cirrussearch hosts, a subtask of T426199: codfw: rack A2 maintenance, as Resolved.
Sat, May 16, 1:56 PM · ServiceOps-Upgrades-Hardware, ServiceOps new, Infrastructure-Foundations, netops
bking closed T426228: Create depool hiera keys for cirrussearch hosts as Resolved.

This is complete. Closing...

Sat, May 16, 1:56 PM · Infrastructure-Foundations, Data-Platform-SRE (2026-04-24 - 2026-05-15), netops

Fri, May 15

bking renamed T425585: Write lightweight OCI-image-based Puppet plans for beta cluster from Write lightweight quadlet-based Puppet plans for beta cluster/relforge to Write lightweight OCI-image-based Puppet plans for beta cluster/relforge.
Fri, May 15, 2:18 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Beta-Cluster-Infrastructure, Discovery-Search (2026.03.03 - 2026.04.03)
bking changed the status of T425585: Write lightweight OCI-image-based Puppet plans for beta cluster, a subtask of T421763: Migrate beta cluster to OpenSearch 2.x, from Open to In Progress.
Fri, May 15, 2:18 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Beta-Cluster-Infrastructure, Discovery-Search (2026.03.03 - 2026.04.03)

Thu, May 14

bking changed the status of T426228: Create depool hiera keys for cirrussearch hosts, a subtask of T426199: codfw: rack A2 maintenance, from Open to In Progress.
Thu, May 14, 8:34 PM · ServiceOps-Upgrades-Hardware, ServiceOps new, Infrastructure-Foundations, netops
bking changed the status of T426228: Create depool hiera keys for cirrussearch hosts from Open to In Progress.
Thu, May 14, 8:34 PM · Infrastructure-Foundations, Data-Platform-SRE (2026-04-24 - 2026-05-15), netops
bking changed the status of T422378: Move OpenSearch-iPoid behind TLS/Update to OpenSearch 3.x, a subtask of T419289: Ensure OpenSearch on k8s clusters can safely use envoy TLS termination, from Open to In Progress.
Thu, May 14, 7:06 PM · Data-Platform-SRE (2026-03-27 - 2026-04-17), Patch-For-Review
bking changed the status of T422378: Move OpenSearch-iPoid behind TLS/Update to OpenSearch 3.x from Open to In Progress.
Thu, May 14, 7:06 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Patch-For-Review
bking updated Other Assignee for T393966: Update WDQS SLOs to reflect graph split changes, removed: bking.
Thu, May 14, 6:46 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Wikidata Platform Team, Wikidata, Wikidata-Query-Service, User-Elukey, Essential-Work, SRE-SLO, observability
bking added a comment to T426073: Migrate toolhub indices from production OpenSearch to OpenSearch on k8s.

@bd808 I believe we are forced by the OpenSearch operator to use basic and/or mutual TLS auth. I'll check again and have an answer for you by this time next week.

Thu, May 14, 6:31 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Discovery-Search (2026.05.04 - 2026.05.29), User-bd808, Toolhub
bking updated the task description for T426101: Create detailed migration plan for OpenSearch 1.x-2x.
Thu, May 14, 6:25 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Discovery-Search (2026.05.04 - 2026.05.29)
bking updated Other Assignee for T418175: Create SLO for the opensearch-ipoid cluster that runs on our OpenSearch on K8s platform, removed: bking.
Thu, May 14, 4:39 PM · Data-Platform-SRE (2026-04-24 - 2026-05-15)

Wed, May 13

bking added a comment to T426199: codfw: rack A2 maintenance.

@ayounsi Sorry for the trouble, confirming that the depool and repool commands are enough for cirrussearch hosts.

Wed, May 13, 4:58 PM · ServiceOps-Upgrades-Hardware, ServiceOps new, Infrastructure-Foundations, netops
bking created T426228: Create depool hiera keys for cirrussearch hosts.
Wed, May 13, 4:58 PM · Infrastructure-Foundations, Data-Platform-SRE (2026-04-24 - 2026-05-15), netops
bking moved T426196: Investigate an-druid1007 unresponsiveness from Backlog - project to In Progress on the Data-Platform-SRE (2026-04-24 - 2026-05-15) board.
Wed, May 13, 3:28 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26)
bking updated the task description for T426196: Investigate an-druid1007 unresponsiveness.
Wed, May 13, 2:16 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26)
bking changed the status of T426196: Investigate an-druid1007 unresponsiveness from Open to In Progress.
Wed, May 13, 2:05 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26)
bking created T426196: Investigate an-druid1007 unresponsiveness.
Wed, May 13, 1:26 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26)

Tue, May 12

bking added a comment to T426073: Migrate toolhub indices from production OpenSearch to OpenSearch on k8s.

^^ Basically what he said 🙃

  • Data Platform sets up new containerized opensearch cluster for Toolhub use: Done, see this page for how to access.
Tue, May 12, 9:54 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Discovery-Search (2026.05.04 - 2026.05.29), User-bd808, Toolhub
bking moved T426073: Migrate toolhub indices from production OpenSearch to OpenSearch on k8s from Backlog - project to In Progress on the Data-Platform-SRE (2026-04-24 - 2026-05-15) board.
Tue, May 12, 6:57 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Discovery-Search (2026.05.04 - 2026.05.29), User-bd808, Toolhub
bking changed the status of T426073: Migrate toolhub indices from production OpenSearch to OpenSearch on k8s from Open to In Progress.
Tue, May 12, 6:56 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Discovery-Search (2026.05.04 - 2026.05.29), User-bd808, Toolhub
bking changed the status of T426073: Migrate toolhub indices from production OpenSearch to OpenSearch on k8s, a subtask of T424248: ☂️Migrate non-cirrus indices from production OpenSearch to OpenSearch on k8s ☂️, from Open to In Progress.
Tue, May 12, 6:56 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Discovery-Search (2026.05.04 - 2026.05.29)
bking moved T417700: Possible to improve latency for OpenSearch iPoid? from Backlog - project to Done on the Data-Platform-SRE (2026-04-24 - 2026-05-15) board.
Tue, May 12, 5:30 PM · Data-Platform-SRE (2026-04-24 - 2026-05-15)
bking edited projects for T417700: Possible to improve latency for OpenSearch iPoid?, added: Data-Platform-SRE (2026-04-24 - 2026-05-15); removed Data-Platform-SRE.
Tue, May 12, 5:29 PM · Data-Platform-SRE (2026-04-24 - 2026-05-15)
bking closed T417700: Possible to improve latency for OpenSearch iPoid? as Resolved.

Closing in favor of T421293, which has more concrete steps to improve performance (move service into mesh). Feel free to reopen this task if that's not acceptable.

Tue, May 12, 5:29 PM · Data-Platform-SRE (2026-04-24 - 2026-05-15)
bking added a comment to T422378: Move OpenSearch-iPoid behind TLS/Update to OpenSearch 3.x.

Tentative maintenance plan:

  • Merge changes to deployment-charts that include the OpenSearch 3.x image
  • Failover opensearch-ipoid to EQIAD only using DNS discovery (DPE SRE)
  • Upgrade to OpenSearch 3.x in CODFW (DPE SRE)
  • Confirm application is working on OpenSearch 3.x (PSI)
  • Repeat in EQIAD
Tue, May 12, 5:27 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Patch-For-Review
bking updated the task description for T422378: Move OpenSearch-iPoid behind TLS/Update to OpenSearch 3.x.
Tue, May 12, 5:19 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Patch-For-Review
bking moved T422378: Move OpenSearch-iPoid behind TLS/Update to OpenSearch 3.x from Backlog - project to In Progress on the Data-Platform-SRE (2026-04-24 - 2026-05-15) board.
Tue, May 12, 5:18 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Patch-For-Review
bking updated the task description for T422378: Move OpenSearch-iPoid behind TLS/Update to OpenSearch 3.x.
Tue, May 12, 5:18 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Patch-For-Review
bking created T426101: Create detailed migration plan for OpenSearch 1.x-2x.
Tue, May 12, 4:57 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Discovery-Search (2026.05.04 - 2026.05.29)
bking updated the task description for T426073: Migrate toolhub indices from production OpenSearch to OpenSearch on k8s.
Tue, May 12, 3:01 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Discovery-Search (2026.05.04 - 2026.05.29), User-bd808, Toolhub
bking created T426073: Migrate toolhub indices from production OpenSearch to OpenSearch on k8s.
Tue, May 12, 2:23 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Discovery-Search (2026.05.04 - 2026.05.29), User-bd808, Toolhub
bking updated the task description for T426067: Create WDQS meltdown runbook.
Tue, May 12, 2:17 PM · Wikidata Platform Team, Data-Platform-SRE
bking placed T426067: Create WDQS meltdown runbook up for grabs.
Tue, May 12, 2:16 PM · Wikidata Platform Team, Data-Platform-SRE
bking added a project to T426067: Create WDQS meltdown runbook: Wikidata Platform Team.
Tue, May 12, 2:11 PM · Wikidata Platform Team, Data-Platform-SRE
bking created T426067: Create WDQS meltdown runbook.
Tue, May 12, 2:07 PM · Wikidata Platform Team, Data-Platform-SRE

Mon, May 11

bking changed the status of T425973: Requesting Ceph S3 credentials for Wikidata Platform from Open to In Progress.
Mon, May 11, 7:08 PM · Data-Platform-SRE (2026-04-24 - 2026-05-15), Wikidata Platform Team
bking closed T425681: Ensure `vm.max_map_count` count is set to an adequately high number in our OpenSearch environments as Resolved.
Mon, May 11, 3:48 PM · Data-Platform-SRE (2026-04-24 - 2026-05-15)
bking added a comment to T425681: Ensure `vm.max_map_count` count is set to an adequately high number in our OpenSearch environments.

I've merged everything in our environment, and submitted some changes upstream. I don't think we need to wait for the upstream changes to be reviewed, so I'll close this one out for now.

Mon, May 11, 3:48 PM · Data-Platform-SRE (2026-04-24 - 2026-05-15)
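
For the `vm.max_map_count` task above, a host-level check might look like the sketch below. The threshold of 262144 is the value the OpenSearch installation docs suggest as a minimum; whether that is the number used in these environments is an assumption, as are the function names.

```python
from pathlib import Path

# Minimum suggested by the OpenSearch docs; assumed here, not taken from the task.
RECOMMENDED_MIN = 262144


def read_max_map_count(path="/proc/sys/vm/max_map_count"):
    """Read the current vm.max_map_count value from procfs (Linux only)."""
    return int(Path(path).read_text().strip())


def is_adequate(value, minimum=RECOMMENDED_MIN):
    """Return True if the value meets or exceeds the chosen minimum."""
    return value >= minimum


if __name__ == "__main__":
    proc_path = Path("/proc/sys/vm/max_map_count")
    if proc_path.exists():
        current = read_max_map_count(proc_path)
        print(current, "ok" if is_adequate(current) else "too low")
```

On a host where the value is too low, it would typically be raised through sysctl configuration rather than from a script like this.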
bking updated the task description for T425681: Ensure `vm.max_map_count` count is set to an adequately high number in our OpenSearch environments.
Mon, May 11, 3:45 PM · Data-Platform-SRE (2026-04-24 - 2026-05-15)

May 7 2026

bking renamed T425585: Write lightweight OCI-image-based Puppet plans for beta cluster from 1. Write lightweight quadlet-based Puppet plans for beta cluster/relforge to Write lightweight quadlet-based Puppet plans for beta cluster/relforge.
May 7 2026, 10:04 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Beta-Cluster-Infrastructure, Discovery-Search (2026.03.03 - 2026.04.03)
bking reopened T425585: Write lightweight OCI-image-based Puppet plans for beta cluster as "Open".

Thanks @bd808, I'll take a look for sure.

May 7 2026, 10:04 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Beta-Cluster-Infrastructure, Discovery-Search (2026.03.03 - 2026.04.03)
bking reopened T425585: Write lightweight OCI-image-based Puppet plans for beta cluster, a subtask of T421763: Migrate beta cluster to OpenSearch 2.x, as Open.
May 7 2026, 10:04 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Beta-Cluster-Infrastructure, Discovery-Search (2026.03.03 - 2026.04.03)
bking updated Other Assignee for T425585: Write lightweight OCI-image-based Puppet plans for beta cluster, removed: Underscorre.
May 7 2026, 10:02 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Beta-Cluster-Infrastructure, Discovery-Search (2026.03.03 - 2026.04.03)
bking updated the task description for T425681: Ensure `vm.max_map_count` count is set to an adequately high number in our OpenSearch environments.
May 7 2026, 2:45 PM · Data-Platform-SRE (2026-04-24 - 2026-05-15)
bking updated the task description for T425681: Ensure `vm.max_map_count` count is set to an adequately high number in our OpenSearch environments.
May 7 2026, 2:42 PM · Data-Platform-SRE (2026-04-24 - 2026-05-15)
bking moved T425681: Ensure `vm.max_map_count` count is set to an adequately high number in our OpenSearch environments from Backlog - project to In Progress on the Data-Platform-SRE (2026-04-24 - 2026-05-15) board.
May 7 2026, 2:36 PM · Data-Platform-SRE (2026-04-24 - 2026-05-15)
bking created T425681: Ensure `vm.max_map_count` count is set to an adequately high number in our OpenSearch environments.
May 7 2026, 2:36 PM · Data-Platform-SRE (2026-04-24 - 2026-05-15)