Page MenuHomePhabricator

brouberol (Balthazar Rouberol)
Data Platform SRE

Today

  • No visible events.

Tomorrow

  • No visible events.

Sunday

  • No visible events.

User Details

User Since
Sep 5 2023, 11:23 AM (118 w, 3 d)
Availability
Available
IRC Nick
brouberol
LDAP User
Brouberol
MediaWiki User
BRouberol-WMF [ Global Accounts ]

Recent Activity

Today

brouberol added a comment to T411752: Enable OIDC login for Growthbook.

We've deployed OIDC login for growthbook.wikimedia.org, but it uses idp-test instead of idp at the moment, because cas.authn.oidc.id-token.include-id-token-claims is still set to false in idp.

Fri, Dec 12, 2:01 PM · OKR-Work, Data-Platform-SRE (2025.11.07 - 2025.11.28)

Yesterday

brouberol closed T412355: Validate the airflow-devenv create --branch argument as Resolved.
Thu, Dec 11, 11:54 AM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol moved T412355: Validate the airflow-devenv create --branch argument from In Progress to Done on the Data-Platform-SRE (2025.11.07 - 2025.11.28) board.
Thu, Dec 11, 11:54 AM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol added a comment to T412355: Validate the airflow-devenv create --branch argument.
brouberol@deploy2002:~$ airflow-devenv create --dags-folder ml --branch plop
Usage: airflow-devenv create [OPTIONS]
Try 'airflow-devenv create --help' for help.
Thu, Dec 11, 11:53 AM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol changed the status of T412355: Validate the airflow-devenv create --branch argument from Open to In Progress.
Thu, Dec 11, 11:53 AM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol moved T412355: Validate the airflow-devenv create --branch argument from Backlog - project to In Progress on the Data-Platform-SRE (2025.11.07 - 2025.11.28) board.
Thu, Dec 11, 11:53 AM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol created T412355: Validate the airflow-devenv create --branch argument.
Thu, Dec 11, 10:23 AM · Data-Platform-SRE (2025.11.07 - 2025.11.28)

Wed, Dec 10

brouberol added a comment to T408819: When wikis cannot be exported due to SiteInfo, don't fail them.

Could we set a connection to a Spark Thrift Server?

I don't think we have a Spark Thrift Server running anywhere in the an-* hosts anywhere. We can possibly query Presto instead?

Wed, Dec 10, 2:49 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28), Data-Engineering, DPE-Mediawiki-Content
brouberol placed T408819: When wikis cannot be exported due to SiteInfo, don't fail them up for grabs.
Wed, Dec 10, 10:42 AM · Data-Platform-SRE (2025.11.07 - 2025.11.28), Data-Engineering, DPE-Mediawiki-Content
brouberol added a comment to T408819: When wikis cannot be exported due to SiteInfo, don't fail them.

I've gotten so far as running the following query:

from airflow.decorators import task
from airflow.models.taskinstance import TaskInstance
from airflow.operators.empty import EmptyOperator
from airflow.providers.common.sql.operators.sql import SQLExecuteQueryOperator
from airflow.utils.task_group import TaskGroup
Wed, Dec 10, 10:41 AM · Data-Platform-SRE (2025.11.07 - 2025.11.28), Data-Engineering, DPE-Mediawiki-Content

Tue, Dec 9

brouberol added a comment to T406593: Link Growthbook to the Data Lake.

Hm, I wonder if the choice of wmf.edit_hourly was intentional

Very much not so. I picked a random table in the Superset SQL Lab view!

Tue, Dec 9, 1:47 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28), OKR-Work, Test Kitchen
brouberol changed the status of T406593: Link Growthbook to the Data Lake, a subtask of T405749: [EPIC] Deploy GrowthBook - FY25/26 SDS 2.2, from Open to In Progress.
Tue, Dec 9, 1:47 PM · OKR-Work, Data-Platform-SRE, Test Kitchen, Epic
brouberol changed the status of T406593: Link Growthbook to the Data Lake from Open to In Progress.
Tue, Dec 9, 1:47 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28), OKR-Work, Test Kitchen
brouberol added a comment to T406593: Link Growthbook to the Data Lake.

Screenshot 2025-12-09 at 14.46.22.png (1×2 px, 326 KB)
Indeed!

Tue, Dec 9, 1:46 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28), OKR-Work, Test Kitchen
brouberol moved T411752: Enable OIDC login for Growthbook from In Progress to To Be Deployed on the Data-Platform-SRE (2025.11.07 - 2025.11.28) board.
Tue, Dec 9, 1:45 PM · OKR-Work, Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol added a comment to T411752: Enable OIDC login for Growthbook.

https://growthbook-next.wikimedia.org is now OIDC-authenticated.

Tue, Dec 9, 9:38 AM · OKR-Work, Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol updated subscribers of T411752: Enable OIDC login for Growthbook.

After having studied the CAS documentation and codebase, @SLyngshede-WMF notified me that he felt confident flipping that cas.authn.oidc.id-token.include-id-token-claims flag to true. We'll proceed on idp-test / growtbook-next first.

Tue, Dec 9, 9:32 AM · OKR-Work, Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol added a comment to T411752: Enable OIDC login for Growthbook.

Reporting here after many (mostly failing) experimentations to integrate Growtbook with CAS. TLDR: the only way we have managed to make it work is by flipping cas.authn.oidc.id-token.include-id-token-claims to true globally.

Tue, Dec 9, 9:31 AM · OKR-Work, Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol created P86466 (An Untitled Masterwork).
Tue, Dec 9, 8:13 AM

Fri, Dec 5

brouberol created P86418 (An Untitled Masterwork).
Fri, Dec 5, 8:53 AM

Thu, Dec 4

brouberol created P86411 (An Untitled Masterwork).
Thu, Dec 4, 7:11 PM
brouberol updated subscribers of T411752: Enable OIDC login for Growthbook.

One thing we need to figure out (cc @BTullis @mpopov) is what LDAP group(s) should we require the human to be part of to get entry to growthbook.

Thu, Dec 4, 9:07 AM · OKR-Work, Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol added a subtask for T405749: [EPIC] Deploy GrowthBook - FY25/26 SDS 2.2: T411752: Enable OIDC login for Growthbook.
Thu, Dec 4, 8:55 AM · OKR-Work, Data-Platform-SRE, Test Kitchen, Epic
brouberol created T411752: Enable OIDC login for Growthbook.
Thu, Dec 4, 8:55 AM · OKR-Work, Data-Platform-SRE (2025.11.07 - 2025.11.28)

Tue, Dec 2

brouberol added a comment to T407185: Fix Kafka replicas skew.

Anytime!

Tue, Dec 2, 4:46 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28), Essential-Work, Data-Engineering-Radar, serviceops, Observability-Logging, Data-Engineering
brouberol moved T407185: Fix Kafka replicas skew from To Be Deployed to Done on the Data-Platform-SRE (2025.11.07 - 2025.11.28) board.
Tue, Dec 2, 4:34 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28), Essential-Work, Data-Engineering-Radar, serviceops, Observability-Logging, Data-Engineering
brouberol closed T407185: Fix Kafka replicas skew as Resolved.
Tue, Dec 2, 4:34 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28), Essential-Work, Data-Engineering-Radar, serviceops, Observability-Logging, Data-Engineering
brouberol added a comment to T407185: Fix Kafka replicas skew.

I've switched leadership for the codfw.mediawiki.job.htmlCacheUpdate topic.

Tue, Dec 2, 4:33 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28), Essential-Work, Data-Engineering-Radar, serviceops, Observability-Logging, Data-Engineering
brouberol added a comment to T407185: Fix Kafka replicas skew.

Screenshot 2025-12-02 at 17.12.34.png (1×2 px, 392 KB)
Storage has been rebalanced on kafka-main. I'll let traffic settle, and will see whether we need to adjust leadership for 2005.

Tue, Dec 2, 4:13 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28), Essential-Work, Data-Engineering-Radar, serviceops, Observability-Logging, Data-Engineering
brouberol closed T411499: The command displayed to document how to destroy a devenv is wrong as Resolved.
Tue, Dec 2, 2:25 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol moved T411499: The command displayed to document how to destroy a devenv is wrong from In Progress to Done on the Data-Platform-SRE (2025.11.07 - 2025.11.28) board.
Tue, Dec 2, 2:25 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol added a comment to T411499: The command displayed to document how to destroy a devenv is wrong.

Screenshot 2025-12-02 at 15.24.42.png (772×2 px, 199 KB)
Fixed!

Tue, Dec 2, 2:24 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol added a comment to T407185: Fix Kafka replicas skew.

One thing to note: we're seeing a lot of traffic already coming out of kafka-main2010, and the rebalancing will add new partitions to the broker (not necessary leaders, but still). If the resulting traffic / broker is too skewed and that kafka-main2010 is sending out too much data compared to other brokers., we can set it as a replica instead of a leader of the largest topic in the cluster:

brouberol@kafka-main2008:~/T407185$ kafka topics --describe --topic codfw.mediawiki.job.htmlCacheUpdate
kafka-topics --zookeeper conf2004.codfw.wmnet,conf2005.codfw.wmnet,conf2006.codfw.wmnet/kafka/main-codfw --describe --topic codfw.mediawiki.job.htmlCacheUpdate
Topic:codfw.mediawiki.job.htmlCacheUpdate	PartitionCount:1	ReplicationFactor:3	Configs:
	Topic: codfw.mediawiki.job.htmlCacheUpdate	Partition: 0	Leader: 2005	Replicas: 2005,2003,2001	Isr: 2001,2003,2005
Tue, Dec 2, 2:23 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28), Essential-Work, Data-Engineering-Radar, serviceops, Observability-Logging, Data-Engineering
brouberol moved T411499: The command displayed to document how to destroy a devenv is wrong from Backlog - project to In Progress on the Data-Platform-SRE (2025.11.07 - 2025.11.28) board.
Tue, Dec 2, 2:08 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol changed the status of T411499: The command displayed to document how to destroy a devenv is wrong from Open to In Progress.
Tue, Dec 2, 2:08 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol updated the task description for T411499: The command displayed to document how to destroy a devenv is wrong.
Tue, Dec 2, 2:08 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol created T411499: The command displayed to document how to destroy a devenv is wrong.
Tue, Dec 2, 2:07 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol added a comment to T407185: Fix Kafka replicas skew.
brouberol@kafka-main2008:~/T407185$ kafka reassign-partitions kafka reassign-partitions --reassignment-json-file ./rebalancing.json --execute --throttle 30000000
kafka-reassign-partitions --zookeeper conf2004.codfw.wmnet,conf2005.codfw.wmnet,conf2006.codfw.wmnet/kafka/main-codfw kafka reassign-partitions --reassignment-json-file ./rebalancing.json --execute --throttle 30000000
Current partition replica assignment
Tue, Dec 2, 1:05 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28), Essential-Work, Data-Engineering-Radar, serviceops, Observability-Logging, Data-Engineering

Mon, Dec 1

brouberol moved T411174: Ensure airflow-devenv expose picks a stable port per devenv from In Progress to Done on the Data-Platform-SRE (2025.11.07 - 2025.11.28) board.
Mon, Dec 1, 11:58 AM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol closed T411174: Ensure airflow-devenv expose picks a stable port per devenv as Resolved.
Mon, Dec 1, 11:58 AM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol added a comment to T407185: Fix Kafka replicas skew.

I've generated a rebalancing plan on kafka-main2008 using

brouberol@kafka-main2008:~/T407185$ topicmappr rebalance --topics '.*' --brokers -2 --out-file rebalancing --storage-threshold-gb 1000
Mon, Dec 1, 8:17 AM · Data-Platform-SRE (2025.11.07 - 2025.11.28), Essential-Work, Data-Engineering-Radar, serviceops, Observability-Logging, Data-Engineering
brouberol updated Other Assignee for T407808: Rename xLab to test kitchen, added: brouberol; removed; Sfaci.
Mon, Dec 1, 8:11 AM · Test Kitchen UI, Data-Platform-SRE (2025.11.07 - 2025.11.28), Patch-For-Review, Test Kitchen (Experiment Platform Sprint 16), Essential-Work
brouberol reassigned T407808: Rename xLab to test kitchen from brouberol to Sfaci.
Mon, Dec 1, 8:11 AM · Test Kitchen UI, Data-Platform-SRE (2025.11.07 - 2025.11.28), Patch-For-Review, Test Kitchen (Experiment Platform Sprint 16), Essential-Work
brouberol changed the status of T407805: Rename mpic.wikimedia.org, a subtask of T407803: [Epic] Rename Experimentation Lab/Experiment Platform to... Test Kitchen, from Open to In Progress.
Mon, Dec 1, 8:10 AM · Test Kitchen (Experiment Platform Sprint 16), Essential-Work, Epic
brouberol changed the status of T407805: Rename mpic.wikimedia.org from Open to In Progress.
Mon, Dec 1, 8:10 AM · Data-Platform-SRE (2025.11.07 - 2025.11.28), Patch-For-Review, Test Kitchen (Experiment Platform Sprint 16), Essential-Work
brouberol moved T407805: Rename mpic.wikimedia.org from Backlog - project to In Progress on the Data-Platform-SRE (2025.11.07 - 2025.11.28) board.
Mon, Dec 1, 8:10 AM · Data-Platform-SRE (2025.11.07 - 2025.11.28), Patch-For-Review, Test Kitchen (Experiment Platform Sprint 16), Essential-Work
brouberol closed T411172: Ensure that the airflow-devenv create --dags-folder argument is valid as Resolved.
Mon, Dec 1, 8:04 AM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol moved T411172: Ensure that the airflow-devenv create --dags-folder argument is valid from Needs Review to Done on the Data-Platform-SRE (2025.11.07 - 2025.11.28) board.
Mon, Dec 1, 8:04 AM · Data-Platform-SRE (2025.11.07 - 2025.11.28)

Fri, Nov 28

brouberol changed the status of T407808: Rename xLab to test kitchen from Open to In Progress.
Fri, Nov 28, 9:00 AM · Test Kitchen UI, Data-Platform-SRE (2025.11.07 - 2025.11.28), Patch-For-Review, Test Kitchen (Experiment Platform Sprint 16), Essential-Work
brouberol moved T407808: Rename xLab to test kitchen from Backlog - project to In Progress on the Data-Platform-SRE (2025.11.07 - 2025.11.28) board.
Fri, Nov 28, 9:00 AM · Test Kitchen UI, Data-Platform-SRE (2025.11.07 - 2025.11.28), Patch-For-Review, Test Kitchen (Experiment Platform Sprint 16), Essential-Work
brouberol changed the status of T407808: Rename xLab to test kitchen, a subtask of T407803: [Epic] Rename Experimentation Lab/Experiment Platform to... Test Kitchen, from Open to In Progress.
Fri, Nov 28, 9:00 AM · Test Kitchen (Experiment Platform Sprint 16), Essential-Work, Epic

Thu, Nov 27

brouberol updated Other Assignee for T407805: Rename mpic.wikimedia.org, added: brouberol.
Thu, Nov 27, 9:47 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28), Patch-For-Review, Test Kitchen (Experiment Platform Sprint 16), Essential-Work
brouberol changed the status of T411174: Ensure airflow-devenv expose picks a stable port per devenv from Open to In Progress.
Thu, Nov 27, 12:45 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol moved T411174: Ensure airflow-devenv expose picks a stable port per devenv from Backlog - project to In Progress on the Data-Platform-SRE (2025.11.07 - 2025.11.28) board.
Thu, Nov 27, 12:45 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol claimed T411174: Ensure airflow-devenv expose picks a stable port per devenv.
Thu, Nov 27, 12:45 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol moved T411172: Ensure that the airflow-devenv create --dags-folder argument is valid from In Progress to Needs Review on the Data-Platform-SRE (2025.11.07 - 2025.11.28) board.
Thu, Nov 27, 12:45 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol moved T411172: Ensure that the airflow-devenv create --dags-folder argument is valid from Needs Review to In Progress on the Data-Platform-SRE (2025.11.07 - 2025.11.28) board.
Thu, Nov 27, 12:45 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol created T411174: Ensure airflow-devenv expose picks a stable port per devenv.
Thu, Nov 27, 12:44 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol claimed T411172: Ensure that the airflow-devenv create --dags-folder argument is valid.
Thu, Nov 27, 12:41 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol moved T411172: Ensure that the airflow-devenv create --dags-folder argument is valid from Backlog - project to Needs Review on the Data-Platform-SRE (2025.11.07 - 2025.11.28) board.
Thu, Nov 27, 12:41 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol changed the status of T411172: Ensure that the airflow-devenv create --dags-folder argument is valid from Open to In Progress.
Thu, Nov 27, 12:41 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol created T411172: Ensure that the airflow-devenv create --dags-folder argument is valid.
Thu, Nov 27, 12:33 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol changed the status of T408819: When wikis cannot be exported due to SiteInfo, don't fail them, a subtask of T366752: Dumps 2.0 Phase III: Production level dumps (SDS 1.2), from Open to In Progress.
Thu, Nov 27, 8:14 AM · Data-Engineering-Roadmap, DPE-Mediawiki-Content, Epic
brouberol changed the status of T408819: When wikis cannot be exported due to SiteInfo, don't fail them from Open to In Progress.
Thu, Nov 27, 8:14 AM · Data-Platform-SRE (2025.11.07 - 2025.11.28), Data-Engineering, DPE-Mediawiki-Content
brouberol moved T408819: When wikis cannot be exported due to SiteInfo, don't fail them from Backlog - project to In Progress on the Data-Platform-SRE (2025.11.07 - 2025.11.28) board.
Thu, Nov 27, 8:14 AM · Data-Platform-SRE (2025.11.07 - 2025.11.28), Data-Engineering, DPE-Mediawiki-Content
brouberol claimed T408819: When wikis cannot be exported due to SiteInfo, don't fail them.
Thu, Nov 27, 8:14 AM · Data-Platform-SRE (2025.11.07 - 2025.11.28), Data-Engineering, DPE-Mediawiki-Content
brouberol added a comment to T408819: When wikis cannot be exported due to SiteInfo, don't fail them.

I'm going to experiment with something akin to

Thu, Nov 27, 8:13 AM · Data-Platform-SRE (2025.11.07 - 2025.11.28), Data-Engineering, DPE-Mediawiki-Content

Wed, Nov 26

brouberol added a comment to T411106: Activate the Growthbook licenses.

Screenshot 2025-11-26 at 17.11.01.png (176×2 px, 73 KB)

Wed, Nov 26, 4:12 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol closed T411106: Activate the Growthbook licenses, a subtask of T405749: [EPIC] Deploy GrowthBook - FY25/26 SDS 2.2, as Resolved.
Wed, Nov 26, 4:11 PM · OKR-Work, Data-Platform-SRE, Test Kitchen, Epic
brouberol closed T411106: Activate the Growthbook licenses as Resolved.
Wed, Nov 26, 4:11 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol moved T411106: Activate the Growthbook licenses from Needs Review to Done on the Data-Platform-SRE (2025.11.07 - 2025.11.28) board.
Wed, Nov 26, 4:11 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol changed the status of T411106: Activate the Growthbook licenses, a subtask of T405749: [EPIC] Deploy GrowthBook - FY25/26 SDS 2.2, from Open to In Progress.
Wed, Nov 26, 3:53 PM · OKR-Work, Data-Platform-SRE, Test Kitchen, Epic
brouberol changed the status of T411106: Activate the Growthbook licenses from Open to In Progress.
Wed, Nov 26, 3:53 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol moved T411106: Activate the Growthbook licenses from Backlog - project to Needs Review on the Data-Platform-SRE (2025.11.07 - 2025.11.28) board.
Wed, Nov 26, 3:53 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol added a subtask for T405749: [EPIC] Deploy GrowthBook - FY25/26 SDS 2.2: T411106: Activate the Growthbook licenses.
Wed, Nov 26, 3:52 PM · OKR-Work, Data-Platform-SRE, Test Kitchen, Epic
brouberol created T411106: Activate the Growthbook licenses.
Wed, Nov 26, 3:52 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol added a subtask for T405749: [EPIC] Deploy GrowthBook - FY25/26 SDS 2.2: T410999: Setup a pre-production Growtbook instance.
Wed, Nov 26, 3:50 PM · OKR-Work, Data-Platform-SRE, Test Kitchen, Epic
brouberol added a parent task for T410999: Setup a pre-production Growtbook instance: T405749: [EPIC] Deploy GrowthBook - FY25/26 SDS 2.2.
Wed, Nov 26, 3:50 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol removed a subtask for T406579: Deploy FerretDB for GrowthBook: T410999: Setup a pre-production Growtbook instance.
Wed, Nov 26, 3:50 PM · Data-Platform-SRE (2025.10.17 - 2025.11.07), OKR-Work, Test Kitchen
brouberol removed a parent task for T410999: Setup a pre-production Growtbook instance: T406579: Deploy FerretDB for GrowthBook.
Wed, Nov 26, 3:50 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol added a subtask for T406579: Deploy FerretDB for GrowthBook: T410999: Setup a pre-production Growtbook instance.
Wed, Nov 26, 3:50 PM · Data-Platform-SRE (2025.10.17 - 2025.11.07), OKR-Work, Test Kitchen
brouberol added a parent task for T410999: Setup a pre-production Growtbook instance: T406579: Deploy FerretDB for GrowthBook.
Wed, Nov 26, 3:50 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol added a comment to T410075: Discovery of Cassandra cluster nodes.

Oh, you're right!

Wed, Nov 26, 2:58 PM · Data-Persistence, SRE, Cassandra
brouberol moved T410999: Setup a pre-production Growtbook instance from To Be Deployed to Done on the Data-Platform-SRE (2025.11.07 - 2025.11.28) board.
Wed, Nov 26, 1:37 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol closed T410999: Setup a pre-production Growtbook instance as Resolved.
Wed, Nov 26, 1:37 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol added a comment to T410999: Setup a pre-production Growtbook instance.

https://growthbook-next.wikimedia.org is now up and running. It is connected to the analytics-test-presto cluster.

Screenshot 2025-11-26 at 14.36.49.png (2×3 px, 371 KB)

Wed, Nov 26, 1:37 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol moved T410999: Setup a pre-production Growtbook instance from Needs Review to To Be Deployed on the Data-Platform-SRE (2025.11.07 - 2025.11.28) board.
Wed, Nov 26, 8:18 AM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol added a comment to T410075: Discovery of Cassandra cluster nodes.

Naïve q, piggybacking on @Eevans 's response: what about a DNS domain resolving to the node IPs? If we have a recent enough version, we can let the client perform the DNS resolution and trial of the different resolved node IPs, as per https://issues.apache.org/jira/browse/CASSANDRA-14361

Wed, Nov 26, 8:03 AM · Data-Persistence, SRE, Cassandra

Tue, Nov 25

brouberol moved T405943: eqiad row C/D Data Platform host migrations from Blocked/Waiting to Done on the Data-Platform-SRE (2025.11.07 - 2025.11.28) board.
Tue, Nov 25, 4:48 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28), Essential-Work, SRE, DC-Ops, ops-eqiad
brouberol placed T213561: Discovery for Kafka cluster brokers up for grabs.
Tue, Nov 25, 4:36 PM · Data-Engineering-Radar, Data-Platform-SRE, Data-Engineering, SRE, Services (watching), Event-Platform, Analytics
brouberol added a comment to T410999: Setup a pre-production Growtbook instance.

Serets have been provisioned. I'll just need to update the DB password once the DB itself is provisioned.

Tue, Nov 25, 11:11 AM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol added a comment to T410999: Setup a pre-production Growtbook instance.
brouberol@krb1002:~$ sudo kadmin.local addprinc -randkey HTTP/growthbook-next.discovery.wmnet@WIKIMEDIA
brouberol@krb1002:~$ sudo kadmin.local addprinc -randkey growthbook/growthbook-next.discovery.wmnet@WIKIMEDIA
brouberol@krb1002:~$ sudo kadmin.local ktadd -norandkey -k growthbook-next-backend.keytab HTTP/growthbook-next.discovery.wmnet@WIKIMEDIA growthbook/growthbook-next.discovery.wmnet@WIKIMEDIA
Entry for principal HTTP/growthbook-next.discovery.wmnet@WIKIMEDIA with kvno 1, encryption type aes256-cts-hmac-sha1-96 added to keytab WRFILE:growthbook-next-backend.keytab.
Entry for principal growthbook/growthbook-next.discovery.wmnet@WIKIMEDIA with kvno 1, encryption type aes256-cts-hmac-sha1-96 added to keytab WRFILE:growthbook-next-backend.keytab.
Tue, Nov 25, 11:08 AM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol added a comment to T410999: Setup a pre-production Growtbook instance.
brouberol@stat1008:~$ s3cmd --access_key=$access_key --secret_key=$secret_key --host=rgw.eqiad.dpe.anycast.wmnet --region=dpe --host-bucket=no mb s3://postgresql-growthbook-next.dse-k8s-eqiad
Bucket 's3://postgresql-growthbook-next.dse-k8s-eqiad/' created
Tue, Nov 25, 11:05 AM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol added a comment to T410999: Setup a pre-production Growtbook instance.
brouberol@cephosd1001:~$ sudo radosgw-admin user create --uid=postgresql-growthbook-next --display-name="postgresql-growthbook-next"
{
    "user_id": "postgresql-growthbook-next",
    "display_name": "postgresql-growthbook-next",
    "email": "",
    "suspended": 0,
    "max_buckets": 1000,
    "subusers": [],
    "keys": [
        {
            "user": "postgresql-growthbook-next",
            "access_key": "[REDACTED]",
            "secret_key": "[REDACTED]"
        }
    ],
    "swift_keys": [],
    "caps": [],
    "op_mask": "read, write, delete",
    "default_placement": "",
    "default_storage_class": "",
    "placement_tags": [],
    "bucket_quota": {
        "enabled": false,
        "check_on_raw": false,
        "max_size": -1,
        "max_size_kb": 0,
        "max_objects": -1
    },
    "user_quota": {
        "enabled": false,
        "check_on_raw": false,
        "max_size": -1,
        "max_size_kb": 0,
        "max_objects": -1
    },
    "temp_url_keys": [],
    "type": "rgw",
    "mfa_ids": []
}
Tue, Nov 25, 11:03 AM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol moved T410999: Setup a pre-production Growtbook instance from In Progress to Needs Review on the Data-Platform-SRE (2025.11.07 - 2025.11.28) board.
Tue, Nov 25, 10:59 AM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol changed the status of T410999: Setup a pre-production Growtbook instance from Open to In Progress.
Tue, Nov 25, 10:30 AM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol moved T410999: Setup a pre-production Growtbook instance from Backlog - project to In Progress on the Data-Platform-SRE (2025.11.07 - 2025.11.28) board.
Tue, Nov 25, 10:30 AM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol created T410999: Setup a pre-production Growtbook instance.
Tue, Nov 25, 10:30 AM · Data-Platform-SRE (2025.11.07 - 2025.11.28)
brouberol closed T410966: Bump hadoop container maximum memory size as Resolved.
Tue, Nov 25, 10:00 AM · Data-Platform-SRE (2025.11.07 - 2025.11.28), Data-Engineering
brouberol moved T410966: Bump hadoop container maximum memory size from To Be Deployed to Done on the Data-Platform-SRE (2025.11.07 - 2025.11.28) board.
Tue, Nov 25, 10:00 AM · Data-Platform-SRE (2025.11.07 - 2025.11.28), Data-Engineering