Page MenuHomePhabricator

JMonton-WMF (Javier Montón)
User

Projects (1)

Today

  • No visible events.

Tomorrow

  • No visible events.

Sunday

  • No visible events.

User Details

User Since
Oct 1 2025, 2:42 PM (19 w, 2 d)
Availability
Available
LDAP User
Unknown
MediaWiki User
JMonton-WMF [ Global Accounts ]

Recent Activity

Today

JMonton-WMF moved T417184: Check HTML Content Size from Next Up to In progress on the Data-Engineering (Q3 FY25/26 January 1st - March 31th) board.
Fri, Feb 13, 3:09 PM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Event-Platform
JMonton-WMF moved T417152: Sqlfluff Rules for dbt from In Review to Done on the Data-Engineering (Q3 FY25/26 January 1st - March 31th) board.
Fri, Feb 13, 3:09 PM · Data-Engineering (Q3 FY25/26 January 1st - March 31th)
JMonton-WMF moved T417171: Sqlufluff on Stat hosts from In progress to In Review on the Data-Engineering (Q3 FY25/26 January 1st - March 31th) board.
Fri, Feb 13, 3:09 PM · Data-Engineering (Q3 FY25/26 January 1st - March 31th)
JMonton-WMF added a comment to T417171: Sqlufluff on Stat hosts.

If installing these tools is a concern, and we don't need to provide sqlfluff to many people, maybe we can just add an explanation of how to install it in a new virtual environment, or as part of a Makefile, so anyone that needs it, can install it only for them.

Fri, Feb 13, 12:17 PM · Data-Engineering (Q3 FY25/26 January 1st - March 31th)

Yesterday

JMonton-WMF added a comment to T417152: Sqlfluff Rules for dbt.

For most of the rules, we can take the ones used in Airflow, but there are some rules that @Mayakp.wiki would like to change a bit.

Thu, Feb 12, 10:06 AM · Data-Engineering (Q3 FY25/26 January 1st - March 31th)

Wed, Feb 11

JMonton-WMF created T417184: Check HTML Content Size.
Wed, Feb 11, 3:27 PM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Event-Platform
JMonton-WMF moved T417171: Sqlufluff on Stat hosts from Next Up to In progress on the Data-Engineering (Q3 FY25/26 January 1st - March 31th) board.
Wed, Feb 11, 2:09 PM · Data-Engineering (Q3 FY25/26 January 1st - March 31th)
JMonton-WMF moved T417152: Sqlfluff Rules for dbt from In progress to In Review on the Data-Engineering (Q3 FY25/26 January 1st - March 31th) board.
Wed, Feb 11, 2:09 PM · Data-Engineering (Q3 FY25/26 January 1st - March 31th)
JMonton-WMF created T417171: Sqlufluff on Stat hosts.
Wed, Feb 11, 2:08 PM · Data-Engineering (Q3 FY25/26 January 1st - March 31th)
JMonton-WMF moved T417152: Sqlfluff Rules for dbt from Next Up to In progress on the Data-Engineering (Q3 FY25/26 January 1st - March 31th) board.
Wed, Feb 11, 11:05 AM · Data-Engineering (Q3 FY25/26 January 1st - March 31th)
JMonton-WMF removed a project from T417152: Sqlfluff Rules for dbt: Epic.
Wed, Feb 11, 11:04 AM · Data-Engineering (Q3 FY25/26 January 1st - March 31th)
JMonton-WMF created T417152: Sqlfluff Rules for dbt.
Wed, Feb 11, 11:04 AM · Data-Engineering (Q3 FY25/26 January 1st - March 31th)

Mon, Feb 9

JMonton-WMF moved T416672: dbt repository structure (Milestone 3) from Next Up to In progress on the Data-Engineering (Q3 FY25/26 January 1st - March 31th) board.
Mon, Feb 9, 4:08 PM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Movement-Insights (FY25-26 H2)
JMonton-WMF added a comment to T416672: dbt repository structure (Milestone 3).

Let me make sure I understand: when we say "subfolders by projects", these are not distinct dbt projects right? The repository itself is the dbt project, and the subfolders are units of organization that the team defines?

That's right, there is a single dbt project, folders are just ways of organizing models.

Mon, Feb 9, 4:01 PM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Movement-Insights (FY25-26 H2)

Fri, Feb 6

JMonton-WMF added a comment to T416672: dbt repository structure (Milestone 3).

There are many different ways of configuring a dbt project to work with multiple teams and projects, here is a list of some features that can allow different configurations, and a proposal.

Fri, Feb 6, 4:17 PM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Movement-Insights (FY25-26 H2)
JMonton-WMF updated the task description for T416672: dbt repository structure (Milestone 3).
Fri, Feb 6, 11:14 AM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Movement-Insights (FY25-26 H2)
JMonton-WMF removed a subtask for T369226: [MI 1] Reporting, Data Visualization, and Data Communications: T416672: dbt repository structure (Milestone 3).
Fri, Feb 6, 11:12 AM · Movement-Insights (FY25-26 H2), Epic
JMonton-WMF added a subtask for T416679: dbt DPE work: T416672: dbt repository structure (Milestone 3).
Fri, Feb 6, 11:12 AM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Epic
JMonton-WMF edited parent tasks for T416672: dbt repository structure (Milestone 3), added: T416679: dbt DPE work; removed: T369226: [MI 1] Reporting, Data Visualization, and Data Communications.
Fri, Feb 6, 11:12 AM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Movement-Insights (FY25-26 H2)
JMonton-WMF created T416679: dbt DPE work.
Fri, Feb 6, 11:11 AM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Epic
JMonton-WMF updated the task description for T416672: dbt repository structure (Milestone 3).
Fri, Feb 6, 10:57 AM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Movement-Insights (FY25-26 H2)
JMonton-WMF added a project to T416672: dbt repository structure (Milestone 3): Data-Engineering (Q3 FY25/26 January 1st - March 31th).
Fri, Feb 6, 10:43 AM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Movement-Insights (FY25-26 H2)
JMonton-WMF created T416672: dbt repository structure (Milestone 3).
Fri, Feb 6, 10:43 AM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Movement-Insights (FY25-26 H2)
JMonton-WMF moved T360794: Implement stream of HTML content on mw.page_change event from In progress to In Review on the Data-Engineering (Q3 FY25/26 January 1st - March 31th) board.
Fri, Feb 6, 10:19 AM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Patch-For-Review, Research, Event-Platform

Mon, Feb 2

JMonton-WMF moved T360794: Implement stream of HTML content on mw.page_change event from Blocked/Paused to In progress on the Data-Engineering (Q3 FY25/26 January 1st - March 31th) board.
Mon, Feb 2, 3:44 PM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Patch-For-Review, Research, Event-Platform

Fri, Jan 30

JMonton-WMF added a comment to T408146: [Hypothesis] WE1.5.1 Contributor metrics dashboard.

Weekly update from the Data Engineering team:

Fri, Jan 30, 11:24 AM · OKR-Work (WE1 FY2025-26), Movement-Insights (FY25-26 H1)

Tue, Jan 27

JMonton-WMF added a comment to T360794: Implement stream of HTML content on mw.page_change event.

@Ottomata I've updated the MR to the latest structure inside the Python folder. I'm assuming the version .dev0 would be set in the helm files, so I'll start working on it.
Maybe the MR could be merged after a review, and we can iterate over it with the new schema after it's decided in T415158

Tue, Jan 27, 12:12 PM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Patch-For-Review, Research, Event-Platform

Mon, Jan 26

JMonton-WMF moved T415338: Update Blunderbuss-bugler from In Review to Done on the Data-Engineering (Q3 FY25/26 January 1st - March 31th) board.
Mon, Jan 26, 3:06 PM · OKR-Work, Data-Engineering (Q3 FY25/26 January 1st - March 31th)
JMonton-WMF moved T415338: Update Blunderbuss-bugler from In progress to In Review on the Data-Engineering (Q3 FY25/26 January 1st - March 31th) board.
Mon, Jan 26, 9:57 AM · OKR-Work, Data-Engineering (Q3 FY25/26 January 1st - March 31th)

Fri, Jan 23

JMonton-WMF added a comment to T408146: [Hypothesis] WE1.5.1 Contributor metrics dashboard.

Weekly update from the Data Engineering team:

Fri, Jan 23, 2:56 PM · OKR-Work (WE1 FY2025-26), Movement-Insights (FY25-26 H1)
JMonton-WMF updated the task description for T412978: Support for Java 25 and Flink 2.
Fri, Jan 23, 2:43 PM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Patch-For-Review
JMonton-WMF moved T412978: Support for Java 25 and Flink 2 from In progress to Blocked/Paused on the Data-Engineering (Q3 FY25/26 January 1st - March 31th) board.
Fri, Jan 23, 7:20 AM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Patch-For-Review
JMonton-WMF moved T415338: Update Blunderbuss-bugler from Next Up to In progress on the Data-Engineering (Q3 FY25/26 January 1st - March 31th) board.
Fri, Jan 23, 7:20 AM · OKR-Work, Data-Engineering (Q3 FY25/26 January 1st - March 31th)
JMonton-WMF created T415338: Update Blunderbuss-bugler.
Fri, Jan 23, 7:19 AM · OKR-Work, Data-Engineering (Q3 FY25/26 January 1st - March 31th)

Tue, Jan 20

JMonton-WMF added a comment to T414784: Test the dbt+skein approach to running dbt Spark jobs in K8s.

Just a couple of comments:

Tue, Jan 20, 3:53 PM · Data-Engineering (Q3 FY25/26 January 1st - March 31th)

Fri, Jan 16

JMonton-WMF added a comment to T408146: [Hypothesis] WE1.5.1 Contributor metrics dashboard.

Weekly update from the Data Engineering team:

Fri, Jan 16, 8:21 AM · OKR-Work (WE1 FY2025-26), Movement-Insights (FY25-26 H1)

Jan 8 2026

JMonton-WMF moved T411598: Increase partitions of mediawiki.content_history_reconcile.v1 from In progress to Done on the Data-Engineering (Q2 FY25/26 October 1st - December 31th) board.
Jan 8 2026, 10:11 AM · Data-Engineering (Q2 FY25/26 October 1st - December 31th)
JMonton-WMF updated the task description for T411598: Increase partitions of mediawiki.content_history_reconcile.v1.
Jan 8 2026, 10:11 AM · Data-Engineering (Q2 FY25/26 October 1st - December 31th)
JMonton-WMF added a comment to T411598: Increase partitions of mediawiki.content_history_reconcile.v1.

The topics have now 3 partitions. We are not changing the number of tasks on Flink for now, but this change will allow us to parallelize better when needed.

Jan 8 2026, 10:10 AM · Data-Engineering (Q2 FY25/26 October 1st - December 31th)
JMonton-WMF moved T411598: Increase partitions of mediawiki.content_history_reconcile.v1 from Blocked/Paused to In progress on the Data-Engineering (Q2 FY25/26 October 1st - December 31th) board.
Jan 8 2026, 9:44 AM · Data-Engineering (Q2 FY25/26 October 1st - December 31th)
JMonton-WMF added a comment to T411774: Requesting a new group allowing shell access to kafka-jumbo servers - with membership for JavierMonton.

Many thanks @MoritzMuehlenhoff !

Jan 8 2026, 9:26 AM · Data-Platform-SRE (2026.01.05 - 2026.01.23), Essential-Work, Infrastructure-Foundations

Dec 19 2025

JMonton-WMF added a comment to T408146: [Hypothesis] WE1.5.1 Contributor metrics dashboard.

Weekly update from the Data Engineering team:

Dec 19 2025, 10:12 AM · OKR-Work (WE1 FY2025-26), Movement-Insights (FY25-26 H1)
JMonton-WMF updated the task description for T412978: Support for Java 25 and Flink 2.
Dec 19 2025, 9:14 AM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Patch-For-Review

Dec 18 2025

JMonton-WMF updated the task description for T412978: Support for Java 25 and Flink 2.
Dec 18 2025, 11:22 AM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Patch-For-Review

Dec 17 2025

JMonton-WMF moved T410266: Explore how to migrate PyFlink to Java/Scala from In progress to Next Up on the Data-Engineering (Q2 FY25/26 October 1st - December 31th) board.
Dec 17 2025, 4:12 PM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Patch-For-Review, Spike, Event-Platform
JMonton-WMF moved T412978: Support for Java 25 and Flink 2 from Next Up to In progress on the Data-Engineering (Q2 FY25/26 October 1st - December 31th) board.
Dec 17 2025, 4:12 PM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Patch-For-Review
JMonton-WMF edited projects for T412978: Support for Java 25 and Flink 2, added: Data-Engineering (Q2 FY25/26 October 1st - December 31th); removed Data-Engineering.
Dec 17 2025, 4:11 PM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Patch-For-Review
JMonton-WMF edited projects for T412978: Support for Java 25 and Flink 2, added: Data-Engineering; removed Data-Engineering (Q2 FY25/26 October 1st - December 31th), Java-Scala-Standardization.
Dec 17 2025, 4:11 PM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Patch-For-Review
JMonton-WMF edited projects for T412978: Support for Java 25 and Flink 2, added: Data-Engineering (Q2 FY25/26 October 1st - December 31th); removed Data-Engineering.
Dec 17 2025, 4:11 PM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Patch-For-Review
JMonton-WMF created T412978: Support for Java 25 and Flink 2.
Dec 17 2025, 4:10 PM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Patch-For-Review
JMonton-WMF created T412921: Java parent pom: CI Test multiple Java versions.
Dec 17 2025, 9:16 AM · Data-Engineering, Java-Scala-Standardization

Dec 12 2025

JMonton-WMF moved T409105: mediawiki.page_change.v1 event stream - Investigate mistmatched meta.dt and dt (and rev_dt) fields from In progress to Blocked/Paused on the Data-Engineering (Q2 FY25/26 October 1st - December 31th) board.
Dec 12 2025, 11:09 AM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), MW-Interfaces-Team, Event-Platform
JMonton-WMF moved T360794: Implement stream of HTML content on mw.page_change event from Blocked/Paused to In progress on the Data-Engineering (Q2 FY25/26 October 1st - December 31th) board.
Dec 12 2025, 11:09 AM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Patch-For-Review, Research, Event-Platform
JMonton-WMF added a comment to T408146: [Hypothesis] WE1.5.1 Contributor metrics dashboard.

Weekly update from the Data Engineering team:

Dec 12 2025, 9:08 AM · OKR-Work (WE1 FY2025-26), Movement-Insights (FY25-26 H1)
JMonton-WMF added a comment to T411598: Increase partitions of mediawiki.content_history_reconcile.v1.

Thanks for the info @xcollazo. This is actually a good example for this ticket, even if we increase to 20 TaskManagers temporarily, the throughput is limited by the topic having only 1 partition. Having 3 partitions should help in similar scenarios.

Dec 12 2025, 9:00 AM · Data-Engineering (Q2 FY25/26 October 1st - December 31th)

Dec 11 2025

JMonton-WMF added a comment to T410266: Explore how to migrate PyFlink to Java/Scala.

After some conversations, we think there are different paths related to this:

Dec 11 2025, 2:57 PM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Patch-For-Review, Spike, Event-Platform
JMonton-WMF added a comment to T409105: mediawiki.page_change.v1 event stream - Investigate mistmatched meta.dt and dt (and rev_dt) fields.

I think this might be a bit more complex that it seems. I've been looking at the import events in Mediawiki, and it seems that there are 2 different fields:

  • EventType: Which can be PageCreated, PageDeleted, PageRevisionUpdated, etc.
  • Cause: Which can be edit, move, delete, import, rollback, etc.
Dec 11 2025, 11:02 AM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), MW-Interfaces-Team, Event-Platform

Dec 10 2025

JMonton-WMF moved T409105: mediawiki.page_change.v1 event stream - Investigate mistmatched meta.dt and dt (and rev_dt) fields from Next Up to In progress on the Data-Engineering (Q2 FY25/26 October 1st - December 31th) board.
Dec 10 2025, 3:22 PM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), MW-Interfaces-Team, Event-Platform
JMonton-WMF claimed T409105: mediawiki.page_change.v1 event stream - Investigate mistmatched meta.dt and dt (and rev_dt) fields.
Dec 10 2025, 3:21 PM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), MW-Interfaces-Team, Event-Platform

Dec 9 2025

JMonton-WMF added a comment to T410266: Explore how to migrate PyFlink to Java/Scala.

Good point @GGoncalves-WMF, I got lost after many Slack messages. I'll keep Java and Maven as "decided", and we can discuss more about the use cases.

Dec 9 2025, 2:55 PM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Patch-For-Review, Spike, Event-Platform
JMonton-WMF updated the task description for T410266: Explore how to migrate PyFlink to Java/Scala.
Dec 9 2025, 2:33 PM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Patch-For-Review, Spike, Event-Platform
JMonton-WMF updated the task description for T410266: Explore how to migrate PyFlink to Java/Scala.
Dec 9 2025, 10:34 AM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Patch-For-Review, Spike, Event-Platform
JMonton-WMF added a comment to T410266: Explore how to migrate PyFlink to Java/Scala.

After a conversation in Slack, it looks like the majority of the people agree with at least a few things: We'll do the migration, and we'll use Java and Maven. I'll create some subtasks describing the work needed.

Dec 9 2025, 10:32 AM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Patch-For-Review, Spike, Event-Platform

Dec 5 2025

JMonton-WMF added a comment to T408146: [Hypothesis] WE1.5.1 Contributor metrics dashboard.

Weekly update from the Data Engineering team:

Dec 5 2025, 4:05 PM · OKR-Work (WE1 FY2025-26), Movement-Insights (FY25-26 H1)
JMonton-WMF moved T411598: Increase partitions of mediawiki.content_history_reconcile.v1 from In progress to Blocked/Paused on the Data-Engineering (Q2 FY25/26 October 1st - December 31th) board.
Dec 5 2025, 11:48 AM · Data-Engineering (Q2 FY25/26 October 1st - December 31th)
JMonton-WMF added a comment to T411774: Requesting a new group allowing shell access to kafka-jumbo servers - with membership for JavierMonton.

Completely agree with that @elukey, thanks for the help!

Dec 5 2025, 10:14 AM · Data-Platform-SRE (2026.01.05 - 2026.01.23), Essential-Work, Infrastructure-Foundations

Dec 4 2025

JMonton-WMF added a comment to T411598: Increase partitions of mediawiki.content_history_reconcile.v1.

That's interesting.
In general, I wouldn't reduce the number of partitions of a topic to slow down a pipeline, I assume we could slow down the pipeline by not increasing the parallelism of the Flink application, or adding some kind of back off system in the process. But I understand the concern. So, at least it seems that it doesn't make sense to increase the number of workers of the PyFlink application, and if we don't need that, one of the reasons to increase partitions is also not needed.

Dec 4 2025, 5:31 PM · Data-Engineering (Q2 FY25/26 October 1st - December 31th)
JMonton-WMF added a comment to T411774: Requesting a new group allowing shell access to kafka-jumbo servers - with membership for JavierMonton.

Hi @elukey!
We don't need this often to be honest, maybe it's more about being able to help the DP SRE team with small tasks rather than giving them more work, but I totally understand the concerns.

Dec 4 2025, 3:42 PM · Data-Platform-SRE (2026.01.05 - 2026.01.23), Essential-Work, Infrastructure-Foundations
JMonton-WMF added a comment to T411774: Requesting a new group allowing shell access to kafka-jumbo servers - with membership for JavierMonton.

In case this is approved, I created the patch I believe is needed, to help with the process.

Dec 4 2025, 1:15 PM · Data-Platform-SRE (2026.01.05 - 2026.01.23), Essential-Work, Infrastructure-Foundations
JMonton-WMF created T411774: Requesting a new group allowing shell access to kafka-jumbo servers - with membership for JavierMonton.
Dec 4 2025, 12:57 PM · Data-Platform-SRE (2026.01.05 - 2026.01.23), Essential-Work, Infrastructure-Foundations
JMonton-WMF added a comment to T411598: Increase partitions of mediawiki.content_history_reconcile.v1.

We had a discussion on Slack, we'll start with 3 partitions, it should help balance the storage across the brokers, speed up the process, and we don't need many partitions because this process runs only once a month.

Dec 4 2025, 9:31 AM · Data-Engineering (Q2 FY25/26 October 1st - December 31th)
JMonton-WMF moved T360794: Implement stream of HTML content on mw.page_change event from In progress to Blocked/Paused on the Data-Engineering (Q2 FY25/26 October 1st - December 31th) board.
Dec 4 2025, 9:22 AM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Patch-For-Review, Research, Event-Platform

Dec 3 2025

JMonton-WMF moved T410266: Explore how to migrate PyFlink to Java/Scala from Next Up to In progress on the Data-Engineering (Q2 FY25/26 October 1st - December 31th) board.
Dec 3 2025, 2:52 PM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Patch-For-Review, Spike, Event-Platform
JMonton-WMF added a comment to T276088: Configuration Management for Kafka settings.

Another option: I worked in the past with https://github.com/devshawn/kafka-gitops to manage all topic settings from the CI/CD and it worked pretty well. The repo seems to be a bit old now, but it does the job.

Dec 3 2025, 11:08 AM · Data-Platform-SRE, Data-Engineering, serviceops-radar, Event-Platform, Analytics-Radar, SRE
JMonton-WMF added a comment to T411598: Increase partitions of mediawiki.content_history_reconcile.v1.

A question we haven't solved yet is: How many partitions should we use?

Dec 3 2025, 10:56 AM · Data-Engineering (Q2 FY25/26 October 1st - December 31th)
JMonton-WMF moved T411598: Increase partitions of mediawiki.content_history_reconcile.v1 from Next Up to In progress on the Data-Engineering (Q2 FY25/26 October 1st - December 31th) board.
Dec 3 2025, 10:52 AM · Data-Engineering (Q2 FY25/26 October 1st - December 31th)
JMonton-WMF created T411598: Increase partitions of mediawiki.content_history_reconcile.v1.
Dec 3 2025, 10:50 AM · Data-Engineering (Q2 FY25/26 October 1st - December 31th)

Nov 28 2025

JMonton-WMF added a comment to T408146: [Hypothesis] WE1.5.1 Contributor metrics dashboard.

Weekly update from the Data Engineering team:

Nov 28 2025, 9:07 AM · OKR-Work (WE1 FY2025-26), Movement-Insights (FY25-26 H1)

Nov 26 2025

JMonton-WMF moved T377023: Add CI step to event schema repositories to test to fail if a schema is deleted from In Review to Done on the Data-Engineering (Q2 FY25/26 October 1st - December 31th) board.
Nov 26 2025, 2:37 PM · Patch-For-Review, Data-Engineering (Q2 FY25/26 October 1st - December 31th), Event-Platform
JMonton-WMF moved T360794: Implement stream of HTML content on mw.page_change event from Next Up to In progress on the Data-Engineering (Q2 FY25/26 October 1st - December 31th) board.
Nov 26 2025, 10:52 AM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Patch-For-Review, Research, Event-Platform

Nov 25 2025

JMonton-WMF moved T377023: Add CI step to event schema repositories to test to fail if a schema is deleted from In progress to In Review on the Data-Engineering (Q2 FY25/26 October 1st - December 31th) board.
Nov 25 2025, 2:20 PM · Patch-For-Review, Data-Engineering (Q2 FY25/26 October 1st - December 31th), Event-Platform
JMonton-WMF added a comment to T377023: Add CI step to event schema repositories to test to fail if a schema is deleted.

Thanks both! I've added the changes. Now it requires a manual execution and it can be bypassed by adding the CI_BREAK_GLASS_REASON.

Nov 25 2025, 11:49 AM · Patch-For-Review, Data-Engineering (Q2 FY25/26 October 1st - December 31th), Event-Platform

Nov 24 2025

JMonton-WMF claimed T360794: Implement stream of HTML content on mw.page_change event.
Nov 24 2025, 5:58 PM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Patch-For-Review, Research, Event-Platform
JMonton-WMF added a comment to T377023: Add CI step to event schema repositories to test to fail if a schema is deleted.

In aims to move this check to the CI, but keeping it standard from the jsonschema-tools repository, I'd like to go for this proposal:

Nov 24 2025, 1:29 PM · Patch-For-Review, Data-Engineering (Q2 FY25/26 October 1st - December 31th), Event-Platform

Nov 21 2025

JMonton-WMF added a comment to T377023: Add CI step to event schema repositories to test to fail if a schema is deleted.

Some days ago I created a new CI check in a MR to check for deletions, but @Ottomata suggested that it could be moved to the jsonschema-tools repository.
The issue with this MR is that it should be created on every repository using json-schemas.

Nov 21 2025, 3:55 PM · Patch-For-Review, Data-Engineering (Q2 FY25/26 October 1st - December 31th), Event-Platform
JMonton-WMF moved T377023: Add CI step to event schema repositories to test to fail if a schema is deleted from Next Up to In progress on the Data-Engineering (Q2 FY25/26 October 1st - December 31th) board.
Nov 21 2025, 2:01 PM · Patch-For-Review, Data-Engineering (Q2 FY25/26 October 1st - December 31th), Event-Platform
JMonton-WMF added a comment to T408146: [Hypothesis] WE1.5.1 Contributor metrics dashboard.

Weekly update from the Data Engineering team:

Nov 21 2025, 11:15 AM · OKR-Work (WE1 FY2025-26), Movement-Insights (FY25-26 H1)
JMonton-WMF added a comment to T360794: Implement stream of HTML content on mw.page_change event.

Hi all,
I have some room to work on Event Platform tasks and I could take the work done on this one and try to push it to the finish line.
As the ticket is a bit old, I'd like to confirm if this is still needed.

Nov 21 2025, 11:01 AM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Patch-For-Review, Research, Event-Platform
JMonton-WMF moved T410268: Run dbt from Airflow from In progress to Blocked/Paused on the Data-Engineering (Q2 FY25/26 October 1st - December 31th) board.
Nov 21 2025, 9:56 AM · Data-Platform-SRE (2026.01.23 - 2026.02.13), OKR-Work, Data-Engineering (Q3 FY25/26 January 1st - March 31th), Patch-For-Review, Movement-Insights

Nov 17 2025

JMonton-WMF moved T410268: Run dbt from Airflow from Next Up to In progress on the Data-Engineering (Q2 FY25/26 October 1st - December 31th) board.
Nov 17 2025, 11:12 AM · Data-Platform-SRE (2026.01.23 - 2026.02.13), OKR-Work, Data-Engineering (Q3 FY25/26 January 1st - March 31th), Patch-For-Review, Movement-Insights
JMonton-WMF removed a project from T410268: Run dbt from Airflow: Epic.
Nov 17 2025, 11:11 AM · Data-Platform-SRE (2026.01.23 - 2026.02.13), OKR-Work, Data-Engineering (Q3 FY25/26 January 1st - March 31th), Patch-For-Review, Movement-Insights
JMonton-WMF created T410268: Run dbt from Airflow.
Nov 17 2025, 11:11 AM · Data-Platform-SRE (2026.01.23 - 2026.02.13), OKR-Work, Data-Engineering (Q3 FY25/26 January 1st - March 31th), Patch-For-Review, Movement-Insights
JMonton-WMF created T410266: Explore how to migrate PyFlink to Java/Scala.
Nov 17 2025, 10:46 AM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Patch-For-Review, Spike, Event-Platform
JMonton-WMF added a comment to T409469: Enable ChangeProp to consume mediawiki.page_content_change.v1.

That sounds good. Then, we could consider increasing the partitions in Jumbo too, codfw.mediawiki.page_content_change.v1 and eqiad.mediawiki.page_content_change.v1 both have 3 partitions right now. I'm not sure if any consumer In Jumbo relies on the message ordering that could be affected by the change in partitions, I'm guessing not.

Nov 17 2025, 9:19 AM · Data-Engineering, serviceops, Machine-Learning-Team

Nov 14 2025

JMonton-WMF added a comment to T409469: Enable ChangeProp to consume mediawiki.page_content_change.v1.

@Ottomata, what work is required to produce mediawiki.page_content_change.v1 to Kafka main? I'm expecting just some helmfile changes, for example, in the mw-page-content-change-enrich/values-codfw.yaml, and not requiring changes in mediawiki event enrichment code, right?

Nov 14 2025, 2:53 PM · Data-Engineering, serviceops, Machine-Learning-Team
JMonton-WMF added a comment to T408178: Decommission the Wikistories instruments.

Hive tables event.mediawiki_wikistories_consumption_event, event.mediawiki_wikistories_contribution_event and the associated data in HDFS are removed.

Nov 14 2025, 2:09 PM · Test Kitchen (Experiment Platform Sprint 15), MW-1.46-notes (1.46.0-wmf.2; 2025-11-12), Essential-Work, Wikistories

Nov 13 2025

JMonton-WMF moved T409054: Explore a local dbt environment setup (independent from Conda) from In Review to Done on the Data-Engineering (Q2 FY25/26 October 1st - December 31th) board.
Nov 13 2025, 11:53 AM · Data-Engineering (Q2 FY25/26 October 1st - December 31th)
JMonton-WMF moved T406636: Create a dbt Docker container from In Review to Done on the Data-Engineering (Q2 FY25/26 October 1st - December 31th) board.
Nov 13 2025, 11:53 AM · Data-Platform-SRE (2025.11.07 - 2025.11.28), OKR-Work, Data-Engineering (Q2 FY25/26 October 1st - December 31th)

Nov 12 2025

JMonton-WMF updated the task description for T407779: mediawiki_event_enrichment - update default params and tests to use mediawiki/page_change 1.3.0 (latest) schema.
Nov 12 2025, 11:20 AM · Data-Engineering (Q2 FY25/26 October 1st - December 31th), Patch-For-Review, Event-Platform
JMonton-WMF moved T409099: Iceberg Merge strategies with dbt from In progress to In Review on the Data-Engineering (Q2 FY25/26 October 1st - December 31th) board.
Nov 12 2025, 10:26 AM · Data-Engineering (Q2 FY25/26 October 1st - December 31th)