
Ottomata (Andrew Otto)
User

User Details

User Since
Oct 9 2014, 4:50 PM (601 w, 6 h)
Availability
Available
IRC Nick
ottomata
LDAP User
Ottomata
MediaWiki User
Ottomata

Recent Activity

Today

Ottomata updated the task description for T422928: HTML Pipeline - Performance improvements.
Thu, Apr 16, 7:22 PM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform
Ottomata added a comment to T421216: HTML Enrichment - Tuning & Backfilling configuration.

I fixed the process function metrics. I accidentally renamed them.

Errr, I thought I fixed them, but I don't see them. They were fixed locally. Hm. Will follow up tomorrow.

Thu, Apr 16, 7:20 PM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform
Ottomata added a comment to T421216: HTML Enrichment - Tuning & Backfilling configuration.

Status report!

Thu, Apr 16, 7:19 PM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform
Ottomata added a comment to T421216: HTML Enrichment - Tuning & Backfilling configuration.

Since OOMs don't seem to be related to async_enabled=false or the v1.50.0 release, I'm going to deploy staging back at v1.50.0. I want to find out if enrich function metrics are broken just for sync mode, or if they are also broken in async. I also don't think we need so much parallelism (if we are not backfilling, and maybe even if we are). Deploying v1.50.0 with 'fake sync' mode, with a slight reduction in parallelism and TM memory.

Thu, Apr 16, 1:12 PM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform
Ottomata created T423583: mediawiki.page_change.v1 event - Add revision is revert field.
Thu, Apr 16, 12:50 PM · Data-Engineering, Event-Platform
Ottomata added a comment to T421216: HTML Enrichment - Tuning & Backfilling configuration.

Ah! It also slowly hits container OOMs and restarts in the same way on v1.50.0.dev40 (without the new async_enabled param). So it seems more likely due to our latest Flink settings? Or was it OOMing before (in the non-backfill case) too?

Thu, Apr 16, 12:47 AM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform

Yesterday

Ottomata updated subscribers of T422928: HTML Pipeline - Performance improvements.

Avoid envoy retries (there's a header)

Wed, Apr 15, 2:49 PM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform
Ottomata added a comment to T421216: HTML Enrichment - Tuning & Backfilling configuration.

Hm, it looks like my Monday deployment now hits container OOMs regularly after about 11 hours. It restarts and recovers fine.

Wed, Apr 15, 2:42 PM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform
Ottomata added a comment to T422842: Kafka-topics broken in beta: "zookeeper is not a recognized option".

Possibly related to T416669: Upgrade Kafka to version 3.x? But I'm not sure if they upgraded beta kafka?

Wed, Apr 15, 1:59 PM · Data-Platform-SRE (2026-03-27 - 2026-04-17), Data-Engineering, Event-Platform, WMF-JobQueue, Beta-Cluster-Infrastructure

Mon, Apr 13

Ottomata added a comment to T421216: HTML Enrichment - Tuning & Backfilling configuration.

Hm, async_enabled=False EventProcessFunction metrics look busted though: https://grafana.wikimedia.org/goto/bfizxkx7mumm8d?orgId=1

Mon, Apr 13, 8:17 PM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform
Ottomata added a comment to T421216: HTML Enrichment - Tuning & Backfilling configuration.

Alright, I merged the patch for T421965: eventutilties-python - support synchronous Flink process function mode, and bumped MWEE to use it. I merged your patch to use a single process function, and released MWEE.

Mon, Apr 13, 8:16 PM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform
Ottomata updated the task description for T422928: HTML Pipeline - Performance improvements.
Mon, Apr 13, 7:00 PM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform
Ottomata added a comment to T400380: MediaWiki\Revision\RevisionAccessException: Unable to load fresh row for rev_id: {rev_id}.

Ah yes, that would be best! TY

Mon, Apr 13, 5:15 PM · Data-Engineering (Q4 FS25/26 April 1st - June 30st), MW-1.46-notes (1.46.0-wmf.21; 2026-03-24), Patch-For-Review, MW-Interfaces-Team, Event-Platform, MediaWiki-DomainEvents, Unstewarded-production-error, MediaWiki-Core-Revision-backend, Wikimedia-production-error

Thu, Apr 9

Ottomata added a comment to T421216: HTML Enrichment - Tuning & Backfilling configuration.

BTW, I tried pipeline.object-reuse and got an exception. See slack thread.

Thu, Apr 9, 7:12 PM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform
Ottomata added a comment to T421216: HTML Enrichment - Tuning & Backfilling configuration.

I am trying process_async_enabled_default=False in staging (T421965).

Thu, Apr 9, 7:05 PM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform
Ottomata added a comment to T422826: Attribution - Better support for Project Family.

In case helpful:

Thu, Apr 9, 5:55 PM · MW-Interfaces-Team, Spike, MediaWiki-REST-API
Ottomata added a comment to T422511: page_change.v1 increate partitions to 3.

Based on a meeting today (notes here) it sounded like adding more partitions may actually hinder our current efforts in T421216: HTML Enrichment - Tuning & Backfilling configuration. @JMonton-WMF should we hold on actually doing this?

Thu, Apr 9, 5:47 PM · Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform
Ottomata added a comment to T422511: page_change.v1 increate partitions to 3.

if the canary events only flow to a single partition this might create an imbalance on flink applications

Thu, Apr 9, 4:30 PM · Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform
Ottomata added a comment to T421216: HTML Enrichment - Tuning & Backfilling configuration.

Deploying content length fix in prod, and also avoiding retries on 504 server timeouts.

Thu, Apr 9, 1:46 PM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform
Ottomata added a comment to T421216: HTML Enrichment - Tuning & Backfilling configuration.

Wow nice!

Thu, Apr 9, 1:45 PM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform

Wed, Apr 8

Ottomata added a comment to T421965: eventutilties-python - support synchronous Flink process function mode.

I published docker v1.49.0.dev30 today with sync mode. Didn't have time to test it live.

Wed, Apr 8, 7:24 PM · Data-Engineering (Q4 FS25/26 April 1st - June 30st), Patch-For-Review, Research, Event-Platform
Ottomata added a comment to T421216: HTML Enrichment - Tuning & Backfilling configuration.

Hm! Prod failed with a 'message too large' error, but in the error sink!

Wed, Apr 8, 6:10 PM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform
Ottomata added a comment to T421216: HTML Enrichment - Tuning & Backfilling configuration.

As a rule of thumb, we need to do 40 HTTP calls per second. If many of them take 10 seconds, we need to have 400 HTTP calls in parallel or we won't be able to reach 20 msg/second. If we need to backfill, we need to increase that x2, x4, or ideally x10 or more.
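
The rule of thumb above is Little's law: requests in flight = call rate x time per call. A minimal sketch of that arithmetic follows. `required_concurrency` is a hypothetical helper, not part of the enrichment job, and the 2 calls per message figure is only inferred from "40 HTTP calls per second" for "20 msg/second".

```python
import math


def required_concurrency(calls_per_second: float, latency_seconds: float) -> int:
    """Little's law: in-flight calls = call rate x time each call takes."""
    return math.ceil(calls_per_second * latency_seconds)


# Assumption: each message needs 2 HTTP calls (40 calls/s for 20 msg/s).
CALLS_PER_MESSAGE = 2
MESSAGES_PER_SECOND = 20
SLOW_CALL_SECONDS = 10

# Parallel calls needed just to keep up when calls are slow.
base = required_concurrency(MESSAGES_PER_SECOND * CALLS_PER_MESSAGE, SLOW_CALL_SECONDS)

# Backfilling multiplies the target rate, and so the concurrency needed.
for backfill_factor in (1, 2, 4, 10):
    print(f"x{backfill_factor}: {base * backfill_factor} parallel HTTP calls")
```

The same function shows why latency matters as much as parallelism: halving the time per call halves the number of calls that must be in flight for the same rate.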

Wed, Apr 8, 1:37 PM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform
Ottomata updated subscribers of T421216: HTML Enrichment - Tuning & Backfilling configuration.

making it "unaligned" which seems to improve in scenarios with big latencies, and we can also reduce the number of checkpoints being created.

Wed, Apr 8, 1:26 PM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform
Ottomata added a comment to T421216: HTML Enrichment - Tuning & Backfilling configuration.

The checkpoint was already exactly_once, the Sink delivery guarantee was at_least_once,

Oh! I did not know this! Very interesting.

Wed, Apr 8, 1:20 PM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform
Ottomata added a comment to T422511: page_change.v1 increate partitions to 3.

Linking some really good thoughts from Javier from the parent task: T421216#11792886
...and responding here.

Wed, Apr 8, 1:09 PM · Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform

Mon, Apr 6

Ottomata updated subscribers of T421216: HTML Enrichment - Tuning & Backfilling configuration.
Mon, Apr 6, 12:25 AM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform
Ottomata added a comment to T421216: HTML Enrichment - Tuning & Backfilling configuration.

the job restarted. It is now slowly catching back up.

Well it isn't catching back up now.

Mon, Apr 6, 12:14 AM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform

Sun, Apr 5

Ottomata added a comment to T421216: HTML Enrichment - Tuning & Backfilling configuration.

Looks like something similar to T421216#11787170 happened again last night.
https://grafana.wikimedia.org/goto/efi675t7jq96ob?orgId=1

Sun, Apr 5, 12:27 PM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform

Sat, Apr 4

Ottomata added a comment to T421216: HTML Enrichment - Tuning & Backfilling configuration.

Can we do better than 400ms in the normal case?

Sat, Apr 4, 6:49 PM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform
Ottomata added a comment to T421216: HTML Enrichment - Tuning & Backfilling configuration.

Ah, I figured out why the Flink UI wasn't showing metrics. metrics.internal.query-service.port was not configured. It is a random port by default, but in k8s we need a networkpolicy for it. This was already supported by the flink chart, but it wasn't set by default.

Sat, Apr 4, 6:18 PM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform
Ottomata added a comment to T421216: HTML Enrichment - Tuning & Backfilling configuration.

FYI, we see log messages like

Name collision: Group already contains a Metric with the name 'pendingCommittables'. Metric will not be reported.
Sat, Apr 4, 5:59 PM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform
Ottomata added a comment to T421216: HTML Enrichment - Tuning & Backfilling configuration.

Increase mediawiki.page_change.v1 kafka topic partitions and increase kafka source parallelism

@JMonton-WMF this is still worth a try, but I am less and less convinced that source throughput is the problem. Flink should shuffle the source messages to downstream tasks, and the source messages are not the big ones. We have successfully backfilled page_content_change with a lot of TMs before, with the same number of kafka topic partitions.

Sat, Apr 4, 3:22 PM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform
Ottomata added a comment to T421216: HTML Enrichment - Tuning & Backfilling configuration.

Huh! And production caught back up! It did not crash or restart.

Sat, Apr 4, 12:39 PM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform
Ottomata added a comment to T421216: HTML Enrichment - Tuning & Backfilling configuration.

The production job has started getting stuck.
https://grafana.wikimedia.org/goto/bfi0ytdvi6y2od?orgId=1

Sat, Apr 4, 1:18 AM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform

Fri, Apr 3

Ottomata added a comment to T421216: HTML Enrichment - Tuning & Backfilling configuration.

Well, whatever I changed didn't work. Staging is still dying due to the same 'size too large' error in the kafka sink.

Fri, Apr 3, 10:18 PM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform
Ottomata added a comment to T421216: HTML Enrichment - Tuning & Backfilling configuration.

For now, in staging, I'm going to reduce enrich.max_content_size to 15MB, giving us a 5MB margin.
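
The margin arithmetic can be sketched as below. The 20MB sink limit is only inferred from "15MB" plus a "5MB margin" and is not stated in the comment. Treating 1MB as 1024 * 1024 bytes is also an assumption, and `fits_in_sink` is a hypothetical helper, not a function from the enrichment job.

```python
MB = 1024 * 1024  # assumption: binary megabytes

# Value given above for enrich.max_content_size.
MAX_CONTENT_SIZE_BYTES = 15 * MB
# Assumption: sink-side message limit implied by the 5MB margin.
ASSUMED_SINK_LIMIT_BYTES = 20 * MB


def fits_in_sink(content: bytes, envelope_overhead_bytes: int = 0) -> bool:
    """True if content is under the content cap and, together with the
    rest of the event envelope, still under the assumed sink limit."""
    if len(content) > MAX_CONTENT_SIZE_BYTES:
        return False
    return len(content) + envelope_overhead_bytes <= ASSUMED_SINK_LIMIT_BYTES


margin_bytes = ASSUMED_SINK_LIMIT_BYTES - MAX_CONTENT_SIZE_BYTES
print(f"margin: {margin_bytes // MB}MB")
```

The margin only helps if everything else in the event (metadata, encoding overhead) stays under it, which is why the overhead is a separate parameter here.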

Fri, Apr 3, 8:13 PM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform
Ottomata added a comment to T421216: HTML Enrichment - Tuning & Backfilling configuration.

Well! Staging is failing with message too large in kafka sink again:

Fri, Apr 3, 8:09 PM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform
Ottomata added a comment to T421216: HTML Enrichment - Tuning & Backfilling configuration.

Looks like prod died, and is now backfilling! Ah, but if there are no checkpointed offsets, the Flink source will fall back to the earliest offset by default!

Fri, Apr 3, 6:40 PM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform
Ottomata added a comment to T297231: [Data Quality] Sending Apache Spark metrics to PushGateway.

Cool! Seems easy enough, except last commit is 7 years ago? :D

Fri, Apr 3, 6:37 PM · Data-Engineering (Q4 FS25/26 April 1st - June 30st), Observability-Metrics
Ottomata added a comment to T360794: Event stream with latest revision HTML & parent revision HTML diff.

@JMonton-WMF something we should keep an eye on: kafka topic size. I think the html topics will end up being the largest on kafka jumbo. It looks like the largest topic is about 420GB right now.

Fri, Apr 3, 12:25 AM · Data-Engineering (Q4 FS25/26 April 1st - June 30st), Patch-For-Review, Research, Event-Platform

Thu, Apr 2

Ottomata added a comment to T421216: HTML Enrichment - Tuning & Backfilling configuration.

@JMonton-WMF good luck to you! Things we need to try:

Thu, Apr 2, 8:26 PM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform
Ottomata added a comment to T421216: HTML Enrichment - Tuning & Backfilling configuration.

Okay, in staging (-next) I just applied

Thu, Apr 2, 8:21 PM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform
Ottomata renamed T421216: HTML Enrichment - Tuning & Backfilling configuration from HTML Enrichment - Backfilling configuration to HTML Enrichment - Tuning & Backfilling configuration.
Thu, Apr 2, 7:39 PM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform
Ottomata updated the task description for T421216: HTML Enrichment - Tuning & Backfilling configuration.
Thu, Apr 2, 7:38 PM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform
Ottomata updated the task description for T421216: HTML Enrichment - Tuning & Backfilling configuration.
Thu, Apr 2, 7:38 PM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform
Ottomata added a comment to T421216: HTML Enrichment - Tuning & Backfilling configuration.

@JMonton-WMF and @AKhatun_WMF while we have backfill tuning issues, I want the production job to run, not restart, and not have to backfill if it does restart. To achieve this, I'm doing the following:

Thu, Apr 2, 7:19 PM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform
Ottomata moved T421951: eventutilities-python - make Flink Source and Sink parallelism configurable from Next Up to Done on the Data-Engineering (Q3 FY25/26 January 1st - March 31th) board.
Thu, Apr 2, 5:41 PM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Event-Platform
Ottomata added a comment to T419997: Fix PyFlink log levels.

Another main Error that appears as INFO, just right after a real ERROR:

Thu, Apr 2, 5:37 PM · Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform
Ottomata added a comment to T415892: Add Multilingual RevertRisk predictions to mediawiki.page_revert_risk_prediction_change.

changeprop errors

Thu, Apr 2, 5:35 PM · Machine-Learning-Team (Q4 FY2025-26), Data-Engineering-Radar, Event-Platform, Data-Engineering
Ottomata updated the task description for T347282: [Event Platform] eventutilites-python: improve consistency guarantees of async process functions.
Thu, Apr 2, 12:17 AM · Data-Engineering (Q4 FS25/26 April 1st - June 30st), Patch-For-Review, Event-Platform

Wed, Apr 1

Ottomata added a comment to T421919: Backfill newly productionized edit types dataset.

Should I host the code anywhere, get it reviewed? Hosting probably not required since it is a one-time spark job.

Nah, but it would be good to post it somewhere. Here in phab is fine, or in a linked GitLab snippet, or whatever you prefer.

Wed, Apr 1, 11:46 PM · Data-Engineering (Q4 FS25/26 April 1st - June 30st)
Ottomata added a comment to T421216: HTML Enrichment - Tuning & Backfilling configuration.

Uh, that did not work.

Wed, Apr 1, 8:54 PM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform
Ottomata added a comment to T421216: HTML Enrichment - Tuning & Backfilling configuration.

BTW, I think we can see the container OOMs here: https://grafana.wikimedia.org/goto/bfht68nna58g0f?orgId=1

Wed, Apr 1, 8:49 PM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform
Ottomata added a comment to T421216: HTML Enrichment - Tuning & Backfilling configuration.

There are a couple of other (minor?) pieces of the puzzle.

...And also, maybe we should just try upgrading to Flink 2.2.0 and doing T347282: [Event Platform] eventutilites-python: improve consistency guarantees of async process functions, using Flink's built-in AsyncProcessFunction instead of our custom microbatcher.

Wed, Apr 1, 5:20 PM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform
Ottomata edited projects for T421514: Investigate empty Constructive edit rate of newer editors (mobile web), added: Test Kitchen; removed Data-Engineering.
Wed, Apr 1, 5:01 PM · OKR-Work (WE1 FY2025-26), Test Kitchen, Product-Analytics (Kanban), Growth-Team
Ottomata moved T409676: Request for Hourly Pageview Data for multiple articles– July 18 to September 8, 2025 from Incoming (new tickets) to Tag with Radar on the Data-Engineering board.
Wed, Apr 1, 5:00 PM · Data-Engineering-Radar, Research, Data-Engineering
Ottomata added a comment to T409676: Request for Hourly Pageview Data for multiple articles– July 18 to September 8, 2025.

access to internal data sets such as wmf.pageview_actor, sessionlength

These are unlikely to be made available publicly, but...

Wed, Apr 1, 5:00 PM · Data-Engineering-Radar, Research, Data-Engineering
Ottomata reassigned T421919: Backfill newly productionized edit types dataset from Ottomata to AKhatun_WMF.
Wed, Apr 1, 4:37 PM · Data-Engineering (Q4 FS25/26 April 1st - June 30st)
Ottomata edited projects for T421783: Requesting Kerberos access for matmarex, added: Data-Platform-SRE; removed Data-Engineering.
Wed, Apr 1, 4:37 PM · Data-Engineering, Data-Platform-SRE (2026-03-27 - 2026-04-17)
Ottomata moved T228921: incomplete conversion of flow revisions after disabling flow, breaks stubs dumps from Incoming (new tickets) to Tag with Radar on the Data-Engineering board.
Wed, Apr 1, 4:35 PM · Data-Engineering-Radar, Patch-Needs-Improvement, Data-Engineering, MW-1.36-notes (1.36.0-wmf.2; 2020-07-28), Dumps-Generation
Ottomata moved T421703: Integrate airflow-fr-tech-ops into Bitu from Incoming (new tickets) to Tag with Radar on the Data-Engineering board.
Wed, Apr 1, 4:35 PM · Patch-For-Review, Bitu, Infrastructure-Foundations, Data-Engineering-Radar, Data-Engineering, FR-Tech-Analytics
Ottomata moved T51133: Create partial SQL dump of watchlist table from Incoming (new tickets) to Tag with Radar on the Data-Engineering board.
Wed, Apr 1, 4:34 PM · Patch-Needs-Improvement, Data-Engineering, Platform Team Workboards (Clinic Duty Team), Privacy Engineering, Dumps-Generation
Ottomata moved T24509: RCFeed should include change tags from Incoming (new tickets) to Tag with Radar on the Data-Engineering board.
Wed, Apr 1, 4:33 PM · Patch-Needs-Improvement, Data-Engineering, Moderator-Tools-Team, EventStreams, MediaWiki-Change-tagging, MediaWiki-Recent-changes
Ottomata added a comment to T421216: HTML Enrichment - Tuning & Backfilling configuration.

Re checkpointing, this could be what we need when backfilling: https://nightlies.apache.org/flink/flink-docs-stable/docs/dev/datastream/fault-tolerance/checkpointing/#execution-checkpointing-interval-during-backlog

Wed, Apr 1, 3:05 PM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform
Ottomata added a comment to T421216: HTML Enrichment - Tuning & Backfilling configuration.

More off-heap memory

Right! Makes sense. IIRC, @AKhatun_WMF and I had to increase this for edit types stuff too, to deal with large messages in pyflink.

Wed, Apr 1, 2:30 PM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform
Ottomata added a comment to T421216: HTML Enrichment - Tuning & Backfilling configuration.

Very nice writeup, thank you.

Wed, Apr 1, 1:01 PM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform
Ottomata added a comment to T417721: Page-related DomainEvent classes with "@deprecated temporary alias, remove before 1.45 release".

No objections! I haven't followed this closely but IIUC, the EventBus related work has been done by Aaron. Thanks Aaron!

Wed, Apr 1, 2:10 AM · MW-1.46-notes (1.46.0-wmf.23; 2026-04-07), Data-Engineering-Radar, MW-Interfaces-Team (MWI-Sprint-30 (2026-03-24 to 2026-04-07)), Data-Engineering, Patch-For-Review, MW-1.45-release, MediaWiki-DomainEvents

Tue, Mar 31

Ottomata updated the task description for T421965: eventutilties-python - support synchronous Flink process function mode.
Tue, Mar 31, 8:29 PM · Data-Engineering (Q4 FS25/26 April 1st - June 30st), Patch-For-Review, Research, Event-Platform
Ottomata created T421965: eventutilties-python - support synchronous Flink process function mode.
Tue, Mar 31, 8:29 PM · Data-Engineering (Q4 FS25/26 April 1st - June 30st), Patch-For-Review, Research, Event-Platform
Ottomata added a comment to T421951: eventutilities-python - make Flink Source and Sink parallelism configurable.

It works!

Tue, Mar 31, 7:24 PM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Event-Platform
Ottomata updated subscribers of T421951: eventutilities-python - make Flink Source and Sink parallelism configurable.
Tue, Mar 31, 6:31 PM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Event-Platform
Ottomata added a subtask for T360794: Event stream with latest revision HTML & parent revision HTML diff: T421951: eventutilities-python - make Flink Source and Sink parallelism configurable.
Tue, Mar 31, 6:18 PM · Data-Engineering (Q4 FS25/26 April 1st - June 30st), Patch-For-Review, Research, Event-Platform
Ottomata added a parent task for T421951: eventutilities-python - make Flink Source and Sink parallelism configurable: T360794: Event stream with latest revision HTML & parent revision HTML diff.
Tue, Mar 31, 6:18 PM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Event-Platform
Ottomata created T421951: eventutilities-python - make Flink Source and Sink parallelism configurable.
Tue, Mar 31, 6:17 PM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Event-Platform
Ottomata added a comment to T421919: Backfill newly productionized edit types dataset.

OTOH, perhaps all we need is something for a one-off now? If so, this would be a quick one-off and easy enough to do.

Tue, Mar 31, 3:29 PM · Data-Engineering (Q4 FS25/26 April 1st - June 30st)
Ottomata added a comment to T421919: Backfill newly productionized edit types dataset.

It would be nice to backfill the final production table, rather than our current development tables.

Tue, Mar 31, 3:22 PM · Data-Engineering (Q4 FS25/26 April 1st - June 30st)
Ottomata added a comment to T421919: Backfill newly productionized edit types dataset.

Slack thread with backfill approach.

Tue, Mar 31, 3:19 PM · Data-Engineering (Q4 FS25/26 April 1st - June 30st)
Ottomata added a comment to T418996: Audit and fix observability (logging and metrics) for pyflink jobs.

Config uses deprecated configuration key 'state.backend' instead of proper key 'state.backend.type'

Tue, Mar 31, 3:03 PM · Patch-For-Review, Data-Engineering (Q4 FS25/26 April 1st - June 30st), Event-Platform

Mon, Mar 30

Ottomata moved T415158: Common event data model for data derived from parsed page revision html (and more!) from In Review to Ready to Deploy on the Data-Engineering (Q3 FY25/26 January 1st - March 31th) board.
Mon, Mar 30, 3:27 PM · Data-Engineering (Q4 FS25/26 April 1st - June 30st), Content-Transform-Team, MW-Interfaces-Team, Event-Platform
Ottomata moved T409676: Request for Hourly Pageview Data for multiple articles– July 18 to September 8, 2025 from Needs Clarification to Incoming (new tickets) on the Data-Engineering board.
Mon, Mar 30, 3:16 PM · Data-Engineering-Radar, Research, Data-Engineering
Ottomata added a comment to T419436: Investigate Gobblin failures.

FailureRequest to uri https://meta.wikimedia.org/w/api.php?format=json&action=streamconfigs&all_settings=true failed. BasicHttpResult(failure) encountered local exception: Connect to mw-api-int-ro.discovery.wmnet:4446 [mw-api-int-ro.discovery.wmnet/10.2.2.81] failed: Connection timed out (Connection timed out)

Mon, Mar 30, 3:10 PM · Data-Engineering (Q4 FS25/26 April 1st - June 30st), Data-Platform-SRE (2026-03-27 - 2026-04-17)
Ottomata moved T409358: Add page_id and namespace to X-Analytics header in Mobile App requests (2025 remake) from Blocked/Paused to Done on the Data-Engineering (Q3 FY25/26 January 1st - March 31th) board.
Mon, Mar 30, 2:57 PM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Content-Transform-Team, Essential-Work, Reader Growth Team, MobileFrontend (Core PHP), Wikipedia-Android-App-Backlog, Wikipedia-iOS-App-Backlog
Ottomata updated the task description for T409358: Add page_id and namespace to X-Analytics header in Mobile App requests (2025 remake).
Mon, Mar 30, 2:57 PM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Content-Transform-Team, Essential-Work, Reader Growth Team, MobileFrontend (Core PHP), Wikipedia-Android-App-Backlog, Wikipedia-iOS-App-Backlog
Ottomata added a comment to T409358: Add page_id and namespace to X-Analytics header in Mobile App requests (2025 remake).

I think it works!

Mon, Mar 30, 2:57 PM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), Content-Transform-Team, Essential-Work, Reader Growth Team, MobileFrontend (Core PHP), Wikipedia-Android-App-Backlog, Wikipedia-iOS-App-Backlog

Thu, Mar 26

Ottomata added a comment to T331283: [Event Platform] Store Flink HA metadata in Zookeeper.

the operator doesn't know about ZK; it only checks the K8s-native HA

Hm, perhaps! I'm not sure though. IIUC (I might not!) HA is handled by Flink JM, not by the operator? I think you can use k8s ConfigMap HA if you were not using the k8s operator, and I'd expect you to be able to use ZK HA even if you are using the operator.

Thu, Mar 26, 5:29 PM · Discovery-Search, Event-Platform, serviceops-radar, Data-Engineering
Ottomata added a comment to T409462: mediawiki.page_change.v1 event - add a page type field.

@JAllemandou I think this is semi-relevant to the discussion we were just having about MWH incremental and namespace_is_content_historical. We probably want this info in the event

Thu, Mar 26, 4:06 PM · Data-Engineering, Event-Platform
Ottomata added a project to T421214: Requesting Ops level access to the 'platform_eng' Airflow Instance for the WME team: Data-Platform-SRE.
Thu, Mar 26, 4:01 PM · LDAP-Access-Requests, Data-Platform-SRE (2026-03-27 - 2026-04-17), SRE-Access-Requests, Wikimedia Enterprise
Ottomata moved T421237: `mediawiki.page_change.v1`: two schema validation errors causing events to be silently dropped by EventGate from Backlog to Stream Data Products on the Event-Platform board.
Thu, Mar 26, 2:51 PM · Data-Engineering, Event-Platform
Ottomata moved T421257: EventBus: Unable to deliver all events: 503: Service Unavailable from Backlog to Stream Data Products on the Event-Platform board.
Thu, Mar 26, 2:51 PM · Data-Engineering, Event-Platform

Wed, Mar 25

Ottomata renamed T335958: The differential privacy per country pageview datasets should be linked from dumps.wikmedia.org from The soon-to-be-released pageview datasets should be linked from dumps page to The differential privacy per country pageview datasets should be linked from dumps.wikmedia.org.
Wed, Mar 25, 6:09 PM · Data-Engineering, Privacy Engineering
Ottomata updated subscribers of T335958: The differential privacy per country pageview datasets should be linked from dumps.wikmedia.org.

cc @GGoncalves-WMF (we are grooming and putting this in backlog for now).

Wed, Mar 25, 5:51 PM · Data-Engineering, Privacy Engineering
Ottomata edited projects for T410668: PHP Deprecated: Invalid characters passed for attempted conversion, these have been ignored, added: Test Kitchen; removed Data-Engineering.
Wed, Mar 25, 5:48 PM · MediaWiki-extensions-EventLogging, User-brennen, DiscussionTools, Wikimedia-production-error
Ottomata closed T414526: Recent Wikidata dumps missing “All pages with complete edit history (.7z)” (job marked failed) as Declined.

Declining; please use the mediawiki_content_history (AKA dumps 2) files. Please reopen if declining was incorrect.

Wed, Mar 25, 5:47 PM · Wikidata data dumps, Dumps-Generation, Data-Engineering, Wikidata
Ottomata added a project to T412428: Wikidata full .json.gz dumps not published since 20250625: Data-Platform-SRE.
Wed, Mar 25, 5:44 PM · Data-Platform-SRE (2026-03-27 - 2026-04-17), Wikidata, Data-Engineering, Dumps-Generation, Wikidata data dumps
Ottomata moved T402612: Add pageview information to turnilo's webrequest_sampled_live (is_pageview is always "-") from Needs Clarification to Tag with Radar on the Data-Engineering board.
Wed, Mar 25, 5:43 PM · Data-Engineering-Radar, Data-Engineering, SRE, Traffic
Ottomata edited projects for T420379: Kubernetes setup for similarity search, added: Data-Platform-SRE; removed Data-Engineering.
Wed, Mar 25, 5:40 PM · Data-Platform-SRE (2026-03-27 - 2026-04-17), Patch-For-Review, Research-engineering
Ottomata moved T421152: Could not hoist data into experiment.subject_id for event from Incoming (new tickets) to Tag with Radar on the Data-Engineering board.
Wed, Mar 25, 5:38 PM · MW-1.46-notes (1.46.0-wmf.24; 2026-04-14), Test Kitchen (Test Kitchen (Experiment Platform Sprint 22)), Patch-For-Review, Data-Engineering-Radar, Data-Engineering, Event-Platform, MediaWiki-extensions-WikimediaEvents, Wikimedia-production-error, Growth-Team
Ottomata added a project to T421152: Could not hoist data into experiment.subject_id for event: Test Kitchen.
Wed, Mar 25, 5:37 PM · MW-1.46-notes (1.46.0-wmf.24; 2026-04-14), Test Kitchen (Test Kitchen (Experiment Platform Sprint 22)), Patch-For-Review, Data-Engineering-Radar, Data-Engineering, Event-Platform, MediaWiki-extensions-WikimediaEvents, Wikimedia-production-error, Growth-Team
Ottomata added a project to T420974: when analyzing a Wikifunctions dump, parent_id in page creation revisions is sometimes 0 and sometimes None: Dumps-Generation.
Wed, Mar 25, 5:34 PM · Data-Engineering (Q4 FS25/26 April 1st - June 30st), Dumps-Generation