Ottomata (Andrew Otto)
User

User Details

User Since
Oct 9 2014, 4:50 PM (498 w, 4 d)
Availability
Available
IRC Nick
ottomata
LDAP User
Ottomata
MediaWiki User
Ottomata

Recent Activity

Today

Ottomata added a comment to T120242: Eventually Consistent MediaWiki State Change Events.

^ changed title to remove the controversial 'source of truth' terminology.

Tue, Apr 30, 11:48 AM · Data-Engineering, Analytics, DBA, WMF-Architecture-Team, Platform Team Legacy (Later), Event-Platform, Services (later)
Ottomata renamed T120242: Eventually Consistent MediaWiki State Change Events from Eventually-Consistent MediaWiki state change events | MediaWiki events as source of truth to Eventually Consistent MediaWiki State Change Events.
Tue, Apr 30, 11:48 AM · Data-Engineering, Analytics, DBA, WMF-Architecture-Team, Platform Team Legacy (Later), Event-Platform, Services (later)

Yesterday

Ottomata updated subscribers of T228177: Event Platform Client Library Specification.

@phuedx @VirginiaPoundstone should this task be resolved?

Mon, Apr 29, 9:44 PM · Metrics Platform Icebox, Product-Data-Infrastructure, Product-Infrastructure-Team-Backlog-Deprecated, Epic, Better Use Of Data
Ottomata updated subscribers of T281999: Metrics Platform Schema: Define & Model Event Level Fields.

@VirginiaPoundstone another task to decline or resolve?

Mon, Apr 29, 9:43 PM · Metrics Platform Icebox, Epic
Ottomata added a comment to T281999: Metrics Platform Schema: Define & Model Event Level Fields.

Oh, or perhaps the subtasks should be done first?

Mon, Apr 29, 9:43 PM · Metrics Platform Icebox, Epic
Ottomata closed T282001: Metrics Platform Schema: Define & Model Bespoke Metrics Data as Declined.

@phuedx @VirginiaPoundstone being bold and declining. Please reopen if this was wrong.

Mon, Apr 29, 7:43 PM · Metrics Platform Icebox, Epic

Fri, Apr 26

Ottomata created T363587: [Event Platform] Instrument EventBus with prometheus MW Statslib.
Fri, Apr 26, 3:55 PM · Event-Platform, Data-Engineering
Ottomata added a comment to T351117: Move analytics log from Varnish to HAProxy.

Do you suggest to use something like uslfo_webrequest_text instead?

Fri, Apr 26, 1:48 PM · Data Products, Patch-For-Review, Data-Engineering, Observability-Logging, Traffic

Thu, Apr 25

Ottomata renamed T361017: [SPIKE] Can we express Event Platform configs in Datasets Config? from [SPIKE] Can we express Event Platform configs in config store? to [SPIKE] Can we express Event Platform configs in Datasets Config?.
Thu, Apr 25, 6:14 PM · Data-Engineering (Q4 2024 April 1st - June 30th), Spike, Event-Platform
Ottomata awarded T363461: Remove the need for karapace by using the schema registry built into DataHub a Like token.
Thu, Apr 25, 3:43 PM · Data-Catalog, Data-Platform-SRE
Ottomata added a comment to T329327: Frequent `429 Client Error: Too Many Requests for url: https://stream.wikimedia.org/v2/stream/recentchange` errors in SULWatcher.

Oh, another piece of info: WMF traffic frontends set a timeout of 15 minutes for all connections. This causes connected SSE clients to reconnect every 15 minutes. The disconnect should decrement the connection count for the client IP. But, if there are enough connections from the same IP, I think the reconnect would be more likely to end up at a worker that is already at the limit for that IP.

Thu, Apr 25, 1:36 AM · Toolforge, Tools, EventStreams, Data-Engineering, Event-Platform, stewardbots
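
A rough sketch of the reconnect scenario from the comment above, assuming each worker keeps its own in-memory count of open connections per client IP. The class name, the limit of 2, and the example IP are hypothetical and are not taken from the EventStreams code.

```python
from collections import Counter


class WorkerIpLimiter:
    """Hypothetical per-worker count of open connections per client IP."""

    def __init__(self, max_per_ip: int):
        self.max_per_ip = max_per_ip
        self.open_connections = Counter()

    def try_connect(self, ip: str) -> bool:
        # A real service would answer HTTP 429 when this returns False.
        if self.open_connections[ip] >= self.max_per_ip:
            return False
        self.open_connections[ip] += 1
        return True

    def disconnect(self, ip: str) -> None:
        # Only the worker that held the connection sees the decrement.
        if self.open_connections[ip] > 0:
            self.open_connections[ip] -= 1


workers = [WorkerIpLimiter(max_per_ip=2), WorkerIpLimiter(max_per_ip=2)]
ip = "192.0.2.10"

# Worker 0 holds two connections from this IP; worker 1 holds one.
workers[0].try_connect(ip)
workers[0].try_connect(ip)
workers[1].try_connect(ip)

# The frontend timeout drops the connection held by worker 1 ...
workers[1].disconnect(ip)

# ... and the reconnect happens to be routed to worker 0, which is still full.
reconnect_accepted = workers[0].try_connect(ip)
print(reconnect_accepted)
```

Because the counts are local to each worker, the reconnect is rejected even though the client is below the limit overall, which is the weakness a global rate limiter would avoid.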
Ottomata added a comment to T329327: Frequent `429 Client Error: Too Many Requests for url: https://stream.wikimedia.org/v2/stream/recentchange` errors in SULWatcher.

Or, could we just avoid rate limiting Cloud VPS / Toolforge IPs in EventStreams code? Or at least increase the limit by a lot for those IPs?

Thu, Apr 25, 1:33 AM · Toolforge, Tools, EventStreams, Data-Engineering, Event-Platform, stewardbots

Wed, Apr 24

Ottomata added a comment to T329327: Frequent `429 Client Error: Too Many Requests for url: https://stream.wikimedia.org/v2/stream/recentchange` errors in SULWatcher.

This is probably not helpful, but EventStreams' naive IP-based local rate limiting is pretty dumb. If there were a smarter, more global solution in WMF prod (maybe there is these days), we'd much prefer to use that.

Wed, Apr 24, 11:59 AM · Toolforge, Tools, EventStreams, Data-Engineering, Event-Platform, stewardbots

Fri, Apr 19

Ottomata added a comment to T249745: Could not enqueue jobs: "Unable to deliver all events: 503: Service Unavailable".

I fear I read that task, the way it is written at least, differently.

Fri, Apr 19, 9:02 PM · MediaWiki-Engineering, Data-Engineering, Unstewarded-production-error, User-brennen, serviceops, WMF-JobQueue, Wikimedia-production-error

Thu, Apr 18

Ottomata closed T269936: Schema repository structure, naming as Resolved.

Being bold.

Thu, Apr 18, 6:21 PM · Data-Engineering-Icebox, Metrics Platform Backlog, Better Use Of Data, Product-Analytics, Analytics, Product-Data-Infrastructure
Ottomata added a comment to T351117: Move analytics log from Varnish to HAProxy.

We could append (or prepend) other information pieces to the sequence number (like the haproxy process id) to avoid duplicates

Thu, Apr 18, 12:41 PM · Data Products, Patch-For-Review, Data-Engineering, Observability-Logging, Traffic
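
A minimal illustration of the idea quoted above, assuming each HAProxy process keeps its own sequence counter. The function name, the key format, and the sample values are hypothetical.

```python
def event_key(process_id: int, sequence: int) -> str:
    """Prepend the process id so equal sequence numbers from different processes differ."""
    return f"{process_id}-{sequence}"


# Two processes that each start counting from 1 (made-up sample data).
records = [(101, 1), (101, 2), (202, 1), (202, 2)]

bare_sequences = {sequence for _, sequence in records}
composite_keys = {event_key(pid, sequence) for pid, sequence in records}

print(len(bare_sequences), len(composite_keys))
```

The bare sequence numbers collide across processes, while the composite keys stay unique as long as a process id is not reused with an overlapping sequence range.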
Ottomata added a comment to T249745: Could not enqueue jobs: "Unable to deliver all events: 503: Service Unavailable".

Replied at T120242#9726131

Thu, Apr 18, 12:36 PM · MediaWiki-Engineering, Data-Engineering, Unstewarded-production-error, User-brennen, serviceops, WMF-JobQueue, Wikimedia-production-error
Ottomata added a comment to T120242: Eventually Consistent MediaWiki State Change Events.

There is a lil discussion about this topic in T249745: Could not enqueue jobs: "Unable to deliver all events: 503: Service Unavailable". Moving that discussion to here.

Thu, Apr 18, 12:34 PM · Data-Engineering, Analytics, DBA, WMF-Architecture-Team, Platform Team Legacy (Later), Event-Platform, Services (later)
Ottomata changed the status of T280628: Securely connect Wikimedia Enterprise Infrastructure with WMF Kafka Streams from Resolved to Declined.

Hello! I don't think this task is resolved. Perhaps you meant to decline it?

Thu, Apr 18, 11:36 AM · Wikimedia Enterprise, Traffic, SRE, Platform Engineering
Ottomata added a comment to T249745: Could not enqueue jobs: "Unable to deliver all events: 503: Service Unavailable".

search index not getting updated in 0.001% of edits

Thu, Apr 18, 11:30 AM · MediaWiki-Engineering, Data-Engineering, Unstewarded-production-error, User-brennen, serviceops, WMF-JobQueue, Wikimedia-production-error

Wed, Apr 17

Ottomata added a comment to T249745: Could not enqueue jobs: "Unable to deliver all events: 503: Service Unavailable".

For replicating state changes (T120242) [...]

Why though? Why is 99.9999% (or 99.999999% or 99.99%) not enough?

Wed, Apr 17, 5:31 PM · MediaWiki-Engineering, Data-Engineering, Unstewarded-production-error, User-brennen, serviceops, WMF-JobQueue, Wikimedia-production-error
Ottomata renamed T120242: Eventually Consistent MediaWiki State Change Events from Consistent MediaWiki state change events | MediaWiki events as source of truth to Eventually-Consistent MediaWiki state change events | MediaWiki events as source of truth.
Wed, Apr 17, 3:21 PM · Data-Engineering, Analytics, DBA, WMF-Architecture-Team, Platform Team Legacy (Later), Event-Platform, Services (later)
Ottomata added a comment to T249745: Could not enqueue jobs: "Unable to deliver all events: 503: Service Unavailable".

see the CAP theorem

C != eventual-C. Eventual Consistency + AP is feasible and done often.

Wed, Apr 17, 3:21 PM · MediaWiki-Engineering, Data-Engineering, Unstewarded-production-error, User-brennen, serviceops, WMF-JobQueue, Wikimedia-production-error

Mon, Apr 15

Ottomata renamed T306797: [Shared Event Platform] Investigate Stream Processing Platforms from [Shared Event Platform] Investigate Event Service Platforms to [Shared Event Platform] Investigate Stream Processing Platforms.
Mon, Apr 15, 2:42 PM · Epic, Generated Data Platform
Ottomata updated subscribers of T361853: [Datasets Config][Spike] Understand and document the details and conflicts between Datasets Config, Refine refactor, Dynamic EventStreamConfig, and Metrics Platform Instrumentation Configurator.
Mon, Apr 15, 2:41 PM · Data-Engineering (Q4 2024 April 1st - June 30th)
Ottomata added a subtask for T356762: [Refine refactoring] Extract refine schema management into a dedicated tool: T361853: [Datasets Config][Spike] Understand and document the details and conflicts between Datasets Config, Refine refactor, Dynamic EventStreamConfig, and Metrics Platform Instrumentation Configurator.
Mon, Apr 15, 2:11 PM · Data-Engineering (Q4 2024 April 1st - June 30th), Patch-For-Review
Ottomata added a parent task for T361853: [Datasets Config][Spike] Understand and document the details and conflicts between Datasets Config, Refine refactor, Dynamic EventStreamConfig, and Metrics Platform Instrumentation Configurator: T356762: [Refine refactoring] Extract refine schema management into a dedicated tool.
Mon, Apr 15, 2:11 PM · Data-Engineering (Q4 2024 April 1st - June 30th)

Fri, Apr 5

Ottomata added a comment to T351117: Move analytics log from Varnish to HAProxy.

Very cool!

Fri, Apr 5, 5:46 PM · Data Products, Patch-For-Review, Data-Engineering, Observability-Logging, Traffic
Ottomata added a comment to T356762: [Refine refactoring] Extract refine schema management into a dedicated tool.

I prefer the "by functionality" organization

Yap cool with me. Let the naming bikeshed begin.

Fri, Apr 5, 5:45 PM · Data-Engineering (Q4 2024 April 1st - June 30th), Patch-For-Review
Ottomata added a comment to T360924: Replace service runner with a simplified library to better support metrics and debugging.

Curious! What's the status on collaboration with the rest of the org on NodeJS services and library support? IIUC there is tech department essential work planned for this.

Fri, Apr 5, 5:38 PM · Data-Engineering (Q4 2024 April 1st - June 30th)
Ottomata updated the task description for T361853: [Datasets Config][Spike] Understand and document the details and conflicts between Datasets Config, Refine refactor, Dynamic EventStreamConfig, and Metrics Platform Instrumentation Configurator.
Fri, Apr 5, 2:30 AM · Data-Engineering (Q4 2024 April 1st - June 30th)

Thu, Apr 4

Ottomata added a subtask for T354557: Dataset Config Store: T361017: [SPIKE] Can we express Event Platform configs in Datasets Config?.
Thu, Apr 4, 7:17 PM · Epic, Data-Engineering
Ottomata added a parent task for T361017: [SPIKE] Can we express Event Platform configs in Datasets Config?: T354557: Dataset Config Store.
Thu, Apr 4, 7:17 PM · Data-Engineering (Q4 2024 April 1st - June 30th), Spike, Event-Platform
Ottomata added a comment to T361853: [Datasets Config][Spike] Understand and document the details and conflicts between Datasets Config, Refine refactor, Dynamic EventStreamConfig, and Metrics Platform Instrumentation Configurator.

Perhaps we can close T361017: [SPIKE] Can we express Event Platform configs in Datasets Config? as duplicate?

Thu, Apr 4, 7:16 PM · Data-Engineering (Q4 2024 April 1st - June 30th)
Ottomata renamed T361853: [Datasets Config][Spike] Understand and document the details and conflicts between Datasets Config, Refine refactor, Dynamic EventStreamConfig, and Metrics Platform Instrumentation Configurator from [Event Platform] [Spike] Understand and document the details and conflicts between Datasets Config, Refine refactor, Dynamic EventStreamConfig, and Metrics Platform Instrumentation Configurator to [Datasets Config][Spike] Understand and document the details and conflicts between Datasets Config, Refine refactor, Dynamic EventStreamConfig, and Metrics Platform Instrumentation Configurator.
Thu, Apr 4, 6:17 PM · Data-Engineering (Q4 2024 April 1st - June 30th)
Ottomata added a subtask for T360738: Update EventStreamConfigs extension to use MPMW hook: T361853: [Datasets Config][Spike] Understand and document the details and conflicts between Datasets Config, Refine refactor, Dynamic EventStreamConfig, and Metrics Platform Instrumentation Configurator.
Thu, Apr 4, 6:17 PM · MW-1.42-notes (1.42.0-wmf.25; 2024-04-02), Metrics Platform Backlog, Data Products (Data Products Sprint 11)
Ottomata added a subtask for T354557: Dataset Config Store: T361853: [Datasets Config][Spike] Understand and document the details and conflicts between Datasets Config, Refine refactor, Dynamic EventStreamConfig, and Metrics Platform Instrumentation Configurator.
Thu, Apr 4, 6:17 PM · Epic, Data-Engineering
Ottomata added parent tasks for T361853: [Datasets Config][Spike] Understand and document the details and conflicts between Datasets Config, Refine refactor, Dynamic EventStreamConfig, and Metrics Platform Instrumentation Configurator: T354557: Dataset Config Store, T360738: Update EventStreamConfigs extension to use MPMW hook.
Thu, Apr 4, 6:17 PM · Data-Engineering (Q4 2024 April 1st - June 30th)
Ottomata renamed T361853: [Datasets Config][Spike] Understand and document the details and conflicts between Datasets Config, Refine refactor, Dynamic EventStreamConfig, and Metrics Platform Instrumentation Configurator from [Event Platform] [Spike] Develop a concept to apply Metrics Platform configurations to event stream configurations to [Event Platform] [Spike] Understand and document the details and conflicts between Datasets Config, Refine refactor, Dynamic EventStreamConfig, and Metrics Platform Instrumentation Configurator.
Thu, Apr 4, 6:14 PM · Data-Engineering (Q4 2024 April 1st - June 30th)
Ottomata added a comment to T356762: [Refine refactoring] Extract refine schema management into a dedicated tool.

Should we have 2 lib files, one for schema and one for data, for both Hive and Iceberg? Or one file doing both as it is now?

Thu, Apr 4, 5:35 PM · Data-Engineering (Q4 2024 April 1st - June 30th), Patch-For-Review

Wed, Apr 3

Ottomata added a comment to T356762: [Refine refactoring] Extract refine schema management into a dedicated tool.

Just read Antoine's patch and I think I'm missing something, so I thought I could ask here.

Wed, Apr 3, 11:40 PM · Data-Engineering (Q4 2024 April 1st - June 30th), Patch-For-Review
Ottomata closed T296529: Refine Investigation as Declined.

Being bold, reopen if needed.

Wed, Apr 3, 11:09 PM · Data-Engineering, Spike, Data Pipelines

Mar 28 2024

Ottomata updated subscribers of T361214: Public dashboard process.
Mar 28 2024, 1:57 PM · Epic, Data Products (Epics Timeline), Data-Engineering-Dashiki, Data-Engineering

Mar 27 2024

Ottomata added a comment to T266813: mw.user.generateRandomSessionId should return a UUID.

I think this is a library that Data Engineering owns?

@VirginiaPoundstone I don't think so. I believe mw.user.generateRandomSessionId is part of MediaWiki core.

Mar 27 2024, 10:46 PM · Metrics Platform Backlog, Data Products, Data-Engineering, Analytics-Radar, Better Use Of Data, Product-Data-Infrastructure

Mar 26 2024

Ottomata added a comment to T360642: Remove extra fields currently sent to Kafka.

meta.id

Do you know who set these fields with the current webrequest flow?

Mar 26 2024, 6:37 PM · Event-Platform, Patch-For-Review, Data-Engineering, Observability-Logging, Traffic

Mar 22 2024

Ottomata added a comment to T360642: Remove extra fields currently sent to Kafka.

meta.id and meta.request_id

Mar 22 2024, 7:41 PM · Event-Platform, Patch-For-Review, Data-Engineering, Observability-Logging, Traffic

Mar 21 2024

Ottomata added a comment to T348958: Bump memory to enable large artifacts sync on HDFS.

Supports only reading, with read-ahead of a predetermined block-size.

In the case that the server does not supply the filesize, only reading of
the complete file in one go is supported.

Mar 21 2024, 8:12 PM · Structured-Data-Backlog, Data-Engineering

Mar 19 2024

Ottomata added a comment to T348958: Bump memory to enable large artifacts sync on HDFS.

Hm, actually, as far as I can tell, reading from HTTP (and many other sources) uses https://filesystem-spec.readthedocs.io/en/stable/api.html#fsspec.spec.AbstractBufferedFile, which has a default read blocksize of 5MB.

Mar 19 2024, 12:17 AM · Structured-Data-Backlog, Data-Engineering

Mar 18 2024

Ottomata added a comment to T348958: Bump memory to enable large artifacts sync on HDFS.

Or maybe:

Mar 18 2024, 11:56 PM · Structured-Data-Backlog, Data-Engineering
Ottomata added a comment to T348958: Bump memory to enable large artifacts sync on HDFS.

Maybe: https://filesystem-spec.readthedocs.io/en/latest/api.html?highlight=clear%20cache#fsspec.utils.read_block ?

Mar 18 2024, 11:37 PM · Structured-Data-Backlog, Data-Engineering
Ottomata added a comment to T359178: Check statsv and eventlogging VarnishKafka instances.

@Fabfur I'm really really hoping we can remove varnishkafka-eventlogging after we complete T353817: Create legacy EventLogging proxy HTTP intake (for MediaWikiPingback) endpoint to EventGate, as part of T238230: Decommission EventLogging backend components by migrating to MEP.

Mar 18 2024, 10:25 PM · Data-Engineering, Observability-Logging, Traffic
Ottomata added a comment to T249745: Could not enqueue jobs: "Unable to deliver all events: 503: Service Unavailable".

This doesn't mean that MediaWiki shouldn't try to improve the situation by handling the failure to submit a job by saving it somewhere (a specific db table?) so that we can replay it later. At the current failure rate, this would guarantee the jobs would be executed with an irrelevant cost in terms of resources.

Mar 18 2024, 10:20 PM · MediaWiki-Engineering, Data-Engineering, Unstewarded-production-error, User-brennen, serviceops, WMF-JobQueue, Wikimedia-production-error
Ottomata added a comment to T249745: Could not enqueue jobs: "Unable to deliver all events: 503: Service Unavailable".

I wonder if JobQueueGroup::lazyPush()/JobQueueEventBus could be rigged to make the provided jobs use "hasty" mode in EventGate?

Mar 18 2024, 10:15 PM · MediaWiki-Engineering, Data-Engineering, Unstewarded-production-error, User-brennen, serviceops, WMF-JobQueue, Wikimedia-production-error

Mar 15 2024

Ottomata updated subscribers of T291120: MediaWiki Event Carried State Transfer - Problem Statement.
Mar 15 2024, 11:53 PM · Data-Engineering, Platform Engineering, Event-Platform, tech-decision-forum
Ottomata updated the task description for T347970: [L] MachineVision: archive and remove all events and event schemas.
Mar 15 2024, 9:13 PM · Patch-For-Review, Structured-Data-Backlog (Current Work), MachineVision
Ottomata created T360210: Document instructions for deleting an event stream and its usages.
Mar 15 2024, 4:14 PM · Data-Engineering (Q4 2024 April 1st - June 30th), Metrics Platform Backlog, Event-Platform

Mar 6 2024

Ottomata updated the task description for T120242: Eventually Consistent MediaWiki State Change Events.
Mar 6 2024, 1:30 PM · Data-Engineering, Analytics, DBA, WMF-Architecture-Team, Platform Team Legacy (Later), Event-Platform, Services (later)

Mar 5 2024

Ottomata added a comment to T357537: Alerts Review: determine if we can use Prometheus to alert based on historical datasets.

Oh cool! @bking I read the linked notes but I'm missing how it's gonna work? How can you alert on dataset $X for partition $N is failing? Is there a way to make partition or hour or datetime or whatever a label?

Mar 5 2024, 2:12 PM · Data-Platform-SRE (2024.03.04 - 2024.03.24), Data-Engineering

Mar 3 2024

Ottomata added a comment to T249745: Could not enqueue jobs: "Unable to deliver all events: 503: Service Unavailable".

I still wonder why profile::kafka::mirror::properties doesn't blacklist all MW jobs?* Is anything making use of that extra data?

Mar 3 2024, 2:17 PM · MediaWiki-Engineering, Data-Engineering, Unstewarded-production-error, User-brennen, serviceops, WMF-JobQueue, Wikimedia-production-error

Feb 29 2024

Ottomata added a comment to T253058: DRY kafka broker declaration in helmfiles.

+1, or add this as a subtask of that?

Feb 29 2024, 7:57 PM · Data-Engineering, Data-Platform-SRE, serviceops, SRE, Event-Platform
Ottomata added a comment to T358612: Investigate replacing Archiva with Gitlab repositories.

+1! to this idea!

Feb 29 2024, 3:14 PM · Data-Platform-SRE (2024.04.15 - 2024.05.05), Java-Scala-Standardization, Security, collaboration-services, Release-Engineering-Team

Feb 27 2024

Ottomata added a comment to T350180: Upgrade prom-client in NodeJS service-runner and enable collectDefaultMetrics.

<3

Feb 27 2024, 9:42 PM · Data-Engineering, observability, ChangeProp, Event-Platform, service-runner
Ottomata updated subscribers of T309772: npm audit reports several security issues with Service runner.
Feb 27 2024, 1:22 PM · MediaWiki-Engineering, CX-cxserver, Security, service-runner

Feb 21 2024

Ottomata added a comment to T354557: Dataset Config Store.

Worth investigating? https://datacontract.com/

Feb 21 2024, 1:30 AM · Epic, Data-Engineering
Ottomata added a comment to T276088: Configuration Management for Kafka settings.

Just came across https://www.jikkou.io/docs/tutorials/get_started/ . Worth a look!

Feb 21 2024, 1:22 AM · Data-Platform-SRE, Data-Engineering, serviceops-radar, Event-Platform, Analytics-Radar, SRE
Ottomata closed T358073: kafka management as Invalid.
Feb 21 2024, 1:20 AM
Ottomata created T358073: kafka management.
Feb 21 2024, 1:20 AM

Feb 19 2024

Ottomata added a comment to T307959: [Event Platform] Design and Implement realtime enrichment pipeline for MW page change with content.

@lbowmaker @gmodena Should we resolve and close this?

Feb 19 2024, 3:39 PM · Data-Engineering, Event-Platform, Epic

Feb 15 2024

Ottomata added a comment to T356597: Investigate if the new 'Multiblocks' user blocks feature affects the mediawiki.user-blocks-change event stream.

@JWheeler-WMF EventBus extension uses the BlockIpComplete hook. If there are no changes to this hook API, then there are no changes needed for EventBus or the mediawiki.user-blocks-change stream. However, I'd assume that to accommodate the Multiblocks feature, the hook will need to be changed to represent the multiple expiration dates of the different blocks.

Feb 15 2024, 6:40 PM · Data Products (Data Products Sprint 09), Multiblocks, Community-Tech, Data-Engineering, Event-Platform

Feb 13 2024

Ottomata added a comment to T356762: [Refine refactoring] Extract refine schema management into a dedicated tool.

Oh and in case you haven't seen it: EvolveHiveTable.

Feb 13 2024, 11:44 PM · Data-Engineering (Q4 2024 April 1st - June 30th), Patch-For-Review
Ottomata added a comment to T266813: mw.user.generateRandomSessionId should return a UUID.

I think because it was on the Event Platform board, but doesn't have anything really to do with Event Platform. Instead, it has to do with MW generated session IDs, which I believe are used in EventLogging instrumentation schemas.

Feb 13 2024, 2:29 AM · Metrics Platform Backlog, Data Products, Data-Engineering, Analytics-Radar, Better Use Of Data, Product-Data-Infrastructure

Feb 12 2024

Ottomata updated the task description for T353817: Create legacy EventLogging proxy HTTP intake (for MediaWikiPingback) endpoint to EventGate.
Feb 12 2024, 4:08 PM · MediaWiki-Platform-Team (Radar), Patch-For-Review, Data-Engineering, Event-Platform, MediaWiki-General
Ottomata updated subscribers of T353817: Create legacy EventLogging proxy HTTP intake (for MediaWikiPingback) endpoint to EventGate.

PHP execution.
Afaik PHP execution is limited for security reasons to only specific directories. This will thus likely need a puppet change first to Apache config to allow this directory to execute PHP.

Given the transition to Docker/Helm/Kubernetes etc this will also need a corresponding change there, which has its own copy of the Apache config.

Feb 12 2024, 4:07 PM · MediaWiki-Platform-Team (Radar), Patch-For-Review, Data-Engineering, Event-Platform, MediaWiki-General

Feb 11 2024

Ottomata added a comment to T357005: eventstreams regularly uses more than 95% of its memory limit.

If you have time to dive deep, you can live inspect a nodejs process and search for memory leaks.

Feb 11 2024, 9:16 PM · Data-Engineering, Event-Platform, EventStreams, serviceops, Prod-Kubernetes, Kubernetes

Feb 9 2024

Ottomata added projects to T357005: eventstreams regularly uses more than 95% of its memory limit: Event-Platform, Data-Engineering.

wondering about the stream connection duration

Feb 9 2024, 11:47 PM · Data-Engineering, Event-Platform, EventStreams, serviceops, Prod-Kubernetes, Kubernetes
Ottomata added a comment to T351837: [SPIKE] Assess impact of Move analytics log from Varnish to HAProxy .

all we'd need would be to switch our webrequest pipelines to start consuming from the proposed new table names discussed in T314956: [Event Platform] Declare webrequest as an Event Platform stream.

Feb 9 2024, 10:45 PM · Data Products (Data Products Sprint 07)
Ottomata added a comment to T351117: Move analytics log from Varnish to HAProxy.

I think that this more precise timestamp would be parseable by our ingestion system just fine, but we should verify. If we can get this precise I suppose...why not? I see that the existing varnish dt only has second precision, which doesn't seem very precise, especially for webrequest. Perhaps we should take this opportunity to increase the precision a bit. If we can, we should strive for at least millisecond precision. Not a blocker for this task though.

Feb 9 2024, 10:42 PM · Data Products, Patch-For-Review, Data-Engineering, Observability-Logging, Traffic
Ottomata updated the task description for T314956: [Event Platform] Declare webrequest as an Event Platform stream.
Feb 9 2024, 10:39 PM · Patch-For-Review, Data-Engineering, Event-Platform
Ottomata added projects to T336842: Introduce new logging schema: Data Products, Metrics Platform Backlog.
Feb 9 2024, 6:45 PM · WMDE-FUN-Sprint-2024-04-23, WMDE-FUN-Sprint-2024-04-09, WMDE-FUN-Sprint-2024-02-27, WMDE-FUN-Sprint-2024-02-13, Metrics Platform Backlog, Data Products, WMDE-FUN-Sprint-2024-01-30, WMDE-FUN-Team, WMDE-Fundraising-Tech
Ottomata updated subscribers of T336842: Introduce new logging schema.

respective doc page

Feb 9 2024, 6:45 PM · WMDE-FUN-Sprint-2024-04-23, WMDE-FUN-Sprint-2024-04-09, WMDE-FUN-Sprint-2024-02-27, WMDE-FUN-Sprint-2024-02-13, Metrics Platform Backlog, Data Products, WMDE-FUN-Sprint-2024-01-30, WMDE-FUN-Team, WMDE-Fundraising-Tech

Feb 8 2024

Ottomata added a comment to T356762: [Refine refactoring] Extract refine schema management into a dedicated tool.

Hello! I'm not entirely sure what this ticket is trying to do, but here's some hopefully useful information:

Feb 8 2024, 3:33 PM · Data-Engineering (Q4 2024 April 1st - June 30th), Patch-For-Review

Feb 5 2024

Ottomata updated the task description for T314956: [Event Platform] Declare webrequest as an Event Platform stream.
Feb 5 2024, 2:38 PM · Patch-For-Review, Data-Engineering, Event-Platform

Feb 4 2024

Ottomata created T356597: Investigate if the new 'Multiblocks' user blocks feature affects the mediawiki.user-blocks-change event stream.
Feb 4 2024, 2:26 PM · Data Products (Data Products Sprint 09), Multiblocks, Community-Tech, Data-Engineering, Event-Platform

Jan 30 2024

Ottomata edited projects for T266813: mw.user.generateRandomSessionId should return a UUID, added: Data Products, Metrics Platform Backlog; removed Event-Platform.
Jan 30 2024, 11:52 PM · Metrics Platform Backlog, Data Products, Data-Engineering, Analytics-Radar, Better Use Of Data, Product-Data-Infrastructure
Ottomata added a comment to T352783: Change data platform-related IRC channels to improve communication.

Copypasting comment from Alerts Review doc:

Jan 30 2024, 11:47 PM · Data-Platform-SRE (2024.04.15 - 2024.05.05), observability

Jan 18 2024

Ottomata updated the task description for T341229: ProduceCanaryEvents job should be scheduled by Airflow and/or a k8s service.
Jan 18 2024, 6:25 PM · Data-Engineering (Sprint 9), Event-Platform
Ottomata renamed T341229: ProduceCanaryEvents job should be scheduled by Airflow and/or a k8s service from ProduceCanaryEvents job should be scheduled by Airflow to ProduceCanaryEvents job should be scheduled by Airflow and/or a k8s service.
Jan 18 2024, 6:25 PM · Data-Engineering (Sprint 9), Event-Platform
Ottomata added a project to T341229: ProduceCanaryEvents job should be scheduled by Airflow and/or a k8s service: Event-Platform.
Jan 18 2024, 6:13 PM · Data-Engineering (Sprint 9), Event-Platform

Jan 10 2024

Ottomata updated subscribers of T347421: [NEEDS GROOMING] schema services should be moved to k8s.
Jan 10 2024, 10:49 PM · Data-Platform-SRE, Event-Platform, Data-Engineering
Ottomata added a comment to T353817: Create legacy EventLogging proxy HTTP intake (for MediaWikiPingback) endpoint to EventGate.

Oh, and actually, we only need to count requests to mediawiki.org/beacon/event, so:

Jan 10 2024, 2:42 PM · MediaWiki-Platform-Team (Radar), Patch-For-Review, Data-Engineering, Event-Platform, MediaWiki-General

Jan 8 2024

Ottomata added a comment to T349289: Upgrade eventlogging VM to bullseye (or bookworm).

Decommissioning probably won't get done until after I'm back from leave in late April. Can we wait that long?

Jan 8 2024, 3:17 PM · Data-Platform-SRE (2024.02.12 - 2024.03.03), Data-Engineering, Event-Platform

Jan 5 2024

Ottomata added a comment to T259163: Migrate legacy metawiki schemas to Event Platform.

Okay great! Thank you.

Jan 5 2024, 2:47 PM · Data-Engineering, Better Use Of Data, Product-Analytics, MW-1.36-notes (1.36.0-wmf.18; 2020-11-17), Product-Data-Infrastructure, Event-Platform
Ottomata added a comment to T212482: RFC: Evolve hook system to support "filters" and "actions" only.

Also, from the convo in December's tech leadership CoP meeting, I started thinking about how what we want for T291120: MediaWiki Event Carried State Transfer - Problem Statement is pretty similar to what is in MW's logging table, except we need the data to be structured, comprehensive and consistent (meaning no missing state changes). In T120242: Eventually Consistent MediaWiki State Change Events, one of the solutions outlined is the 'Transactional Outbox' pattern, which is kinda similar to a comprehensive+structured logging table from which we can generate and externalize state change events. I betcha we could tie these ideas together somehow.

Jan 5 2024, 2:29 PM · Patch-Needs-Improvement, Platform Engineering Roadmap Decision Making, MediaWiki-Core-Hooks, Platform Team Initiatives (New Hook System), TechCom-RFC, TechCom
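
A minimal sketch of the 'Transactional Outbox' pattern mentioned above, using an in-memory SQLite database. The table names, columns, and the rename_page/relay helpers are hypothetical and are not MediaWiki's schema or the design proposed in T120242.

```python
import json
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE page (id INTEGER PRIMARY KEY, title TEXT)")
conn.execute(
    "CREATE TABLE outbox ("
    "id INTEGER PRIMARY KEY AUTOINCREMENT, payload TEXT, published INTEGER DEFAULT 0)"
)
with conn:
    conn.execute("INSERT INTO page (id, title) VALUES (1, 'Old title')")


def rename_page(page_id: int, new_title: str) -> None:
    # The state change and its event row commit (or roll back) together.
    event = {"page_id": page_id, "title": new_title}
    with conn:
        conn.execute("UPDATE page SET title = ? WHERE id = ?", (new_title, page_id))
        conn.execute("INSERT INTO outbox (payload) VALUES (?)", (json.dumps(event),))


def relay(publish) -> int:
    # A separate process would poll the outbox and externalize unpublished events.
    rows = conn.execute(
        "SELECT id, payload FROM outbox WHERE published = 0 ORDER BY id"
    ).fetchall()
    for row_id, payload in rows:
        publish(json.loads(payload))
        with conn:
            conn.execute("UPDATE outbox SET published = 1 WHERE id = ?", (row_id,))
    return len(rows)


published = []
rename_page(1, "New title")
first_run = relay(published.append)
second_run = relay(published.append)
print(first_run, second_run, published)
```

If the relay crashes after publishing but before marking the row, the event is sent again on the next run, so consumers of a design like this would need to tolerate duplicates.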
Ottomata added a comment to T259163: Migrate legacy metawiki schemas to Event Platform.

@SNowick_WMF, are latest versions of apps still sending the various MobileApp* events? I see a few events coming in, but maybe those are just from old versions?

Jan 5 2024, 1:42 AM · Data-Engineering, Better Use Of Data, Product-Analytics, MW-1.36-notes (1.36.0-wmf.18; 2020-11-17), Product-Data-Infrastructure, Event-Platform
Ottomata added a comment to T353817: Create legacy EventLogging proxy HTTP intake (for MediaWikiPingback) endpoint to EventGate.

peak request rate was ~1900 requests/s.

Oh, that turnilo chart is per hour (I think), and is also sampled 1/128. 1900/s seemed like a lot! So more like a peak of 900*128/60/60 == 32 requests/s. (I think you misread the chart; the peak I see shows '900', not 1900.)

Jan 5 2024, 1:28 AM · MediaWiki-Platform-Team (Radar), Patch-For-Review, Data-Engineering, Event-Platform, MediaWiki-General
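
The estimate in the comment above can be checked with a few lines, assuming the chart value is an hourly count of sampled requests. The variable names are illustrative only.

```python
sampled_requests_per_hour = 900  # peak value read from the chart
sampling_factor = 128            # data is sampled 1/128
seconds_per_hour = 60 * 60

# Scale the sampled count back up, then convert from per hour to per second.
requests_per_second = sampled_requests_per_hour * sampling_factor / seconds_per_hour
print(requests_per_second)
```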
Ottomata added a comment to T346463: Identify and label prefetch proxy data in our traffic.

IIRC, the decision was to wait until the new year, so as not to risk a mistake while people were out on holidays.

Jan 5 2024, 1:14 AM · Traffic, Movement-Insights, Data-Engineering
Ottomata added a comment to T353817: Create legacy EventLogging proxy HTTP intake (for MediaWikiPingback) endpoint to EventGate.

Volume
peak request rate was ~1900 requests/s.

Jan 5 2024, 1:12 AM · MediaWiki-Platform-Team (Radar), Patch-For-Review, Data-Engineering, Event-Platform, MediaWiki-General
Ottomata added a comment to T307040: Propagate field descriptions from event schemas to Hive event tables.

we should decide sometime soon

Jan 5 2024, 1:01 AM · Patch-For-Review, Product-Analytics, Data-Engineering

Jan 3 2024

Ottomata added a comment to T353680: Android Metrics Platform Migration Data Validation - first pass - first 4 tables.

Which reads to me as EventGate needing a logic update for how it formats a multi-status response

Jan 3 2024, 1:46 PM · Data Products (Data Products Sprint 05), Product-Analytics (Kanban), Patch-For-Review, Wikipedia-Android-App-Backlog (Android Release - FY2023-24)

Jan 2 2024

Ottomata added a comment to T307040: Propagate field descriptions from event schemas to Hive event tables.

Wow it...kinda...works~

Jan 2 2024, 10:01 PM · Patch-For-Review, Product-Analytics, Data-Engineering