Page MenuHomePhabricator

Adapt the rdf-streaming-updater flink job to use wikimedia-eventutilities-flink
Open, Needs TriagePublic5 Estimated Story Points

Description

This project was created before any of the utilities function were available, we should adapt the WDQS Streaming Updater flink to use these so that it better integrates with the event-platform.

AC:

  • re-use the flink utilities function to consume/produce to kafka
  • events coming out of this pipeline are validated against a well-defined schema

Event Timeline

Gehel set the point value for this task to 5.Sep 23 2024, 3:41 PM

Change #1078991 had a related patch set uploaded (by DCausse; author: DCausse):

[wikidata/query/rdf@master] Add support for EventDataStreamFactory for input streams

https://gerrit.wikimedia.org/r/1078991

Change #1079538 had a related patch set uploaded (by DCausse; author: DCausse):

[wikidata/query/rdf@master] Add support for setting a start timestamp

https://gerrit.wikimedia.org/r/1079538

Change #1081221 had a related patch set uploaded (by DCausse; author: DCausse):

[wikidata/query/rdf@master] Add support for event platform in output streams

https://gerrit.wikimedia.org/r/1081221

Change #1078991 merged by jenkins-bot:

[wikidata/query/rdf@master] Add support for EventDataStreamFactory for input streams

https://gerrit.wikimedia.org/r/1078991

Change #1079538 merged by jenkins-bot:

[wikidata/query/rdf@master] Add support for setting a start timestamp

https://gerrit.wikimedia.org/r/1079538

Change #1081221 merged by jenkins-bot:

[wikidata/query/rdf@master] Add support for EventDataStreamFactory for output streams

https://gerrit.wikimedia.org/r/1081221

Change #1091290 had a related patch set uploaded (by DCausse; author: DCausse):

[wikidata/query/deploy@master] deploy version 0.3.150

https://gerrit.wikimedia.org/r/1091290

Change #1091306 had a related patch set uploaded (by DCausse; author: DCausse):

[operations/deployment-charts@master] rdf-streaming-updater: bump to 0.3.150

https://gerrit.wikimedia.org/r/1091306

Change #1092191 had a related patch set uploaded (by DCausse; author: DCausse):

[operations/deployment-charts@master] rdf-streaming-updater: produce rdf_change v2 events

https://gerrit.wikimedia.org/r/1092191

Change #1091290 merged by Bking:

[wikidata/query/deploy@master] deploy version 0.3.150

https://gerrit.wikimedia.org/r/1091290

Change #1092191 merged by jenkins-bot:

[operations/deployment-charts@master] rdf-streaming-updater: produce rdf_change v2 events

https://gerrit.wikimedia.org/r/1092191

Change #1099656 had a related patch set uploaded (by DCausse; author: DCausse):

[wikidata/query/rdf@master] Add canary event filtering for the streaming-updater-consumer

https://gerrit.wikimedia.org/r/1099656

Change #1099727 had a related patch set uploaded (by DCausse; author: DCausse):

[operations/mediawiki-config@master] rdf-streaming-updater: add wdqs udpater streams in event stream config

https://gerrit.wikimedia.org/r/1099727

Change #1102319 had a related patch set uploaded (by DCausse; author: DCausse):

[wikidata/query/rdf@master] Add support for versioned streams

https://gerrit.wikimedia.org/r/1102319

Change #1099727 merged by jenkins-bot:

[operations/mediawiki-config@master] rdf-streaming-updater: add wdqs udpater streams in event stream config

https://gerrit.wikimedia.org/r/1099727

Mentioned in SAL (#wikimedia-operations) [2024-12-17T14:05:33Z] <dcausse@deploy2002> Started scap sync-world: Backport for [[gerrit:1099727|rdf-streaming-updater: add wdqs udpater streams in event stream config (T374919)]], [[gerrit:1104598|cirrussearch: increase shard count for cebwiki_content (T379002)]]

Mentioned in SAL (#wikimedia-operations) [2024-12-17T14:14:55Z] <dcausse@deploy2002> dcausse: Backport for [[gerrit:1099727|rdf-streaming-updater: add wdqs udpater streams in event stream config (T374919)]], [[gerrit:1104598|cirrussearch: increase shard count for cebwiki_content (T379002)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug)

Mentioned in SAL (#wikimedia-operations) [2024-12-17T14:25:41Z] <dcausse@deploy2002> Finished scap sync-world: Backport for [[gerrit:1099727|rdf-streaming-updater: add wdqs udpater streams in event stream config (T374919)]], [[gerrit:1104598|cirrussearch: increase shard count for cebwiki_content (T379002)]] (duration: 20m 07s)