Page MenuHomePhabricator

Adapt the rdf-streaming-updater flink job to use wikimedia-eventutilities-flink
Closed, ResolvedPublic5 Estimated Story Points

Description

This project was created before any of the utilities function were available, we should adapt the WDQS Streaming Updater flink to use these so that it better integrates with the event-platform.

AC:

  • re-use the flink utilities function to consume/produce to kafka
  • events coming out of this pipeline are validated against a well-defined schema

Details

Related Changes in Gerrit:
SubjectRepoBranchLines +/-
wikidata/query/deploymaster+16 -18
operations/deployment-chartsmaster+0 -4
operations/deployment-chartsmaster+1 -3
operations/deployment-chartsmaster+1 -0
operations/deployment-chartsmaster+11 -4
wikidata/query/rdfmaster+30 -8
operations/deployment-chartsmaster+1 -1
wikidata/query/rdfmaster+30 -21
operations/deployment-chartsmaster+4 -4
operations/deployment-chartsmaster+7 -5
operations/deployment-chartsmaster+1 -0
operations/deployment-chartsmaster+3 -1
operations/deployment-chartsmaster+3 -3
wikidata/query/rdfmaster+44 -2
wikidata/query/rdfmaster+87 -36
wikidata/query/rdfmaster+8 -8
operations/mediawiki-configmaster+140 -0
operations/deployment-chartsmaster+1 -0
wikidata/query/deploymaster+15 -15
wikidata/query/rdfmaster+1 K -63
wikidata/query/rdfmaster+21 -2
wikidata/query/rdfmaster+1 K -149
Show related patches Customize query in gerrit
Related Changes in GitLab:
TitleReferenceAuthorSource BranchDest Branch
Bump to 0.3.154repos/search-platform/flink-rdf-streaming-updater!26dcaussebump-to-0_3_154main
Bump to 0.3.154-82c97965 (snapshot)repos/search-platform/flink-rdf-streaming-updater!25dcaussebump-to-0_3_154-82c97965main
Bump to 0.3.153repos/search-platform/flink-rdf-streaming-updater!24dcaussebump-to-0_3_153main
Bump to 0.3.152repos/search-platform/flink-rdf-streaming-updater!23dcaussebump_to_0_3_150main
Bump to 0.3.150repos/search-platform/flink-rdf-streaming-updater!22dcaussebump-0.3.150main
Customize query in GitLab

Event Timeline

Gehel set the point value for this task to 5.Sep 23 2024, 3:41 PM

Change #1078991 had a related patch set uploaded (by DCausse; author: DCausse):

[wikidata/query/rdf@master] Add support for EventDataStreamFactory for input streams

https://gerrit.wikimedia.org/r/1078991

Change #1079538 had a related patch set uploaded (by DCausse; author: DCausse):

[wikidata/query/rdf@master] Add support for setting a start timestamp

https://gerrit.wikimedia.org/r/1079538

Change #1081221 had a related patch set uploaded (by DCausse; author: DCausse):

[wikidata/query/rdf@master] Add support for event platform in output streams

https://gerrit.wikimedia.org/r/1081221

Change #1078991 merged by jenkins-bot:

[wikidata/query/rdf@master] Add support for EventDataStreamFactory for input streams

https://gerrit.wikimedia.org/r/1078991

Change #1079538 merged by jenkins-bot:

[wikidata/query/rdf@master] Add support for setting a start timestamp

https://gerrit.wikimedia.org/r/1079538

Change #1081221 merged by jenkins-bot:

[wikidata/query/rdf@master] Add support for EventDataStreamFactory for output streams

https://gerrit.wikimedia.org/r/1081221

Change #1091290 had a related patch set uploaded (by DCausse; author: DCausse):

[wikidata/query/deploy@master] deploy version 0.3.150

https://gerrit.wikimedia.org/r/1091290

Change #1091306 had a related patch set uploaded (by DCausse; author: DCausse):

[operations/deployment-charts@master] rdf-streaming-updater: bump to 0.3.150

https://gerrit.wikimedia.org/r/1091306

Change #1092191 had a related patch set uploaded (by DCausse; author: DCausse):

[operations/deployment-charts@master] rdf-streaming-updater: produce rdf_change v2 events

https://gerrit.wikimedia.org/r/1092191

Change #1091290 merged by Bking:

[wikidata/query/deploy@master] deploy version 0.3.150

https://gerrit.wikimedia.org/r/1091290

Change #1092191 merged by jenkins-bot:

[operations/deployment-charts@master] rdf-streaming-updater: produce rdf_change v2 events

https://gerrit.wikimedia.org/r/1092191

Change #1099656 had a related patch set uploaded (by DCausse; author: DCausse):

[wikidata/query/rdf@master] Add canary event filtering for the streaming-updater-consumer

https://gerrit.wikimedia.org/r/1099656

Change #1099727 had a related patch set uploaded (by DCausse; author: DCausse):

[operations/mediawiki-config@master] rdf-streaming-updater: add wdqs udpater streams in event stream config

https://gerrit.wikimedia.org/r/1099727

Change #1102319 had a related patch set uploaded (by DCausse; author: DCausse):

[wikidata/query/rdf@master] Add support for versioned streams

https://gerrit.wikimedia.org/r/1102319

Change #1099727 merged by jenkins-bot:

[operations/mediawiki-config@master] rdf-streaming-updater: add wdqs udpater streams in event stream config

https://gerrit.wikimedia.org/r/1099727

Mentioned in SAL (#wikimedia-operations) [2024-12-17T14:05:33Z] <dcausse@deploy2002> Started scap sync-world: Backport for [[gerrit:1099727|rdf-streaming-updater: add wdqs udpater streams in event stream config (T374919)]], [[gerrit:1104598|cirrussearch: increase shard count for cebwiki_content (T379002)]]

Mentioned in SAL (#wikimedia-operations) [2024-12-17T14:14:55Z] <dcausse@deploy2002> dcausse: Backport for [[gerrit:1099727|rdf-streaming-updater: add wdqs udpater streams in event stream config (T374919)]], [[gerrit:1104598|cirrussearch: increase shard count for cebwiki_content (T379002)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug)

Mentioned in SAL (#wikimedia-operations) [2024-12-17T14:25:41Z] <dcausse@deploy2002> Finished scap sync-world: Backport for [[gerrit:1099727|rdf-streaming-updater: add wdqs udpater streams in event stream config (T374919)]], [[gerrit:1104598|cirrussearch: increase shard count for cebwiki_content (T379002)]] (duration: 20m 07s)

Change #1109450 had a related patch set uploaded (by DCausse; author: DCausse):

[wikidata/query/rdf@master] Rename outputMutationSchema to outputMutationSchemaVersion

https://gerrit.wikimedia.org/r/1109450

Change #1109450 merged by jenkins-bot:

[wikidata/query/rdf@master] Rename outputMutationSchema to outputMutationSchemaVersion

https://gerrit.wikimedia.org/r/1109450

Change #1102319 merged by jenkins-bot:

[wikidata/query/rdf@master] Add support for versioned streams

https://gerrit.wikimedia.org/r/1102319

Change #1099656 merged by jenkins-bot:

[wikidata/query/rdf@master] Add canary event filtering for the streaming-updater-consumer

https://gerrit.wikimedia.org/r/1099656

Change #1112251 had a related patch set uploaded (by DCausse; author: DCausse):

[wikidata/query/deploy@master] deploy version 0.3.152

https://gerrit.wikimedia.org/r/1112251

Change #1112762 had a related patch set uploaded (by DCausse; author: DCausse):

[operations/deployment-charts@master] wdqs: bump image version to 0.3.152

https://gerrit.wikimedia.org/r/1112762

Change #1112763 had a related patch set uploaded (by DCausse; author: DCausse):

[operations/deployment-charts@master] wdqs: enable new event stream api config in staging

https://gerrit.wikimedia.org/r/1112763

Change #1112762 merged by jenkins-bot:

[operations/deployment-charts@master] wdqs: bump image version to 0.3.152

https://gerrit.wikimedia.org/r/1112762

Change #1112763 merged by jenkins-bot:

[operations/deployment-charts@master] wdqs: enable new event stream api config in staging

https://gerrit.wikimedia.org/r/1112763

Change #1113080 had a related patch set uploaded (by DCausse; author: DCausse):

[operations/deployment-charts@master] wdqs: add missing page_change_content_models config entry

https://gerrit.wikimedia.org/r/1113080

Change #1113080 merged by jenkins-bot:

[operations/deployment-charts@master] wdqs: add missing page_change_content_models config entry

https://gerrit.wikimedia.org/r/1113080

Change #1113082 had a related patch set uploaded (by DCausse; author: DCausse):

[operations/deployment-charts@master] wdqs: add missing config entry main_output_stream

https://gerrit.wikimedia.org/r/1113082

Change #1113082 merged by jenkins-bot:

[operations/deployment-charts@master] wdqs: add missing config entry main_output_stream

https://gerrit.wikimedia.org/r/1113082

Change #1113150 had a related patch set uploaded (by DCausse; author: DCausse):

[operations/deployment-charts@master] wdqs: fix staging stream names

https://gerrit.wikimedia.org/r/1113150

Change #1113150 merged by jenkins-bot:

[operations/deployment-charts@master] wdqs: fix staging stream names

https://gerrit.wikimedia.org/r/1113150

Change #1113165 had a related patch set uploaded (by DCausse; author: DCausse):

[wikidata/query/rdf@master] streaming-updater-producer: fix versionned streams support

https://gerrit.wikimedia.org/r/1113165

Change #1113165 merged by jenkins-bot:

[wikidata/query/rdf@master] streaming-updater-producer: fix versionned streams support

https://gerrit.wikimedia.org/r/1113165

Change #1113184 had a related patch set uploaded (by DCausse; author: DCausse):

[operations/deployment-charts@master] wdqs: bump image to 0.3.153

https://gerrit.wikimedia.org/r/1113184

Change #1113184 merged by jenkins-bot:

[operations/deployment-charts@master] wdqs: bump image to 0.3.153

https://gerrit.wikimedia.org/r/1113184

Change #1113204 had a related patch set uploaded (by DCausse; author: DCausse):

[wikidata/query/rdf@master] streaming-updater-producer: use long for sequence fields

https://gerrit.wikimedia.org/r/1113204

Change #1113204 merged by jenkins-bot:

[wikidata/query/rdf@master] streaming-updater-producer: use long for sequence fields

https://gerrit.wikimedia.org/r/1113204

Change #1113466 had a related patch set uploaded (by DCausse; author: DCausse):

[operations/deployment-charts@master] wdqs: bump to 0.3.154 and enable event utilities APIs

https://gerrit.wikimedia.org/r/1113466

Change #1113743 had a related patch set uploaded (by DCausse; author: DCausse):

[operations/deployment-charts@master] wdqs: bump to 0.3.154 and enable event utilities APIs (2/3)

https://gerrit.wikimedia.org/r/1113743

Change #1113744 had a related patch set uploaded (by DCausse; author: DCausse):

[operations/deployment-charts@master] wdqs: bump to 0.3.154 and enable event utilities APIs (Step 3/3)

https://gerrit.wikimedia.org/r/1113744

Change #1113745 had a related patch set uploaded (by DCausse; author: DCausse):

[operations/deployment-charts@master] wdqs: cleanup unused settings

https://gerrit.wikimedia.org/r/1113745

Change #1113466 merged by jenkins-bot:

[operations/deployment-charts@master] wdqs: bump to 0.3.154 and enable event utilities APIs (1/3)

https://gerrit.wikimedia.org/r/1113466

Change #1113743 merged by jenkins-bot:

[operations/deployment-charts@master] wdqs: bump to 0.3.154 and enable event utilities APIs (2/3)

https://gerrit.wikimedia.org/r/1113743

Change #1113744 merged by jenkins-bot:

[operations/deployment-charts@master] wdqs: bump to 0.3.154 and enable event utilities APIs (3/3)

https://gerrit.wikimedia.org/r/1113744

Change #1113745 merged by jenkins-bot:

[operations/deployment-charts@master] wdqs: cleanup unused settings

https://gerrit.wikimedia.org/r/1113745

Change #1112251 merged by Ryan Kemper:

[wikidata/query/deploy@master] deploy version 0.3.155

https://gerrit.wikimedia.org/r/1112251