Page MenuHomePhabricator

Upgrade the WDQS streaming updater to latest flink (1.16)
Closed, ResolvedPublic5 Estimated Story Points

Description

Upstream bug we might have seen while running on our k8s setup:

Upstream bug that prevents us from upgrading to 1.13:

Event Timeline

Gehel set the point value for this task to 5.Aug 30 2021, 3:53 PM
dcausse renamed this task from Upgrade to latest flink (1.13.2) to Upgrade to latest flink (1.14).Sep 7 2021, 5:52 PM
dcausse updated the task description. (Show Details)

Change 719456 had a related patch set uploaded (by DCausse; author: DCausse):

[wikidata/query/rdf@master] Upgrade to flink 1.14.0

https://gerrit.wikimedia.org/r/719456

Gehel triaged this task as Medium priority.Feb 22 2022, 8:25 PM
Gehel moved this task from Current work to Scaling on the Wikidata-Query-Service board.
dcausse renamed this task from Upgrade to latest flink (1.14) to Upgrade the WDQS streaming updater to latest flink (1.15).Aug 29 2022, 12:26 PM
dcausse added a subscriber: Event-Platform.
dcausse renamed this task from Upgrade the WDQS streaming updater to latest flink (1.15) to Upgrade the WDQS streaming updater to latest flink (1.16).Jan 13 2023, 3:37 PM

Change 879822 had a related patch set uploaded (by DCausse; author: DCausse):

[wikidata/query/rdf@master] Upgrade to flink 1.16.0

https://gerrit.wikimedia.org/r/879822

Change 881893 had a related patch set uploaded (by DCausse; author: DCausse):

[wikidata/query/flink-rdf-streaming-updater@master] Upgrade to flink 1.16

https://gerrit.wikimedia.org/r/881893

Change 882748 had a related patch set uploaded (by Bking; author: Bking):

[operations/puppet@production] dse-k8s: add rdf-streaming-update-ng namespace

https://gerrit.wikimedia.org/r/882748

Change 882748 abandoned by Bking:

[operations/puppet@production] dse-k8s: add rdf-streaming-update-ng namespace

Reason:

Not needed, we will use the current 'rdf-streaming-updater' namespace

https://gerrit.wikimedia.org/r/882748

Change 882748 restored by Bking:

[operations/puppet@production] dse-k8s: add rdf-streaming-update-ng namespace

https://gerrit.wikimedia.org/r/882748

The updater with flink 1.16 did run for about 3 days in yarn, notable change is that it required 5g of mem to run without failure during backfills. Current task_manager limits on k8s is at 2500M so this means doubling this value for both data centers and both jobs (~15G in each DC). There might be lower values that work but it might require tuning every components individually so if the mem is available I'd be for just increasing the mem limits.

Change 886116 had a related patch set uploaded (by DCausse; author: DCausse):

[wikidata/query/rdf@flink_1_16] Upgrade to flink 1.14.6

https://gerrit.wikimedia.org/r/886116

Change 886117 had a related patch set uploaded (by DCausse; author: DCausse):

[wikidata/query/rdf@flink_1_16] Upgrade to flink 1.16.0

https://gerrit.wikimedia.org/r/886117

Change 719456 abandoned by DCausse:

[wikidata/query/rdf@master] Upgrade to flink 1.14.6

Reason:

moved to the flink_1_16 branch

https://gerrit.wikimedia.org/r/719456

Change 879822 abandoned by DCausse:

[wikidata/query/rdf@master] Upgrade to flink 1.16.0

Reason:

moved to the flink_1_16 branch

https://gerrit.wikimedia.org/r/879822

Change 886116 merged by jenkins-bot:

[wikidata/query/rdf@flink_1_16] Upgrade to flink 1.14.6

https://gerrit.wikimedia.org/r/886116

Change 886117 merged by jenkins-bot:

[wikidata/query/rdf@flink_1_16] Upgrade to flink 1.16.0

https://gerrit.wikimedia.org/r/886117

Change 719456 restored by DCausse:

[wikidata/query/rdf@master] Upgrade to flink 1.14.6

https://gerrit.wikimedia.org/r/719456

Change 879822 restored by DCausse:

[wikidata/query/rdf@master] Upgrade to flink 1.16.0

https://gerrit.wikimedia.org/r/879822

Change 719456 merged by jenkins-bot:

[wikidata/query/rdf@master] Upgrade to flink 1.14.6

https://gerrit.wikimedia.org/r/719456

Change 879822 merged by jenkins-bot:

[wikidata/query/rdf@master] Upgrade to flink 1.16.0

https://gerrit.wikimedia.org/r/879822

Change 881893 abandoned by DCausse:

[wikidata/query/flink-rdf-streaming-updater@master] Upgrade to flink 1.16

Reason:

project moved

https://gerrit.wikimedia.org/r/881893