Page MenuHomePhabricator

Create a dse-k8s service demonstrating how to run the rdf-streaming-updater using the flink-app chart
Closed, ResolvedPublic5 Estimated Story Points

Description

The flink-k8s-operator is now available on the dse-k8s cluster. We should create a new helmfile using the flink-app chart to run an experimentation and learn how to operate a flink job using this technique.

AC:

  • a test job is running in the dse-k8s cluster

Details

Related Changes in Gerrit:
SubjectRepoBranchLines +/-
operations/deployment-chartsmaster+2 -1
operations/deployment-chartsmaster+2 -2
wikidata/query/rdfflink_1_16+5 -2
operations/deployment-chartsmaster+2 -3
operations/deployment-chartsmaster+3 -7
operations/deployment-chartsmaster+2 -0
operations/deployment-chartsmaster+4 -10
operations/deployment-chartsmaster+1 -1
operations/deployment-chartsmaster+3 -2
operations/deployment-chartsmaster+5 -4
operations/deployment-chartsmaster+2 -0
operations/deployment-chartsmaster+1 -1
operations/deployment-chartsmaster+1 -1
operations/deployment-chartsmaster+1 -1
operations/deployment-chartsmaster+3 -3
operations/deployment-chartsmaster+1 -1
operations/deployment-chartsmaster+1 -1
operations/deployment-chartsmaster+1 -0
operations/deployment-chartsmaster+2 -0
operations/deployment-chartsmaster+3 -3
operations/deployment-chartsmaster+1 -1
operations/deployment-chartsmaster+194 -0
Show related patches Customize query in gerrit

Event Timeline

There are a very large number of changes, so older changes are hidden. Show Older Changes

Change 886005 had a related patch set uploaded (by DCausse; author: DCausse):

[operations/deployment-charts@master] [WIP] rdf-streaming-updater: add a test job using the k8s operator...

https://gerrit.wikimedia.org/r/886005

Gehel set the point value for this task to 5.
dcausse moved this task from incoming to in progress on the Wikidata board.

Change 886005 merged by jenkins-bot:

[operations/deployment-charts@master] rdf-streaming-updater: add a test job using the k8s operator...

https://gerrit.wikimedia.org/r/886005

Change 901218 had a related patch set uploaded (by Bking; author: Bking):

[operations/deployment-charts@master] rdf-streaming-updater: use correct resource name

https://gerrit.wikimedia.org/r/901218

Change 901218 merged by jenkins-bot:

[operations/deployment-charts@master] rdf-streaming-updater: use correct resource name

https://gerrit.wikimedia.org/r/901218

Change 901226 had a related patch set uploaded (by Bking; author: Bking):

[operations/deployment-charts@master] rdf-streaming-update: use correct data type

https://gerrit.wikimedia.org/r/901226

Change 901226 merged by Bking:

[operations/deployment-charts@master] rdf-streaming-update: use correct data type

https://gerrit.wikimedia.org/r/901226

Change 901240 had a related patch set uploaded (by Bking; author: Bking):

[operations/deployment-charts@master] rdf-streaming-updater: use correct release and app

https://gerrit.wikimedia.org/r/901240

Change 901253 had a related patch set uploaded (by DCausse; author: DCausse):

[operations/deployment-charts@master] dse-k8s-eqiad: flink-operator should watch rdf-streaming-updater

https://gerrit.wikimedia.org/r/901253

Change 901253 merged by jenkins-bot:

[operations/deployment-charts@master] dse-k8s-eqiad: flink-operator should watch rdf-streaming-updater

https://gerrit.wikimedia.org/r/901253

Change 901608 had a related patch set uploaded (by DCausse; author: DCausse):

[operations/deployment-charts@master] rdf-streaming-updater: add jarURI

https://gerrit.wikimedia.org/r/901608

Change 901608 merged by jenkins-bot:

[operations/deployment-charts@master] rdf-streaming-updater: add jarURI

https://gerrit.wikimedia.org/r/901608

Change 901617 had a related patch set uploaded (by DCausse; author: DCausse):

[operations/deployment-charts@master] rdf-streaming-updater: jarURI should use local:// not file://

https://gerrit.wikimedia.org/r/901617

Change 901617 merged by jenkins-bot:

[operations/deployment-charts@master] rdf-streaming-updater: jarURI should use local:// not file://

https://gerrit.wikimedia.org/r/901617

Change 901619 had a related patch set uploaded (by DCausse; author: DCausse):

[operations/deployment-charts@master] rdf-streaming-updater: fix docker image URL

https://gerrit.wikimedia.org/r/901619

Change 901619 merged by jenkins-bot:

[operations/deployment-charts@master] rdf-streaming-updater: fix docker image URL

https://gerrit.wikimedia.org/r/901619

Change 901240 merged by jenkins-bot:

[operations/deployment-charts@master] rdf-streaming-updater: use correct release and app

https://gerrit.wikimedia.org/r/901240

Change 903266 had a related patch set uploaded (by Bking; author: Bking):

[operations/deployment-charts@master] rdf-streaming-updater: use correct config path for dse-k8s-eqiad

https://gerrit.wikimedia.org/r/903266

Change 903266 merged by jenkins-bot:

[operations/deployment-charts@master] rdf-streaming-updater: use correct config path for dse-k8s-eqiad

https://gerrit.wikimedia.org/r/903266

Change 903659 had a related patch set uploaded (by Bking; author: Bking):

[operations/deployment-charts@master] rdf-streaming-updater: raise taskManager mem in dse-k8s

https://gerrit.wikimedia.org/r/903659

Change 903659 merged by jenkins-bot:

[operations/deployment-charts@master] rdf-streaming-updater: raise taskManager mem in dse-k8s

https://gerrit.wikimedia.org/r/903659

Change 903689 had a related patch set uploaded (by DCausse; author: DCausse):

[operations/deployment-charts@master] rdf-streaming-updater: still use PLAINTEXT for kafka

https://gerrit.wikimedia.org/r/903689

Change 903689 merged by jenkins-bot:

[operations/deployment-charts@master] rdf-streaming-updater: still use PLAINTEXT for kafka

https://gerrit.wikimedia.org/r/903689

Change 903696 had a related patch set uploaded (by DCausse; author: DCausse):

[operations/deployment-charts@master] rdf-streaming-updater: enable mesh sidecar container

https://gerrit.wikimedia.org/r/903696

Change 903696 merged by jenkins-bot:

[operations/deployment-charts@master] rdf-streaming-updater: enable mesh sidecar container

https://gerrit.wikimedia.org/r/903696

Change 903729 had a related patch set uploaded (by DCausse; author: DCausse):

[operations/deployment-charts@master] flink-app: include mesh.deployment.volume

https://gerrit.wikimedia.org/r/903729

Change 903729 merged by jenkins-bot:

[operations/deployment-charts@master] flink-app: include mesh.deployment.volume

https://gerrit.wikimedia.org/r/903729

Change 903740 had a related patch set uploaded (by DCausse; author: DCausse):

[operations/deployment-charts@master] flink-app: add envoy configmaps

https://gerrit.wikimedia.org/r/903740

Change 904188 had a related patch set uploaded (by DCausse; author: DCausse):

[operations/deployment-charts@master] rdf-streaming-updater: use newer bootstrap state

https://gerrit.wikimedia.org/r/904188

Change 903740 merged by jenkins-bot:

[operations/deployment-charts@master] flink-app: add envoy configmaps

https://gerrit.wikimedia.org/r/903740

Change 904188 merged by jenkins-bot:

[operations/deployment-charts@master] rdf-streaming-updater: use newer bootstrap state

https://gerrit.wikimedia.org/r/904188

Change 904464 had a related patch set uploaded (by DCausse; author: DCausse):

[operations/deployment-charts@master] [DNM] flink-app: always include /etc/envoy/ssl/ca.crt

https://gerrit.wikimedia.org/r/904464

Change 904464 abandoned by DCausse:

[operations/deployment-charts@master] [DNM] flink-app: always include /etc/envoy/ssl/ca.crt

Reason:

Installing the wmf-certificates package is better solution https://phabricator.wikimedia.org/T333551

https://gerrit.wikimedia.org/r/904464

Change 904553 had a related patch set uploaded (by DCausse; author: DCausse):

[operations/deployment-charts@master] rdf-streaming-updater: temp fix, pin envoy image version to 1.18.3-2

https://gerrit.wikimedia.org/r/904553

Change 904553 merged by jenkins-bot:

[operations/deployment-charts@master] rdf-streaming-updater: temp fix, pin envoy image version to 1.18.3-2

https://gerrit.wikimedia.org/r/904553

Change 904762 had a related patch set uploaded (by DCausse; author: DCausse):

[operations/deployment-charts@master] flink-app: update to mesh.configuration 1.2.0

https://gerrit.wikimedia.org/r/904762

Change 904763 had a related patch set uploaded (by DCausse; author: DCausse):

[operations/deployment-charts@master] rdf-streaming-updater: bump job image to flink-1.16-rc2...

https://gerrit.wikimedia.org/r/904763

Change 904762 merged by jenkins-bot:

[operations/deployment-charts@master] flink-app: update to mesh.configuration 1.2.0

https://gerrit.wikimedia.org/r/904762

Change 904763 merged by jenkins-bot:

[operations/deployment-charts@master] rdf-streaming-updater: bump job image to flink-1.16-rc2

https://gerrit.wikimedia.org/r/904763

currently still stuck with

scala.MatchError: None (of class scala.None$)
	at org.wikidata.query.rdf.updater.SideOutputSerializationSchema.getRecordClock(SideOutputSerializationSchema.scala:31)
	at org.wikidata.query.rdf.updater.SideOutputSerializationSchema.clock$lzycompute(SideOutputSerializationSchema.scala:40)
	at org.wikidata.query.rdf.updater.SideOutputSerializationSchema.clock(SideOutputSerializationSchema.scala:40)
	at org.wikidata.query.rdf.updater.SideOutputSerializationSchema.serialize(SideOutputSerializationSchema.scala:73)
	at org.apache.flink.streaming.connectors.kafka.FlinkKafkaProducer.invoke(FlinkKafkaProducer.java:918)
	at org.apache.flink.streaming.connectors.kafka.FlinkKafkaProducer.invoke(FlinkKafkaProducer.java:101)
	at org.apache.flink.streaming.api.functions.sink.TwoPhaseCommitSinkFunction.invoke(TwoPhaseCommitSinkFunction.java:245)
	at org.apache.flink.streaming.api.operators.StreamSink.processElement(StreamSink.java:54)
	at org.apache.flink.streaming.runtime.tasks.ChainingOutput.pushToOperator(ChainingOutput.java:99)
	at org.apache.flink.streaming.runtime.tasks.ChainingOutput.collect(ChainingOutput.java:86)
	at org.apache.flink.streaming.runtime.tasks.CopyingBroadcastingOutputCollector.collect(CopyingBroadcastingOutputCollector.java:60)
	at org.apache.flink.streaming.api.operators.CountingOutput.collect(CountingOutput.java:62)
	at org.apache.flink.streaming.api.operators.KeyedProcessOperator$ContextImpl.output(KeyedProcessOperator.java:129)
	at org.wikidata.query.rdf.updater.ReorderAndDecideMutationOperation.fireEvent(ReorderAndDecideMutationOperation.scala:112)
	at org.wikidata.query.rdf.updater.ReorderAndDecideMutationOperation.processElement(ReorderAndDecideMutationOperation.scala:29)
	at org.wikidata.query.rdf.updater.ReorderAndDecideMutationOperation.processElement(ReorderAndDecideMutationOperation.scala:12)
	at org.apache.flink.streaming.api.operators.KeyedProcessOperator.processElement(KeyedProcessOperator.java:83)
	at org.apache.flink.streaming.runtime.tasks.OneInputStreamTask$StreamTaskNetworkOutput.emitRecord(OneInputStreamTask.java:233)
	at org.apache.flink.streaming.runtime.io.AbstractStreamTaskNetworkInput.processElement(AbstractStreamTaskNetworkInput.java:134)
	at org.apache.flink.streaming.runtime.io.AbstractStreamTaskNetworkInput.emitNext(AbstractStreamTaskNetworkInput.java:105)
	at org.apache.flink.streaming.runtime.io.StreamOneInputProcessor.processInput(StreamOneInputProcessor.java:65)
	at org.apache.flink.streaming.runtime.tasks.StreamTask.processInput(StreamTask.java:542)
	at org.apache.flink.streaming.runtime.tasks.mailbox.MailboxProcessor.runMailboxLoop(MailboxProcessor.java:231)
	at org.apache.flink.streaming.runtime.tasks.StreamTask.runMailboxLoop(StreamTask.java:831)
	at org.apache.flink.streaming.runtime.tasks.StreamTask.invoke(StreamTask.java:780)
	at org.apache.flink.runtime.taskmanager.Task.runWithSystemExitMonitoring(Task.java:935)
	at org.apache.flink.runtime.taskmanager.Task.restoreAndInvoke(Task.java:914)
	at org.apache.flink.runtime.taskmanager.Task.doRun(Task.java:728)
	at org.apache.flink.runtime.taskmanager.Task.run(Task.java:550)
	at java.base/java.lang.Thread.run(Thread.java:829)

suspecting a classloader problem...

Change 904797 had a related patch set uploaded (by DCausse; author: DCausse):

[wikidata/query/rdf@flink_1_16] Add debug code to identify classloading or serialization issues

https://gerrit.wikimedia.org/r/904797

Change 904797 merged by jenkins-bot:

[wikidata/query/rdf@flink_1_16] Add debug code to identify classloading or serialization issues

https://gerrit.wikimedia.org/r/904797

Change 904813 had a related patch set uploaded (by DCausse; author: DCausse):

[operations/deployment-charts@master] rdf-streaming-updater: bump image version to flink-1.16-rc3

https://gerrit.wikimedia.org/r/904813

Change 904813 merged by jenkins-bot:

[operations/deployment-charts@master] rdf-streaming-updater: bump image version to flink-1.16-rc3

https://gerrit.wikimedia.org/r/904813

Change 905602 had a related patch set uploaded (by DCausse; author: DCausse):

[operations/deployment-charts@master] rdf-streaming-updater: tune mem overhead

https://gerrit.wikimedia.org/r/905602

Change 905602 merged by jenkins-bot:

[operations/deployment-charts@master] rdf-streaming-updater: tune mem overhead

https://gerrit.wikimedia.org/r/905602