
SUP: Partition update_pipeline kafka topic
Closed, Resolved | Public | 3 Estimated Story Points

Description

Update processing cannot be handled by a single Flink operator slot. To support parallel processing inside Flink (and to hint at it), the topic should be partitioned.

AC:

  • SRE/ops: kafka-test is configured to use 5 partitions for topic (eqiad|codfw).cirrussearch.update_pipeline.update.rc0
  • stream configuration for cirrussearch.update_pipeline.update.rc0 should define message_key_fields (see 983719)
  • producer writes explicitly keyed records to the kafka sink
  • consumer reads keyed records from the partitioned topic and reinterprets them as a keyed stream
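
A minimal sketch of what "explicitly keyed records" buys us, under stated assumptions: the key fields (`wiki_id`, `page_id`) are hypothetical placeholders for whatever message_key_fields ends up defining, and CRC32 is only a stand-in for Kafka's default partitioner, which uses a different hash (murmur2). The point is only that records sharing a key always land in the same partition, so per-key ordering is preserved while different keys spread across the 5 partitions.

```python
import json
import zlib

NUM_PARTITIONS = 5  # partition count requested in the AC

# Hypothetical key fields; the real ones come from message_key_fields
# in the stream configuration.
KEY_FIELDS = ("wiki_id", "page_id")


def message_key(event: dict) -> bytes:
    """Serialize only the key fields, deterministically, to bytes."""
    key = {field: event[field] for field in KEY_FIELDS}
    return json.dumps(key, sort_keys=True, separators=(",", ":")).encode("utf-8")


def partition_for(key: bytes, num_partitions: int = NUM_PARTITIONS) -> int:
    """Map a key to a partition.

    Stand-in hash for illustration: Kafka's default partitioner
    uses murmur2, not CRC32.
    """
    return zlib.crc32(key) % num_partitions


events = [
    {"wiki_id": "enwiki", "page_id": 1, "rev_id": 100},
    {"wiki_id": "enwiki", "page_id": 1, "rev_id": 101},  # same key as above
    {"wiki_id": "dewiki", "page_id": 7, "rev_id": 55},
]

partitions = [partition_for(message_key(e)) for e in events]
for event, partition in zip(events, partitions):
    print(event["wiki_id"], event["page_id"], event["rev_id"], "->", partition)
```

Because the first two events share a key, they are assigned the same partition. That property is what would let the consumer treat the topic as already keyed instead of reshuffling.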

Event Timeline

pfischer set the point value for this task to 3. (Jan 2 2024, 8:40 AM)
pfischer renamed this task from SUP: Partition update topic to SUP: Partition update_pipeline kafka topic. (Jan 2 2024, 9:46 AM)
pfischer changed the task status from Open to In Progress.
pfischer triaged this task as High priority.
pfischer added projects: serviceops, SRE.
pfischer updated the task description.
pfischer moved this task from needs triage to Current work on the Discovery-Search board.
pfischer updated Other Assignee, added: pfischer.
pfischer added a project: Data-Platform-SRE.

Change 987160 had a related patch set uploaded (by Peter Fischer; author: Peter Fischer):

[operations/deployment-charts@master] Search update pipeline: enable kafka partition discovery

https://gerrit.wikimedia.org/r/987160

Change 987160 merged by jenkins-bot:

[operations/deployment-charts@master] Search update pipeline: enable kafka partition discovery

https://gerrit.wikimedia.org/r/987160

Change 987395 had a related patch set uploaded (by Peter Fischer; author: Peter Fischer):

[operations/deployment-charts@master] Search update pipeline: bump version

https://gerrit.wikimedia.org/r/987395

Change 987395 merged by jenkins-bot:

[operations/deployment-charts@master] Search update pipeline: bump version

https://gerrit.wikimedia.org/r/987395

brouberol@kafka-test1010:~$ kafka topics --topic codfw.cirrussearch.update_pipeline.update.rc0 --alter --partitions 5
kafka-topics --zookeeper zookeeper-test1002.eqiad.wmnet/kafka/test-eqiad --topic codfw.cirrussearch.update_pipeline.update.rc0 --alter --partitions 5
WARNING: If partitions are increased for a topic that has a key, the partition logic or ordering of the messages will be affected
Adding partitions succeeded!
brouberol@kafka-test1010:~$ kafka topics --topic eqiad.cirrussearch.update_pipeline.update.rc0 --alter --partitions 5
kafka-topics --zookeeper zookeeper-test1002.eqiad.wmnet/kafka/test-eqiad --topic eqiad.cirrussearch.update_pipeline.update.rc0 --alter --partitions 5
WARNING: If partitions are increased for a topic that has a key, the partition logic or ordering of the messages will be affected
Adding partitions succeeded!
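
The WARNING printed above can be illustrated with a small sketch. Assumptions: the previous partition count of 1 is a guess (the log does not state it), the `enwiki:<page_id>` keys are made up, and CRC32 stands in for Kafka's actual murmur2-based partitioner. With any hash-modulo scheme, changing the partition count changes where existing keys map, so records written before and after the change may sit in different partitions for the same key.

```python
import zlib


def partition_for(key: bytes, num_partitions: int) -> int:
    """Hash-modulo partitioning (CRC32 as a stand-in for Kafka's murmur2)."""
    return zlib.crc32(key) % num_partitions


OLD_PARTITIONS = 1  # assumption: not stated in the log above
NEW_PARTITIONS = 5  # value passed to --partitions

# Made-up keys for illustration only.
keys = [f"enwiki:{page_id}".encode("utf-8") for page_id in range(1000)]

# Keys whose partition differs between the old and the new layout.
moved = [
    key
    for key in keys
    if partition_for(key, OLD_PARTITIONS) != partition_for(key, NEW_PARTITIONS)
]

print(f"{len(moved)} of {len(keys)} keys map to a different partition")
```

For a topic that is still being drained by a consumer relying on per-key ordering, this remapping is the reason to resize before keyed traffic matters, or to accept a short window of reordering.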
pfischer updated the task description.
pfischer moved this task from Incoming to In Progress on the Discovery-Search (Current work) board.
pfischer updated Other Assignee, removed: pfischer.