Right now, when the Updater starts, the Kafka poller uses a timestamp to re-sync with the data stream. While this is the only option when starting from a dump, when restarting from Kafka polling we could preserve the current Kafka position and resume from it when possible.
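The proposed start-up decision could be sketched roughly as follows. This is a minimal illustration with no Kafka dependency, not the code from the actual patch: the class `StartPositionSketch`, its `resolve` method, and the string partition keys are all hypothetical names. In a real consumer, the two branches would presumably map to `KafkaConsumer.seek` for a stored offset and `KafkaConsumer.offsetsForTimes` for the timestamp fallback.

```java
import java.time.Instant;
import java.util.Arrays;
import java.util.HashMap;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch: none of these names come from the wikidata/query/rdf code.
class StartPositionSketch {

    /** Where the poller should begin reading a single partition. */
    static final class StartPosition {
        final Long offset;        // non-null: resume from this stored offset
        final Instant timestamp;  // non-null: fall back to time-based re-sync

        private StartPosition(Long offset, Instant timestamp) {
            this.offset = offset;
            this.timestamp = timestamp;
        }

        static StartPosition atOffset(long offset) {
            return new StartPosition(offset, null);
        }

        static StartPosition atTime(Instant timestamp) {
            return new StartPosition(null, timestamp);
        }

        boolean usesStoredOffset() {
            return offset != null;
        }

        @Override
        public String toString() {
            return usesStoredOffset() ? "offset " + offset : "timestamp " + timestamp;
        }
    }

    /**
     * For each partition, prefer the offset saved by a previous run;
     * when none was saved (for example after loading a dump), use the timestamp.
     */
    static Map<String, StartPosition> resolve(List<String> partitions,
                                              Map<String, Long> storedOffsets,
                                              Instant fallbackTime) {
        Map<String, StartPosition> result = new LinkedHashMap<>();
        for (String partition : partitions) {
            Long stored = storedOffsets.get(partition);
            result.put(partition, stored != null
                    ? StartPosition.atOffset(stored)
                    : StartPosition.atTime(fallbackTime));
        }
        return result;
    }

    public static void main(String[] args) {
        // Assumed example data: one partition has a saved offset, the other does not.
        Map<String, Long> stored = new HashMap<>();
        stored.put("topic-a/0", 1234L);
        Instant lastUpdate = Instant.parse("2018-04-01T00:00:00Z");

        resolve(Arrays.asList("topic-a/0", "topic-b/0"), stored, lastUpdate)
                .forEach((partition, pos) -> System.out.println(partition + " -> " + pos));
    }
}
```

Deciding per partition means a newly added topic can still be positioned by timestamp while existing ones resume exactly where they stopped.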
Description
Details
Subject | Repo | Branch | Lines +/-
---|---|---|---
Store Kafka offsets in the DB | wikidata/query/rdf | master | +516 -85
Status | Subtype | Assigned | Task
---|---|---|---
Resolved | | Gehel | T189458 re-enable wdqs kafka poller
Resolved | | Smalyshev | T192963 Store Kafka poller position data in the WDQS database
Event Timeline
Change 421348 had a related patch set uploaded (by Smalyshev; owner: Smalyshev):
[wikidata/query/rdf@master] Store Kafka offsets in the DB
Change 421348 merged by jenkins-bot:
[wikidata/query/rdf@master] Store Kafka offsets in the DB