Maniphest T346456

Improve concurrency limits configuration of the wdqs updater
Closed, ResolvedPublic3 Estimated Story Points
Actions

Assigned To

Authored By

	• dcausse
	Sep 15 2023, 5:01 PM

Tags

Referenced Files

None

Subscribers

Clement_Goubert

Description

The WDQS updater have several config options to reduce the concurrency at which it calls the MW api.

The config option wikibase_repo_thread_pool_size controls the size of the thread pool running HTTP requests.

During a test of this application with zookeeper and the flink-k8s-operator we had to backfill around 2weeks of updates and this caused a massive load on the mw-api-int cluster.

We lowered this value from 30 to 5 expecting to see a 1/6 fold reduction but this reduction was nowhere near what was expected, we barely saw the impact suggesting that the current limits are already too high and the system is limited by the endpoint capacity not by itself.

Looking a the code this limits is imposed on the HTTP thread pool that is attached to a job task, given that we run at a parallelism of 12 this means that the actual number of concurrent requests is parallelism * wikibase_repo_thread_pool_size.

So we went from 30*12=360 to 5*12=60.

We should definitely change how this is configured to take the flink parallelism into account.

AC:

the updater should have a single option to control the MW requests concurrency
we should probably not run the AsyncOp over all the 12 tasks

Details

	Subject	Repo	Branch	Lines +/-
	flink: simplify parallelism setup	wikidata/query/deploy	master	+3 -3
	rdf-streaming-updater: simplify parallelism configuration	operations/deployment-charts	master	+2 -2
	search: simplify flink parallelism configuration	operations/alerts	master	+3 -3
	rdf-streaming-updater: simplify parallelism and use newer kafka APIs	operations/deployment-charts	master	+6 -5
	Add mediawiki_max_concurrent_requests	wikidata/query/rdf	master	+69 -22
	[WIP] Rework how concurrency limits are configured	wikidata/query/rdf	master	+119 -14

Customize query in gerrit

Related Objects

Mentioned In: T349848: Determine and control cirrus streaming updater's usage of MWAPI resources

Event Timeline

• dcausse created this task.Sep 15 2023, 5:01 PM

Restricted Application added a subscriber: Aklapper. · View Herald TranscriptSep 15 2023, 5:01 PM

• dcausse updated the task description. (Show Details)Sep 15 2023, 5:02 PM

bking added a parent task: T342149: Test common operations in the flink operator/k8s/Flink ZK environment.Sep 15 2023, 5:13 PM

Clement_Goubert moved this task from Incoming 🐫 to 🛎 Services & Oids on the serviceops board.Sep 18 2023, 11:29 AM

Maintenance_bot added a project: Wikidata.Sep 18 2023, 11:45 AM

Restricted Application added a project: [DEPRECATED] wdwb-tech. · View Herald TranscriptSep 18 2023, 11:45 AM

EBernhardson moved this task from Incoming to Current work on the Wikidata-Query-Service board.Sep 18 2023, 3:35 PM

EBernhardson added a project: Discovery-Search.Sep 18 2023, 3:46 PM

TJones moved this task from needs triage to Current work on the Discovery-Search board.Sep 18 2023, 3:47 PM

TJones edited projects, added Discovery-Search (Current work); removed Discovery-Search.

EBernhardson set the point value for this task to 3.Sep 18 2023, 3:48 PM

• dcausse moved this task from Incoming to Ready for Dev -- SWE on the Discovery-Search (Current work) board.Sep 18 2023, 3:49 PM

• dcausse claimed this task.Sep 20 2023, 1:38 PM

• dcausse moved this task from Ready for Dev -- SWE to In Progress on the Discovery-Search (Current work) board.

Gehel removed a parent task: T342149: Test common operations in the flink operator/k8s/Flink ZK environment.Sep 20 2023, 1:47 PM

Change 960097 had a related patch set uploaded (by DCausse; author: DCausse):

[wikidata/query/rdf@master] Rework how concurrency limits are configured

https://gerrit.wikimedia.org/r/960097

gerritbot added a project: Patch-For-Review.Sep 22 2023, 4:05 PM

Change 960097 abandoned by DCausse:

[wikidata/query/rdf@master] [WIP] Rework how concurrency limits are configured

Reason:

will restore if actually needed

https://gerrit.wikimedia.org/r/960097

Maintenance_bot removed a project: Patch-For-Review.Sep 25 2023, 9:10 AM

Change 960554 had a related patch set uploaded (by DCausse; author: DCausse):

[wikidata/query/rdf@master] Add mediawiki_max_concurrent_requests

https://gerrit.wikimedia.org/r/960554

gerritbot added a project: Patch-For-Review.Sep 25 2023, 9:15 AM

Change 961020 had a related patch set uploaded (by DCausse; author: DCausse):

[operations/alerts@master] search: simplify flink parallelism configuration

https://gerrit.wikimedia.org/r/961020

Change 961024 had a related patch set uploaded (by DCausse; author: DCausse):

[operations/deployment-charts@master] rdf-streaming-updater: simplify parallelism configuration

https://gerrit.wikimedia.org/r/961024

Change 960554 merged by jenkins-bot:

[wikidata/query/rdf@master] Add mediawiki_max_concurrent_requests

https://gerrit.wikimedia.org/r/960554

Change 961138 had a related patch set uploaded (by DCausse; author: DCausse):

[wikidata/query/deploy@master] flink: simplify parallelism setup

https://gerrit.wikimedia.org/r/961138

• dcausse moved this task from In Progress to To Be Deployed on the Discovery-Search (Current work) board.Sep 28 2023, 6:27 PM

Change 963105 had a related patch set uploaded (by DCausse; author: DCausse):

[operations/deployment-charts@master] rdf-streaming-updater: simplify parallelism and use newer kafka APIs

https://gerrit.wikimedia.org/r/963105

Change 963105 merged by Bking:

[operations/deployment-charts@master] rdf-streaming-updater: simplify parallelism and use newer kafka APIs

https://gerrit.wikimedia.org/r/963105

bking added a parent task: T326409: Migrate the wdqs streaming updater flink jobs to flink-k8s-operator deployment model.Oct 27 2023, 3:46 PM

bking removed a parent task: T326409: Migrate the wdqs streaming updater flink jobs to flink-k8s-operator deployment model.Oct 27 2023, 4:01 PM

bking mentioned this in T349848: Determine and control cirrus streaming updater's usage of MWAPI resources.

Change 961020 merged by Bking:

[operations/alerts@master] search: simplify flink parallelism configuration

https://gerrit.wikimedia.org/r/961020

Change 961024 abandoned by DCausse:

[operations/deployment-charts@master] rdf-streaming-updater: simplify parallelism configuration

Reason:

superseded by Id9a9ec3873f1eebfdc53a97218cef417dcc2e8bd

https://gerrit.wikimedia.org/r/961024

Change 961138 abandoned by DCausse:

[wikidata/query/deploy@master] flink: simplify parallelism setup

Reason:

hopefully won't be needed as we move away from this deployment process

https://gerrit.wikimedia.org/r/961138

Maintenance_bot removed a project: Patch-For-Review.Nov 10 2023, 3:10 PM

• dcausse moved this task from To Be Deployed to Needs Reporting on the Discovery-Search (Current work) board.Dec 4 2023, 4:11 PM

Gehel closed this task as Resolved.Dec 8 2023, 9:08 AM