Implement parallel connection limit for querying ORES
Closed, ResolvedPublic
Actions

Assigned To

Authored By

	Halfak
	Oct 24 2016, 6:43 PM

Description

4 simultaneous connections per client (IP + User-Agent) should be good.

Relatedly, this is done in nginx for dump downloads. See https://phabricator.wikimedia.org/diffusion/OPUP/browse/production/modules/dumps/templates/nginx.dumps.conf.erb

Related Objects
Search...

Status	Assigned	Task
Resolved	Ladsgroup	T148997 Implement parallel connection limit for querying ORES
Resolved	Ladsgroup	T160692 Use poolcounter to limit number of connections to ores uwsgi
Resolved	Ladsgroup	T201823 Implement PoolCounter support in ORES
Resolved	akosiaris	T203465 Site: 4 VM request for ORES poolcounter
Resolved	akosiaris	T201824 Spin up a new poolcounter node for ores
Resolved	Ladsgroup	T201825 Test poolcounter support for ores in beta cluster
Resolved	Ladsgroup	T201826 Implement support for whitelisting and proxy requests for poolcounter in ORES
Declined	Ladsgroup	T204897 Add Wiki Education Dashboard and Programs & Events Dashboard to ORES connection whitelist
Resolved	Tgr	T161029 Forward request data in proxied Action API modules

Event Timeline

Halfak created this task.Oct 24 2016, 6:43 PM

Restricted Application added a subscriber: Aklapper. · View Herald TranscriptOct 24 2016, 6:43 PM

Halfak edited projects, added Machine-Learning-Team; removed Machine-Learning-Team (Active Tasks).Oct 27 2016, 2:36 PM

Halfak added subscribers: awight, ArielGlenn.

Halfak triaged this task as Medium priority.Oct 27 2016, 2:42 PM

Halfak moved this task from Unsorted to Maintenance/cleanup on the Machine-Learning-Team board.

Presumably internal IPs should be exempt from this, and the API should set XFF headers when proxying requests?

Yeah. My thoughts too.

Tgr mentioned this in T137962: [Spec] Tracking and blocking specific IP/user-agent combinations .Feb 13 2017, 8:24 PM

Per T137962#2447946, "Generic ratelimiting (e.g. per client IP) and other similar protection measures for these clusters has been pushed off for post-varnish4" so that's not an option right now.

If we rely on some naive frontend implementation (e.g. ngx_http_limit_conn_module) on the web worker or MW API nodes then we end up with an N connections per node limit instead of a global one. Are IPs assigned to nodes via some deterministic hashing, or just round-robin? In the first case, what would happen when nodes are added / removed? (ie. are we using consistent hashing?)

Maybe it could be done in the ORES load balancer (if it's based on proxying and not DNS lookups). That wouldn't be a proper parallel connection limit but a limit on number of connections initiated per time unit, but under normal operation ORES does not have long-lived connections so that should be close enough.

Alternatively the throttling could be implemented in the app code, using some shared resource such as Redis. That's a probably a performance hit, even if a very small one, so it's doable but less ideal than using some existing varnish/nginx/whatever functionality.

Halfak added a subtask: T160692: Use poolcounter to limit number of connections to ores uwsgi.Mar 20 2017, 4:04 PM

Tgr added a subtask: T161029: Forward request data in proxied Action API modules.Mar 21 2017, 8:08 PM

Tgr mentioned this in T163687: Re-enable ORES data in action API.Apr 24 2017, 1:07 PM

Tgr closed subtask T161029: Forward request data in proxied Action API modules as Resolved.May 25 2017, 9:33 PM

Ladsgroup closed subtask T160692: Use poolcounter to limit number of connections to ores uwsgi as Resolved.Nov 1 2018, 8:00 PM

Bah, this is doneeeeee

Restricted Application added a project: User-Ladsgroup. · View Herald TranscriptNov 7 2018, 6:04 PM

Ladsgroup closed this task as Resolved.Nov 7 2018, 6:05 PM

Ladsgroup moved this task from Parked to Completed on the Machine-Learning-Team (Active Tasks) board.

Implement parallel connection limit for querying ORESClosed, ResolvedPublicActions

Description

Related ObjectsSearch...

Event Timeline

Implement parallel connection limit for querying ORES
Closed, ResolvedPublic
Actions

Related Objects
Search...