Tune thread for osm2pgsql / postgres max connections for Maps
Open, LowPublic
Actions

Assigned To

None

Authored By

	Gehel
	Jun 7 2016, 5:12 PM

Description

Discussion with @Pnorman: it seems that a good starting point for number of threads to use in osm2pgsql is nb CPU/2. The number of connections to postgres is nb threads x nb tables, which will need to be adapted from our current 120 max connections, taking into account the tilerator traffic.

Details

	Subject	Repo	Branch	Lines +/-
	WIP - Tune thread for osm2pgsql / postgres max connections for Maps	operations/puppet	production	+11 -2

Customize query in gerrit

Related Objects
Search...

		Status	Subtype	Assigned	Task
		Declined		None	T137616 Epic: cultivating the Maps garden
		Open		None	T137229 Tune thread for osm2pgsql / postgres max connections for Maps

Event Timeline

Gehel created this task.Jun 7 2016, 5:12 PM

Restricted Application added subscribers: Zppix, Aklapper. · View Herald TranscriptJun 7 2016, 5:12 PM

I suspect that Tilerator will have one connection per worker. Eventually, I would also like to have Kartotherian to use Postgres directly to get some data, so that number will tripple ( tilerator's cpucount/2 + kartotherian's cpucount).

I would expect the number of threads and the number of worker to have no direct relation to each other. Especially in node where IO should be async...

I had a quick look in the code and it seems that we are using pg.js, that seems to have an embbeded connection pool (node.js is really not my cup of tea yet). I'm not entirely sure how it does (or does not) make sense to pool DB connections.

We need measures... as always...

Gehel added a parent task: T137616: Epic: cultivating the Maps garden.Jun 11 2016, 6:29 AM

Gehel removed a parent task: T133744: Epic: switch Maps to production status.

Yurik moved this task from All map-related tasks to Tilerator on the Maps board.Jun 12 2016, 3:12 AM

Pnorman unsubscribed.Jun 12 2016, 3:56 AM

Gehel mentioned this in rOPUP8257606b5daf: WIP - Tune thread for osm2pgsql / postgres max connections for Maps.Jun 17 2016, 6:06 PM

Yurik moved this task from Tilerator to Maps-data on the Maps board.Jun 26 2016, 8:14 PM

Yurik added a project: Maps (Maps-data).Jun 26 2016, 8:24 PM

For import I generally recommend osm2pgsql uses num CPU threads on machines with up to 8 threads, unless there's something else running at the same time. Past 8 threads there's little data available. If you have enough RAM and are doing --slim import without --drop, most of the time is spent on building a large index, which can't be parallized.

For updates, it's a tradeoff for update speed vs load on the system. Lots of people run single threaded to keep the load down, or with just 2 threads.

Yurik removed a project: Maps.Dec 15 2016, 4:32 AM

Yurik added a subscriber: Pnorman.

• Jhernandez removed a project: Discovery-ARCHIVED.Jul 10 2018, 4:39 PM

• Mholloway lowered the priority of this task from Medium to Low.Jul 31 2018, 4:35 PM

• Phabricator_maintenance moved this task from Backlog to Acknowledged on the SRE board.Jan 26 2019, 8:42 PM

Change 293320 abandoned by Gehel:
WIP - Tune thread for osm2pgsql / postgres max connections for Maps

https://gerrit.wikimedia.org/r/293320

MSantos moved this task from Backlog to OSM on the Maps (Maps-data) board.Apr 1 2020, 1:29 PM

LSobanski added a project: serviceops.Jan 3 2023, 2:03 PM

Clement_Goubert moved this task from Incoming 🐫 to 🛎 Services & Oids on the serviceops board.Jan 10 2023, 12:54 PM

Pnorman unsubscribed.Oct 19 2023, 7:22 PM

Tune thread for osm2pgsql / postgres max connections for MapsOpen, LowPublicActions

Description

Details

Related ObjectsSearch...

Event Timeline

Tune thread for osm2pgsql / postgres max connections for Maps
Open, LowPublic
Actions

Related Objects
Search...