
Investigate using a cache store/restore system for package managers
Closed, ResolvedPublic

Description

From a discussion with OpenStack people: they have some jobs that download Linux distributions and are looking for a cache/mirroring solution. Their RFC (== spec) is at https://review.openstack.org/#/c/194477/ .

FWICT, that's a proposal around caching Nodepool images, not packages or other dependencies that various jobs require. In my mind, they are separate problems with only marginal overlap: CI base images will be almost completely homogeneous in our case, while dependent system/gem/pip/composer/npm packages vary widely from job to job.

Travis implements a user-/job-specific system that restores and caches specific directories before and after each job executes, storing the data in S3. We could implement something similar but it would require a reliable central store, and the whole setup seems a little 'brute force' to me.

Another possibility that @hashar and I discussed was to provide separate read-only caches for the specific packaging systems—read-only to protect against the corruption that might occur during concurrent updates. Each cache would augment the package manager's read-write destination within the workspace and be periodically updated to include new packages. The update process could be scheduled or triggered at the end of each job as long as we can reliably audit which packages were installed locally during execution.
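For illustration only (not how this ended up being implemented), here is a rough sketch of how a shared read-only cache could sit alongside a writable destination in the workspace for two of the package managers mentioned; the /srv/cache/... paths are hypothetical:

```
# RubyGems: search the shared read-only cache in addition to the workspace,
# but install any new gems into the workspace only
export GEM_PATH=/srv/cache/gems:"$WORKSPACE/gems"
export GEM_HOME="$WORKSPACE/gems"

# pip: prefer wheels from the shared read-only directory, fall back to PyPI,
# and keep pip's own download cache inside the workspace
pip install --find-links=/srv/cache/wheels \
    --cache-dir="$WORKSPACE/pip-cache" -r requirements.txt
```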

This was discussed in https://tools.wmflabs.org/meetbot/wikimedia-office/2015/wikimedia-office.2015-10-06-13.59.html; see point 5.

With tar and s3cmd, that Travis-style store/restore would probably be a shell one-liner. If we can't get a Swift or Ceph object store for labs from ops, we could use rsync to an integration instance.
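A minimal sketch of that one-liner (the bucket name and cache path are made up):

```
# Store the workspace cache after a job run
tar -czf cache.tar.gz -C "$WORKSPACE" cache \
  && s3cmd put cache.tar.gz "s3://ci-job-caches/${JOB_NAME}.tar.gz"

# Restore it before the next run; tolerate a missing cache on the first run
s3cmd get --force "s3://ci-job-caches/${JOB_NAME}.tar.gz" cache.tar.gz \
  && tar -xzf cache.tar.gz -C "$WORKSPACE" || true
```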

If we go this way, to maintain isolation we need to make sure that only Nodepool instances running gate-and-submit or post-merge jobs have permission to update the caches, not instances running test/check jobs.
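If the cache ends up on an rsync server, one hypothetical way to enforce that split is two rsyncd modules over the same directory: a read-only one for everyone, and a writable one whose hosts allow list only contains the gate-and-submit/post-merge instances (module names and addresses below are invented):

```
# /etc/rsyncd.conf — sketch only
[jobcache]
    path = /srv/jobcache
    read only = yes

[jobcache-upload]
    path = /srv/jobcache
    read only = no
    # restrict writes to the gate-and-submit / post-merge Nodepool instances
    hosts allow = 10.68.16.0/24
```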

Event Timeline

hashar raised the priority of this task from to Medium.
hashar updated the task description.

Namespacing the caches per repo/branch would cause us to have a lot of different caches, which would consume a good chunk of disk space on the central repository. Not sure how much of a problem that would be.

As long as we trust our +2ers, using one cache per type (i.e. one for composer, a different one for gem) should be fine. Another solution is to use content addressing to deduplicate.
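As a rough sketch of the content-addressing idea (all paths and the *.gem pattern are hypothetical), packages would be stored once under their checksum and exposed by name through links:

```
# Deduplicate cached packages by content hash
CAS=/srv/jobcache/cas
mkdir -p "$CAS" /srv/jobcache/gems
for pkg in "$WORKSPACE"/cache/*.gem; do
    sum=$(sha256sum "$pkg" | cut -d' ' -f1)
    # keep a single copy of the blob, keyed by its hash
    [ -e "$CAS/$sum" ] || cp "$pkg" "$CAS/$sum"
    # expose it under its human-readable name
    ln -sf "$CAS/$sum" "/srv/jobcache/gems/$(basename "$pkg")"
done
```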

Change 253322 had a related patch set uploaded (by Hashar):
contint: rsync server to hold jobs caches

https://gerrit.wikimedia.org/r/253322

See the proof of concept https://gerrit.wikimedia.org/r/264327 based on rsync and a central rsync cache.
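The proof of concept is the authoritative version; roughly, an rsync-based store/restore has this shape (the server name and module below are placeholders, and $JOB_NAME is assumed to come from Jenkins):

```
# Restore the per-job cache before the build; tolerate a missing cache
rsync -a "rsync://cache.integration.example/jobcache/${JOB_NAME}/" \
    "$WORKSPACE/cache/" || true

# Push the cache back after a successful gate-and-submit / post-merge run
rsync -a --delete "$WORKSPACE/cache/" \
    "rsync://cache.integration.example/jobcache/${JOB_NAME}/"
```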

Let's follow up on the parent task T112560; not much is left to investigate here.

hashar claimed this task.

This has been implemented and tracked in the parent task.

Did a first pass using a cache store/restore system based on rsync. Investigated as part of T116017.

Change 253322 merged by Filippo Giunchedi:
contint: rsync server to hold jobs caches

https://gerrit.wikimedia.org/r/253322

Had the left-over Puppet patch merged via Puppet SWAT.