Set up Varnish purging and caching for RESTBase end points
Closed, ResolvedPublic
Actions

Assigned To

Authored By

	• GWicke
	Aug 20 2015, 5:26 PM

Description

One of the main goals behind moving to a REST Content API is enabling caching of API responses across the board. This will help us scale our APIs to keep up with growing demand, and lowers latency for clients by leveraging our geo-distributed caching infrastructure.

RESTBase requests are already proxied through the regular text varnishes, but caching of responses is still disabled by setting headers to that effect. Before we can allow Varnish to cache specific GET responses, we'll need to set up Varnish purging for those end points. The production logic for this lives in HTCPPurge. The UDP logic itself doesn't look too hard. The config data referenced there is simple enough (one IP address & one TTL) to let us manually replicate it in the RESTBase config for now.

Related Objects

Mentioned In: T113591: Enable caching for the Mobile Content Service's RESTBase public endpoints
T126687: RFC: Publish all resource changes to a single topic
Mentioned Here: T126571: Emit change events from RESTBase
T127387: Split slash decoding from general percent normalization in Varnish VCL
T126687: RFC: Publish all resource changes to a single topic
T117933: Change propagation service, phase 1

Event Timeline

• GWicke created this task.Aug 20 2015, 5:26 PM

• GWicke raised the priority of this task from to Needs Triage.

• GWicke updated the task description. (Show Details)

• GWicke added subscribers: • GWicke, BBlack.

Restricted Application added a subscriber: Aklapper. · View Herald TranscriptAug 20 2015, 5:26 PM

• GWicke added projects: RESTBase, Services.Aug 20 2015, 5:26 PM

• GWicke set Security to None.

• GWicke edited subscribers, added: • Pchelolo, • mobrovac; removed: Aklapper.

Krenair subscribed.Aug 20 2015, 5:26 PM

We just spoke about this on IRC. It turns out that my fears about the config complexity were unfounded:

$wgHTCPRouting = array( '' => array( 'host' => '239.128.0.112', 'port' => 4827));
$wgHTCPMulticastTTL = 8;

So, I think for now we can add the same data in the RESTBase config & port the HTCPPurge method over to JS. It's probably worth making this a separate npm module, so that other services can potentially use it as well. Perhaps call it 'htcp-purge'?

• GWicke assigned this task to • Pchelolo.Aug 20 2015, 5:54 PM

• GWicke renamed this task from Figure out a way to purge Varnishes from RESTBase to Set up Varnish purging and caching for RESTBase end points.Aug 21 2015, 5:50 PM

• GWicke added a subscriber: mark.

• GWicke updated the task description. (Show Details)Aug 21 2015, 5:59 PM

• GWicke updated the task description. (Show Details)

• GWicke added a subscriber: • bearND.Aug 21 2015, 6:10 PM

• Mholloway subscribed.Aug 21 2015, 6:16 PM

• GWicke updated the task description. (Show Details)Aug 21 2015, 8:13 PM

• GWicke updated the task description. (Show Details)

• GWicke updated the task description. (Show Details)Aug 21 2015, 8:15 PM

• GWicke updated the task description. (Show Details)Aug 21 2015, 8:17 PM

The module for purging varnishes lives here: https://github.com/wikimedia/htcp-purge

• GWicke triaged this task as Medium priority.Jan 25 2016, 9:18 PM

As request volumes for HTML resources (especially without an explicit revision) grow, it would be nice to start purging & caching those.

We should figure out whether it makes more sense to do this in RESTBase, or the change propagation service (T117933). I have gone back & forth on this before, but am recently leaning towards RESTBase, mainly so that RESTBase is in full control of its public URL layout & cache setup. @Pchelolo, @mobrovac, @Eevans: I'm curious what you think on this.

As a follow-up, we have decided to purge URLs directly from RESTBase for the time being (cf PR 515). The long-term plan is to leverage the Event-Platform system to expose all resource changes (cf T126687: RFC: Publish all resource changes to a single topic) which would be fed to a purging service.

• GWicke mentioned this in T126687: RFC: Publish all resource changes to a single topic.Feb 26 2016, 8:12 PM

Finally, this is done. RESTBase endpoints are cached and actively purged where appropriate. Some more improvements still to be done, but those are covered by T126571 and T127387

• Pchelolo mentioned this in T113591: Enable caching for the Mobile Content Service's RESTBase public endpoints.Mar 3 2016, 8:38 PM

Set up Varnish purging and caching for RESTBase end pointsClosed, ResolvedPublicActions

Description

Related Objects

Event Timeline

Set up Varnish purging and caching for RESTBase end points
Closed, ResolvedPublic
Actions