
Set up WebPageTest for synthetic testing
Closed, ResolvedPublic

Description

Documentation: https://wikitech.wikimedia.org/wiki/WebPageTest

Background

Today we collect performance metrics using RUM. That is super and helps us keep track of performance trends. Running our own synthetic testing (automatically testing pages in browsers) will help us find performance problems related to specific browsers, giving us better instruments when analyzing (and talking about) performance: HAR files, SpeedIndex (currently the best way to show that the above-the-fold content has loaded) & videos.

What makes WebPageTest especially good is that it's open-source and can handle Internet Explorer, Firefox, Chrome & Safari.

Why our own instance?

There's a public instance of WebPageTest where the limit is 200 page views per day. If we test each page nine times with both a first and a repeat view (18 page views per URL), we can only test about 10 pages once a day. That is too low. Running our own instance also removes the public instance's limit of 9 runs per URL.

Setup our own instance(s)

WebPageTest consists of a web server (the main entry point) that can run on Linux/Windows and test agents (which actually run the browser tests) that only run on Windows.

Setting up WebPageTest instances can be quite a lot of work, even though the list of what needs to be done is not too long (https://sites.google.com/a/webpagetest.org/docs/private-instances). WebPageTest is known for its lacking/outdated documentation.

However, there are ready-made AMIs on Amazon that we could use. That will save us a lot of setup time and will also add automatic up/down scaling of agents. By default, an agent that hasn't been used in one hour will be shut down. It will also add the ability to run agents from different locations, something we can use in the future.

What kind of data will WebPageTest collect?

All the metrics we collect can be public (there are no secrets there; anyone who wants to can collect them). It would be great if the instance could be publicly accessible. I think we should aim for that in the future, but let's first set it up so it works. We can run the instance headless (no GUI for running tests) and use the API with an API key. That way we can automatically run the tests we want, and the results will be public (if you know the URL).
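
As a sketch of what triggering a run through the API could look like with the WebPageTest Node.js API wrapper (the hostname, API key and location below are placeholders, not our real setup):

  const WebPageTest = require('webpagetest');

  // Placeholder hostname and key for our private instance.
  const wpt = new WebPageTest('wpt.example.org', 'API_KEY');

  wpt.runTest('https://en.wikipedia.org/wiki/Barack_Obama', {
    location: 'us-east-1:Chrome', // agent location and browser
    runs: 3,                      // test the URL three times
    firstViewOnly: false,         // collect the repeat view as well
    pollResults: 5                // poll for the result every 5 seconds
  }, function (err, result) {
    if (err) throw err;
    console.log(result.data.median.firstView.SpeedIndex);
  });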

Here's an example of a WebPageTest run using Chrome for https://en.wikipedia.org/wiki/Barack_Obama:
http://www.webpagetest.org/result/150819_W0_19D3/

SPDY problems

Until we change to HTTP2, we will not have everything exactly how we want it with SPDY across the different browsers.

It's like this: WebPageTest has SPDY support for Chrome, meaning it will use SPDY and all the metrics will be right, except for the sizes of individual objects in the HAR file. That doesn't matter so much for us right now, but if we also want to pick up asset sizes from the run, we need a workaround (and don't worry, there is one).

Firefox uses SPDY, but the HAR/waterfall graphs aren't generated because there's no SPDY decoder implemented for Firefox. That will be there when we support HTTP2.

Internet Explorer 11 doesn't support SPDY on older versions of Windows (and that's what WPT uses).

The Safari version running on WPT is roughly 7, meaning it doesn't support SPDY.

What we need to do on a high level

  • Set up our own instance of WebPageTest (security etc.) and mount the logs dir to EBS (how do we do that?)
  • Automate running/triggering tests on WebPageTest (we can use sitespeed.io or the WebPageTest nodejs API wrapper)
  • Define a couple of URLs to start with. I think it's good to keep the list small in the beginning. Talked with @ori: logged in/anonymous users and a couple of pages should do as a start. I think it's important to keep it as simple as possible at the start, just to get something up and running.
  • Define browsers and connectivity. We should use the latest Chrome/Firefox and discuss what to do with Internet Explorer/Safari until the WPT versions use SPDY or we switch to HTTP2. We should also run a browser that will not support SPDY/HTTP2 so we keep track of that too.
  • Decide how many times we want to test each URL.
  • Push the data to Graphite (see the sketch after this list).
  • Find an easy way to map metrics in Graphite to runs in WebPageTest (so that from a specific metric we can easily look up the run in our WebPageTest instance).
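
A minimal sketch of the Graphite step, assuming the plaintext protocol on Graphite's standard port 2003 (the hostname and the exact metric path are made up for illustration):

  const net = require('net');

  // Graphite's plaintext protocol: "<metric path> <value> <unix timestamp>\n"
  function sendToGraphite(path, value) {
    const socket = net.createConnection(2003, 'graphite.example.org', function () {
      socket.end(path + ' ' + value + ' ' + Math.floor(Date.now() / 1000) + '\n');
    });
  }

  sendToGraphite('webpagetest.enwiki.Barack_Obama.anonymous.chrome.firstView.SpeedIndex', 1808);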


Event Timeline

Peter raised the priority of this task to Needs Triage.
Peter updated the task description.
Peter added a project: Performance-Team.
Peter added subscribers: Peter, ori.
ori triaged this task as Medium priority. Aug 20 2015, 12:25 AM
ori updated the task description.
ori set Security to None.
ori moved this task from Inbox, needs triage to Doing (old) on the Performance-Team board.
ori renamed this task from "Setup WebPageTest for synthetic testing" to "Set up WebPageTest for synthetic testing". Aug 20 2015, 6:25 PM
Peter updated the task description.

It would be nice to define how long we will keep the test results, using the auto-deletion feature of S3.
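
A sketch of what such a rule could look like with the AWS SDK for JavaScript (the bucket name, prefix and 30-day retention are made-up values, just to show the shape):

  const AWS = require('aws-sdk');
  const s3 = new AWS.S3();

  // Automatically expire stored test results after 30 days (made-up value).
  s3.putBucketLifecycleConfiguration({
    Bucket: 'wpt-results-example',
    LifecycleConfiguration: {
      Rules: [{
        ID: 'expire-old-wpt-results',
        Status: 'Enabled',
        Filter: { Prefix: 'results/' },
        Expiration: { Days: 30 }
      }]
    }
  }, function (err) {
    if (err) throw err;
  });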

The current example Jenkins job @Peter and I set up today at https://integration.wikimedia.org/ci/job/performance-webpagetest/10/console produces the following statsv URL:

//www.wikimedia.org/beacon/statsv?webpagetest.enwiki.Facebook.anonymous.ie.firstView.SpeedIndex=1808ms&webpagetest.enwiki.Facebook.anonymous.ie.firstView.render=1711ms&webpagetest.enwiki.Facebook.anonymous.ie.firstView.TTFB=344ms&webpagetest.enwiki.Facebook.anonymous.ie.firstView.fullyLoaded=7639ms&webpagetest.enwiki.Facebook.anonymous.ie.firstView.mwLoadStart=1678ms&webpagetest.enwiki.Facebook.anonymous.ie.firstView.mwLoadEnd=2937ms&webpagetest.enwiki.Facebook.anonymous.ie.repeatView.SpeedIndex=1684ms&webpagetest.enwiki.Facebook.anonymous.ie.repeatView.render=1098ms&webpagetest.enwiki.Facebook.anonymous.ie.repeatView.TTFB=237ms&webpagetest.enwiki.Facebook.anonymous.ie.repeatView.fullyLoaded=5887ms&webpagetest.enwiki.Facebook.anonymous.ie.repeatView.mwLoadStart=1024ms&webpagetest.enwiki.Facebook.anonymous.ie.repeatView.mwLoadEnd=5312ms

That's 853 characters (when expanded to https). So we'll need to keep URL size in mind (the current limit is 1000 characters) and potentially split the beacon into multiple requests.
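
A sketch of what the splitting could look like (the 1000-character limit and the statsv endpoint are from above; the batching helper itself is hypothetical):

  const MAX_URL_LENGTH = 1000;
  const BASE = 'https://www.wikimedia.org/beacon/statsv?';

  // Pack "name=value" pairs into as few beacon URLs as possible,
  // each staying under MAX_URL_LENGTH.
  function batchBeacons(params) {
    const urls = [];
    let current = [];
    for (const param of params) {
      const joined = current.concat(param).join('&');
      if (current.length > 0 && BASE.length + joined.length > MAX_URL_LENGTH) {
        urls.push(BASE + current.join('&'));
        current = [];
      }
      current.push(param);
    }
    if (current.length > 0) {
      urls.push(BASE + current.join('&'));
    }
    return urls;
  }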

Peter raised the priority of this task from Medium to High. Oct 5 2015, 11:13 AM

Yes, I think so, but I don't follow why we don't get a straight line. This isn't perfect right now because things get cached on the first hit after we log in (the redirect). To be sure we know whether the size changes for logged-in users, we should measure the whole login step (and not the next access, as we do today). Let me add a task to check out what values we will get then. It would be nice to have a number that is easy to alert on.

Screen Shot 2015-11-07 at 8.12.04 PM.png (518×578 px, 50 KB)

It turns out the bulk of the JavaScript code was loaded outside the critical path, via mw.loader.load(). So the WPT run could fail to pick it up if it terminates too early.

We should change when it terminates then; it's configurable. I'll look into what the default is.
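
For reference, WebPageTest's runtest.php exposes knobs for this: a web10 flag (stop at document complete) and a time parameter (minimum test duration in seconds). A hedged sketch with placeholder hostname and values:

  // Keep recording past the load event so late mw.loader.load()
  // requests are captured. web10=0: don't stop at document complete;
  // time=15: record for at least 15 seconds.
  const testUrl = 'http://wpt.example.org/runtest.php' +
    '?url=' + encodeURIComponent('https://en.wikipedia.org/wiki/Barack_Obama') +
    '&web10=0' +
    '&time=15' +
    '&f=json&k=API_KEY';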

Re https://lists.wikimedia.org/pipermail/wikitech-l/2015-December/084257.html, a big thank you to everyone involved: the new dashboard is great (snapshot for the archives: F3058111).

There was a small scare about a sudden increase in the SpeedIndex value across the board: https://grafana.wikimedia.org/dashboard/db/webpagetest But it was entirely explained by the fundraising banner, which doesn't appear immediately on page load.

I'll note that this is not an artificial performance degradation but a very real one, which we should measure correctly in order to estimate the cost of fundraising. I'm very happy that the speed index is able to measure the degradation caused by centralnotice banners, I'd call it a huge bonus benefit.

The next step would be to either find a way to measure the real-world, in-the-field SpeedIndex, or make WebPageTest simulate average visit behaviour (e.g. X pages opened for each of Y visits in a month over Z wikis) to try to take into account the banner-hiding cookies etc.

@Nemo_bis there's RUMSpeedIndex, which can be used to calculate the SpeedIndex using JavaScript, but we would be shipping a lot of extra bytes to do it, and the values aren't as accurate as when doing it with a video.

However: when I tried it out before, the metrics looked really good for some sites and not so good for others. I haven't spent any time evaluating whether it would work for us.
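
For the record, a minimal sketch of what using it in the page could look like (it assumes rum-speedindex.js is already shipped to the page, and the statsv metric name is made up):

  // RUMSpeedIndex() estimates SpeedIndex in the browser from
  // Resource Timing and first-paint data; rum-speedindex.js has to be
  // shipped to the page first (the "extra bytes" mentioned above).
  window.addEventListener('load', function () {
    setTimeout(function () {
      var si = RUMSpeedIndex();
      // Report it back, e.g. via statsv (metric name is hypothetical).
      new Image().src = '//www.wikimedia.org/beacon/statsv?' +
        'webpagetest.rum.SpeedIndex=' + Math.round(si) + 'ms';
    }, 0);
  });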