
Evaluate number of runs for WebPageTest to get more stable metrics
Closed, Resolved, Public

Description

@Krinkle added the change for increasing the number of runs to see if we can get more stable metrics for WebPageTest. We haven't done any testing of this since the beginning, when we set up WebPageTest, and since then we have changed the size of the instance, changed URLs, and added timeline and other data collection.

I've added some tests: this morning I changed the number of runs from 5 to 11 on my test instance and also created a new one that does 21 runs per URL (yes, twenty-one). I'll add the graphs here tonight; we can keep it running for a couple of days to see how it's doing and then make a decision.

Event Timeline

OK, I got some numbers. I compared only SpeedIndex and calculated the standard deviation over 30-35 runs collected during a little more than a day. The comparison isn't perfect, however, since I used sitespeed.io on Digital Ocean for the 21-run setup and our regular WebPageTest instance for the 5-run setup.

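| Runs | URL      | Avg  | Stdev | Relative stdev |
| ---- | -------- | ---- | ----- | -------------- |
| 21   | Obama    | 1037 | 20    | 1.9%           |
| 21   | Facebook | 1114 | 28    | 2.5%           |
| 21   | Sweden   | 1162 | 52    | 4.4%           |
| 5    | Obama    | 1739 | 116   | 6.5%           |
| 5    | Facebook | 1664 | 44    | 2.6%           |
| 5    | Sweden   | 1726 | 68    | 3.9%           |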
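
For reference, the relative standard deviation in the table is the standard deviation divided by the average, expressed as a percentage. The sketch below shows one way to compute it in Python. The sample values and the function name `relative_stdev` are made up for illustration, and it assumes the sample (n - 1) standard deviation; the comment above does not say which variant was actually used.

```python
from statistics import mean, stdev


def relative_stdev(samples):
    """Return (average, sample stdev, stdev as a percentage of the average)."""
    avg = mean(samples)
    sd = stdev(samples)  # sample standard deviation (n - 1); an assumption
    return avg, sd, 100.0 * sd / avg


# Hypothetical SpeedIndex values, not taken from the real test runs.
samples = [1000, 1020, 1040, 1060, 1080]
avg, sd, rel = relative_stdev(samples)
print(f"avg={avg:.0f} stdev={sd:.1f} relative={rel:.1f}%")

# Cross-check against one table row (21 runs, Obama): avg 1037, stdev 20.
obama_21_rel = 100.0 * 20 / 1037
print(f"Obama, 21 runs: {obama_21_rel:.1f}%")
```

Because the averages differ a lot between the two setups (different tools and hosts), the relative figure is the fairer one to compare across rows.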

In these runs we only see a real win for the Obama page. Testing with WebPageTest is more work, but I think we should do it so we get numbers that compare apples with apples.

Maybe nothing here perfectly matches this issue, but it seems like we have started to see a really big difference in mwLoadStart (1 second). Going back in time, I can see that we have had it for at least a month, maybe longer.

Screen Shot 2017-03-09 at 8.03.45 PM.png (606×2 px, 117 KB)

Peter triaged this task as Medium priority. May 30 2017, 9:40 AM

Before doing any more testing to find the optimal number of runs, I think we should investigate the large difference in mwLoadStart.

Firefox (the difference is 2 seconds):

Screen Shot 2017-03-13 at 8.41.33 AM.png (620×2 px, 155 KB)

Chrome (the difference is 1 second):

Screen Shot 2017-03-13 at 8.42.26 AM.png (616×2 px, 170 KB)

Internet Explorer (using HTTP/1, the difference is less than 100 ms):

Screen Shot 2017-03-13 at 8.42.48 AM.png (610×2 px, 133 KB)

Moving to proxy in the future.