Page MenuHomePhabricator

Increased first visual change for both Chrome and Firefox on desktop
Closed, ResolvedPublic

Assigned To
Authored By
Peter
Mar 15 2021, 4:14 PM
Referenced Files
F34162245: before.har
Mar 15 2021, 8:09 PM
F34162244: after.har
Mar 15 2021, 8:09 PM
F34162150: Screenshot 2021-03-15 at 19.52.12.png
Mar 15 2021, 6:53 PM
F34162090: Screenshot 2021-03-15 at 19.05.59.png
Mar 15 2021, 6:11 PM
F34162097: Screenshot 2021-03-15 at 19.09.05.png
Mar 15 2021, 6:11 PM
F34162089: Screenshot 2021-03-15 at 19.06.11.png
Mar 15 2021, 6:11 PM
F34162099: Screenshot 2021-03-15 at 19.09.38.png
Mar 15 2021, 6:11 PM
F34161987: Screenshot 2021-03-15 at 18.31.38.png
Mar 15 2021, 5:41 PM
Subscribers

Description

I've briefly checked the TTFB and that seems to be stable so this looks like a legit alert, we should look into it:

https://grafana.wikimedia.org/d/000000057/webpagetest-drilldown?orgId=1&from=now-7d&to=now&var-base=sitespeed_io&var-path=webpagetest&var-group=en_wikipedia_org&var-page=_wiki_Facebook&var-browser=&var-location=us-east-chrome&var-connectivity=cable&var-view=firstView

Screenshot 2021-03-15 at 17.11.35.png (1×1 px, 619 KB)

I couldn't see anything with WebPageReplay or first paint in the navigation timing when I briefly looked, so need to dig deeper.

Event Timeline

I can't see anything obvious comparing a run before and after. It look like network-wise everything is a little slower.

No I cannot see anything either. I looked at commit log for WPT and no changes. I also run tests directly on sitespeed.io without WebPageReplay (on another AWS server) and looking at the graph for Obama it looks like it could be an increase(in first visual change) but those tests runs only every four hours so need more data:

Screenshot 2021-03-15 at 18.31.38.png (1×1 px, 597 KB)

Looking at the Facebook there's no diff:

Screenshot 2021-03-15 at 18.33.42.png (1×1 px, 737 KB)

What's interesting that for WebPageTest we can see the same for both Firefox and Chrome for all three URLs we test:

Screenshot 2021-03-15 at 18.35.43.png (1×2 px, 908 KB)

Screenshot 2021-03-15 at 18.35.12.png (1×2 px, 889 KB)

Screenshot 2021-03-15 at 18.34.53.png (1×2 px, 915 KB)

Screenshot 2021-03-15 at 18.35.03.png (1×2 px, 917 KB)

Screenshot 2021-03-15 at 18.35.21.png (1×2 px, 965 KB)

Screenshot 2021-03-15 at 18.35.34.png (1×2 px, 1004 KB)

This smells that it something going on on that server. We also do some test on the Russian Wikipedia and its the same there:

Screenshot 2021-03-15 at 18.37.24.png (1×2 px, 926 KB)

I wonder if we can see something on the emulated mobile tests on the same server?

Screenshot 2021-03-15 at 18.38.17.png (1×2 px, 1019 KB)

No nothing of first visual change, however if look at fully loaded it started to be slower at the same time as first visual change on desktop:
Screenshot 2021-03-15 at 18.39.58.png (1×2 px, 939 KB)

It looks like its just slower on that WebPageTest agent, let me try to just restart it and see if something happens.

I rebooted the agent through the AWS GUI and added a annotation in Grafana to be able track when it happened.

Looked at the total content size and pages had an increase around that time see:

Screenshot 2021-03-15 at 19.06.11.png (1×2 px, 954 KB)

Screenshot 2021-03-15 at 19.05.59.png (1×2 px, 955 KB)

Cannot see it in the other tools though.
Comparing the HARs it looks like this:

Screenshot 2021-03-15 at 19.09.05.png (556×2 px, 386 KB)

So it seems the change is in the images:

Screenshot 2021-03-15 at 19.09.38.png (1×2 px, 1 MB)

No that restart didn't help (the little blue line is the restart):

Screenshot 2021-03-15 at 19.52.12.png (1×1 px, 508 KB)

All images got bigger at once? Can you check the headers to see if the ones that changed were WebP before and aren't anymore?

Not all but some. Yes I checked for https://upload.wikimedia.org/wikipedia/en/thumb/f/f8/Thefacebook.png/220px-Thefacebook.png and that one went from webp to png.

Attaching the HAR files.

I looked at a few that got bigger and they were all webp before, png or jpeg after. I think that explains the size difference for all those images.

The cache host being hit is different, it used to be cp1082 where those images were hot enough to be webp, now it's cp1078 where they aren't (yet?).

I think it's simply because WebPageTest is now having its traffic routed to cp1078 where a lot of thumbnails aren't hot enough to be turned into webp. That might change if the images get hot enough on that host. If they don't, that's the new normal for WPT.

Yep, I can't understand why its changed first visual change? The last two runs the first change is back to normal for all URLs.

This solved itself on WebPageTest.