Page MenuHomePhabricator

Move tests from AWS to bare metal
Closed, ResolvedPublic

Description

Goal: we want to be able to find smaller performance regressions and be able trust the metrics. Today we use AWS but there we have problems with noisy neighbours that makes our metrics change over time and causing false alerts. Moving the tests to bare metal servers will help us to avoid the noisy neighbour problem and by therefor find smaller performance regressions that we can trust.

Event Timeline

During my vacation I moved my personal open source projects from running on a cloud provider to use Hetzner dedicated servers. The move was smooth and didn't take more time than one day or two. I also got rid off using S3, serving the data directly from one of our servers (that's something we also can do in the future). It's been running flawlessly since, however I haven't analyzed the stability in the metrics (I can see that it's better than the cloud provider I used to use but I haven't compared it with the numbers I got when running the POC earlier this year). Comparing those numbers would be a good start before we test out a providor.

We should checkout https://www.supplier.io and see if we can find a match.

Today I turned on all WebPageReplay tests that we run AWS so they also run on bare metal. I used the exact same configuration except that I hacked the start script on the bare metal server to change the Graphite reporting key, so it reports under baremetal. If this looks ok, I think this is a good first step, then we can move WebPageReplay tests to the bare metal server and turn off a couple of AWS servers.

Change 907715 had a related patch set uploaded (by Phedenskog; author: Phedenskog):

[performance/synthetic-monitoring-tests@master] Remove CPU throttling on mobile (moving to bare metal).

https://gerrit.wikimedia.org/r/907715

Change 907715 merged by jenkins-bot:

[performance/synthetic-monitoring-tests@master] Remove CPU throttling on mobile (moving to bare metal).

https://gerrit.wikimedia.org/r/907715

Change 907722 had a related patch set uploaded (by Phedenskog; author: Phedenskog):

[performance/synthetic-monitoring-tests@master] Remove config that now is default and remove emulated CPU throttle.

https://gerrit.wikimedia.org/r/907722

Change 907722 merged by jenkins-bot:

[performance/synthetic-monitoring-tests@master] Remove config that now is default and remove emulated CPU throttle.

https://gerrit.wikimedia.org/r/907722