I'm done with the coding of using Mann Whitney and its ready to deploy:
- Deploy the new version
- Increase the number of runs so we do 21 to make sure it's enough to get statistical significance.
- Create new alerts that alerts when we have a regression on a couple of metrics.
If it turns out well, change the other alerts and document the new setup.