We should start running load tests, even basic ones, to answer some fundamental questions:
- Are our metrics sufficient and reliable? Do the dashboards give a complete picture of how traffic flows?
- How much traffic can a single pod take?
- We should test how scaling pods up and down works in Knative, and which settings are needed. Initially we could be generous and slightly oversize the minimum number of pods for wikis, then allow them to scale up to absorb traffic peaks.
- If rate limiting is already implemented on the API-Gateway, we should test it. Otherwise we should implement it and verify that it works.
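
As a starting point for the questions above, here is a minimal sketch of a basic load test in Python, using only the standard library. It is an illustration under assumptions, not a replacement for a proper load-testing tool: the names `http_get`, `run_load_test` and `min_pods`, the 30% headroom default, and the example numbers are all hypothetical. The request function is passed in as a callable so the harness can be tried against a fake before pointing it at a real endpoint.

```python
import math
import time
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict, Tuple
from urllib.request import urlopen


def http_get(url: str, timeout: float = 5.0) -> int:
    """Send one GET request and return the HTTP status code.

    urlopen raises HTTPError for 4xx/5xx responses (including 429 from a
    rate limiter), which run_load_test counts as an error.
    """
    with urlopen(url, timeout=timeout) as response:
        return response.status


def run_load_test(
    send: Callable[[], int], total_requests: int, concurrency: int
) -> Dict[str, float]:
    """Call send() total_requests times from a thread pool and summarize."""
    if total_requests <= 0 or concurrency <= 0:
        raise ValueError("total_requests and concurrency must be positive")

    def timed_call(_: int) -> Tuple[float, bool]:
        start = time.perf_counter()
        try:
            ok = 200 <= send() < 400
        except Exception:
            ok = False  # timeouts, connection errors, HTTP 4xx/5xx
        return time.perf_counter() - start, ok

    wall_start = time.perf_counter()
    with ThreadPoolExecutor(max_workers=concurrency) as pool:
        results = list(pool.map(timed_call, range(total_requests)))
    wall = time.perf_counter() - wall_start

    latencies = sorted(duration for duration, _ in results)
    p95_index = max(0, math.ceil(0.95 * len(latencies)) - 1)
    return {
        "requests": total_requests,
        "errors": sum(1 for _, ok in results if not ok),
        "throughput_rps": total_requests / wall,
        "p50_s": latencies[len(latencies) // 2],
        "p95_s": latencies[p95_index],
    }


def min_pods(peak_rps: float, rps_per_pod: float, headroom: float = 0.3) -> int:
    """Pods needed to serve peak_rps with spare capacity (headroom is assumed)."""
    return math.ceil(peak_rps * (1 + headroom) / rps_per_pod)


if __name__ == "__main__":
    # Hypothetical URL: replace with a real test endpoint before running.
    target = "http://localhost:8080/healthz"
    print(run_load_test(lambda: http_get(target), total_requests=200, concurrency=10))
    # Hypothetical numbers: 1000 rps peak, 120 rps sustained by one pod.
    print(min_pods(peak_rps=1000, rps_per_pod=120))
```

Running the harness against a single pod at increasing `concurrency` values, while watching the dashboards, gives a rough per-pod throughput. That figure can then feed `min_pods` to pick a deliberately generous minimum pod count, and a high error count at a given request rate is one way to check whether the API-Gateway rate limit kicks in.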