As an engineer, I want a way to compare load test results so that I know whether a new code change makes a server slower or faster.
As part of this task, I want a proof of concept (POC) of such a comparison, which could then be applied to all model servers.
One possible approach (as described in the parent task):
- Load the CSV of previous results into a pandas DataFrame.
- Run the new load tests and join the new results with the old ones.
- Calculate the differences in latencies (or, even better, run a t-test) and report the results.
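
The steps above could be sketched roughly as follows. This is only a starting point for the POC, not a settled design. It assumes the load test CSV has one row per request with an `endpoint` column and a `latency_ms` column; those column names, the helper names `load_results` and `compare_runs`, the 0.05 threshold, and the sample numbers are all hypothetical. The t-test used here is Welch's variant (`scipy.stats.ttest_ind` with `equal_var=False`), chosen because the two runs may not have equal variance.

```python
import io

import pandas as pd
from scipy import stats


def load_results(csv_source) -> pd.DataFrame:
    """Load load-test results; assumes columns 'endpoint' and 'latency_ms'."""
    return pd.read_csv(csv_source)


def compare_runs(old: pd.DataFrame, new: pd.DataFrame, alpha: float = 0.05) -> pd.DataFrame:
    """Compare per-endpoint latencies of two runs (column names are assumptions)."""
    # Join the two runs into one frame, labelling each row with its run.
    combined = pd.concat(
        [old.assign(run="old"), new.assign(run="new")], ignore_index=True
    )

    rows = []
    for endpoint, group in combined.groupby("endpoint"):
        old_lat = group.loc[group["run"] == "old", "latency_ms"]
        new_lat = group.loc[group["run"] == "new", "latency_ms"]

        # Welch's t-test: does not assume equal variance between the runs.
        _, p_value = stats.ttest_ind(new_lat, old_lat, equal_var=False)

        rows.append(
            {
                "endpoint": endpoint,
                "old_mean_ms": old_lat.mean(),
                "new_mean_ms": new_lat.mean(),
                # Positive diff means the new run is slower.
                "diff_ms": new_lat.mean() - old_lat.mean(),
                "p_value": p_value,
                "significant": bool(p_value < alpha),
            }
        )
    return pd.DataFrame(rows)


# Made-up sample data, standing in for real CSV files.
old_csv = io.StringIO(
    "endpoint,latency_ms\n/predict,100\n/predict,102\n/predict,98\n/predict,101\n/predict,99\n"
)
new_csv = io.StringIO(
    "endpoint,latency_ms\n/predict,120\n/predict,122\n/predict,118\n/predict,121\n/predict,119\n"
)
report = compare_runs(load_results(old_csv), load_results(new_csv))
print(report.to_string(index=False))
```

Each endpoint needs at least two samples per run for the test to produce a p-value. Latency distributions are often skewed, so a rank-based test may be worth evaluating as an alternative during the POC.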