Description
Some code related to running these tests is duplicated: setting up the benchmark suite, running the benchmark tests twice (once with Mocha assertions, once without), etc. This code can be factored out into function-schemata.
Desired behavior/Acceptance criteria (returned value, expected error, performance expectations, etc.)
- less code duplication between evaluator and orchestrator benchmark tests
Completion checklist
- Before closing this task, review one by one the checklist available here: https://www.mediawiki.org/wiki/Abstract_Wikipedia_team/Definition_of_Done#Back-end_Task/Bug_completion_checklist