Description
Coverage numbers for the evaluator repo are currently sketchy because the coverage is spread across various test suites.
Desired behavior/Acceptance criteria (returned value, expected error, performance expectations, etc.)
- ensure that each testing cut excludes irrelevant files from its calculation
- [BONUS] aggregate coverage calculation across all test suites
Completion checklist
- Before closing this task, review one by one the checklist available here: https://www.mediawiki.org/wiki/Abstract_Wikipedia_team/Definition_of_Done#Back-end_Task/Bug_completion_checklist