Existing setup
We have two test servers that rely on the same database, one of them displaying the state of wikidiff2 BEFORE any of our changes, and the second one with the MOST RECENT version of wikidiff2 (i.e. including our changes).
Task
Create a way we can (semi) automate testing, that ideally shows us
- cases where we did not expect a changed diff, but it was changed
- cases where we did expect a changed diff, but it was not changed
- ideally: cases where we did not receive exactly what we wanted
for specific thresholds.