We should define a format for a test suite that evaluation engines can be run against.
At its core, each test case is an input paired with its expected output.
The suite can then be used for evaluation engines other than our own, but we should also hook it up to our own evaluation engine, i.e. the function-orchestrator, and use it to check that engine.
For now, this should live in function-schemata.
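
To make the idea of "inputs and expected outputs" concrete, here is a minimal sketch in Python of what such a suite and an engine-agnostic runner might look like. Everything in it is an assumption for illustration: the field names (name, input, expected), the shape of the input, and the toy_engine stand-in are hypothetical and are not the actual function-schemata format or the function-orchestrator API.

```python
import json
from typing import Any, Callable

# Hypothetical suite format: a JSON list of cases, each with a name,
# an input, and the expected output. These field names are assumptions.
SUITE_JSON = """
[
  {"name": "add two numbers",
   "input": {"op": "add", "args": [1, 2]},
   "expected": 3},
  {"name": "concatenate strings",
   "input": {"op": "concat", "args": ["a", "b"]},
   "expected": "ab"}
]
"""


def run_suite(suite: list[dict[str, Any]],
              evaluate: Callable[[Any], Any]) -> list[str]:
    """Run every case through `evaluate`; return the names of failing cases."""
    failures = []
    for case in suite:
        actual = evaluate(case["input"])
        if actual != case["expected"]:
            failures.append(case["name"])
    return failures


def toy_engine(call: dict[str, Any]) -> Any:
    """Stand-in evaluation engine (hypothetical, not the function-orchestrator)."""
    if call["op"] == "add":
        return sum(call["args"])
    if call["op"] == "concat":
        return "".join(call["args"])
    raise ValueError("unknown op: " + str(call["op"]))


suite = json.loads(SUITE_JSON)
failures = run_suite(suite, toy_engine)
print("failing cases:", failures)
```

Because the runner only needs a callable that maps an input to an output, the same suite file could in principle be pointed at any evaluation engine by wrapping that engine in such a callable.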