WM have requested this. It should be possible to add comments that make it clearer what the code is doing.
What we were looking for was a short header explaining which interface the code is adapted for, along with any version restrictions. Any built-in logic, such as pre-processing, should also be explained and documented.
Some general code cleanup (removing commented-out code, adding docstrings, etc.) will also make the code easier to read and review for Wikimedia Foundation staff and other external parties.
The latter applies to the rest of wikispeech_server as well.
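
As a rough illustration of the kind of header and docstrings requested, a sketch in Python might look like the following. The interface name, the version restriction, and the pre-processing step are all hypothetical placeholders chosen for the example; none of them are taken from the actual wikispeech_server code.

```python
"""Adapter for the ExampleTTS interface (hypothetical name).

Interface:  ExampleTTS synthesis endpoint (placeholder, not a real service)
Versions:   assumed to work with ExampleTTS 1.x only (placeholder restriction)

Built-in logic:
    Pre-processing: runs of whitespace in the input text are collapsed to a
    single space, and leading/trailing whitespace is removed, before the text
    is passed on to the synthesizer.
"""

import re


def preprocess(text):
    """Return `text` with whitespace normalised.

    Illustrative only: this stands in for whatever pre-processing an
    adapter performs, so that the step is documented where it happens.
    """
    # Collapse any run of whitespace (spaces, tabs, newlines) to one space.
    return re.sub(r"\s+", " ", text).strip()
```

The point of the sketch is the placement: the module-level header states the interface and version restrictions once, and each function with built-in logic carries its own docstring.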