Per T314789#8140256, we'll want to store/serve only MP3 in Phonos rather than WAV. This format is widely supported and has a smaller storage footprint than lossless WAV.
Google already offers MP3 output. For the other two engines, we'll apparently need to use LAME (already installed on prod) to convert to MP3.
Acceptance criteria
- The user should only be served audio in MP3 format
- (Implementation detail) Ideally only one shell command is run, to avoid the unnecessary overhead of communicating with the remote Shellbox server more than once
- This is only for the Google and eSpeak engines. Larynx apparently has a unique issue, which is being tracked at T319242.
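
To illustrate the "one shell command" criterion, here is a minimal sketch of piping eSpeak's WAV output straight into LAME so that only a single command is sent to Shellbox. It is written in Python for illustration only and is not Phonos code; the function name `build_espeak_mp3_command` and the default voice are hypothetical, and it assumes the `espeak` and `lame` binaries are on the path and that the command runs through a shell that supports pipes.

```python
import shlex


def build_espeak_mp3_command(text: str, voice: str = "en") -> str:
    """Return one shell pipeline that synthesizes `text` and encodes it as MP3.

    Hypothetical helper for illustration; not part of Phonos.
    """
    # espeak --stdout writes WAV data to standard output instead of playing it.
    espeak = ["espeak", "-v", voice, "--stdout", text]
    # lame reads from stdin ("-") and writes the MP3 to stdout ("-").
    lame = ["lame", "--quiet", "-", "-"]
    # shlex.join quotes each argument, so the text cannot break out of the command.
    return " | ".join(shlex.join(part) for part in (espeak, lame))


if __name__ == "__main__":
    print(build_espeak_mp3_command("hello world"))
```

Because both steps are joined by a pipe, the caller makes one round trip and receives MP3 bytes on stdout, with no intermediate WAV file to store or clean up.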