Several people (e.g. T33221, though the original report asks for computer text-to-speech, and http://comments.gmane.org/gmane.org.wikimedia.wiktionary/1265) have requested a tool to simplify the workflow of recording the pronunciation of a word.
The basic idea is to provide a wizard flow for picking a word (which may be the page you're on), recording it, choosing a free license, then uploading it to Wikimedia Commons with the appropriate metadata.
T33221: Audio pronunciation: Automatic text-to-speech to convert IPA to sound
T22252: Support for WAV and AIFF by converting files to FLAC automatically.
T55074: Add component for PronunciationRecording MediaWiki extension