Map TTS response to page HTML
Closed, Resolved
Public
1.5 Estimated Story Points
Assigned To

Authored By

	Lokal_Profil
	Jul 12 2016, 2:45 PM

Description

In order to highlight the text being recited (T122158), skip by token (T140089, T133687) etc., ~~the time stamps~~ token information returned from the TTS must be mapped to the HTML on the page.

This HTML is passed to the TTS via the Cleaner, hence the need for a mapping.

Expected result:
Map the tokens in the TTS response to the words in the HTML.

Ideas:

Add markup for elements removed by the Cleaner and make TTS ignore these for audio generation but keep them in the response.
Make the Cleaner add a marker to the page HTML for any skipped elements. These can then be ignored when doing sequential mapping of tokens.
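
A minimal sketch of the second idea, assuming the page has already been split into text segments and that the Cleaner has flagged the ones it skipped. The `Segment` type, the `skipped` flag and `map_tokens` are hypothetical names for illustration, not part of the Wikispeech code.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """One run of page text (hypothetical structure, for illustration only)."""
    text: str
    skipped: bool  # assumed marker: True if the Cleaner removed this content


def map_tokens(segments, tokens):
    """Map each token to (token, segment index, offset), in document order.

    Segments flagged as skipped are never searched, so text that was not
    sent to the TTS cannot be matched by mistake.
    """
    positions = []
    seg_index, cursor = 0, 0
    for token in tokens:
        while seg_index < len(segments):
            segment = segments[seg_index]
            if not segment.skipped:
                found = segment.text.find(token, cursor)
                if found != -1:
                    positions.append((token, seg_index, found))
                    cursor = found + len(token)
                    break
            # Token is not in this segment: continue with the next one.
            seg_index += 1
            cursor = 0
        else:
            raise ValueError(f"token {token!r} not found in remaining text")
    return positions


segments = [
    Segment("The year ", skipped=False),
    Segment("[1]", skipped=True),  # e.g. a reference marker removed by the Cleaner
    Segment("was good", skipped=False),
]
positions = map_tokens(segments, ["The", "year", "was", "good"])
```

The mapping is strictly sequential: once the search has moved past a segment it does not go back, which is what makes the skipped-element markers sufficient.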

Details

	Subject	Repo	Branch	Lines +/-
	Map tokens from TTS responses to HTML	mediawiki/extensions/Wikispeech	master	+1 K -317

Related Objects

Status	Assigned	Task
Resolved	Sebastian_Berlin-WMSE	T122132 [Story] Support re-listening (Wikispeech)
Resolved	None	T122133 [Story] Skip ahead in recitation (Wikispeech)
Open	None	T152430 Run Wikispeech offline
Resolved	HaraldBerthelsen	T143644 Multiple requests to TTS server should not cause delay
Resolved	Jopparn	T151786 Publically accessible demo (player) [Stage 1+2]
Resolved	Sebastian_Berlin-WMSE	T122158 Highlight recited text (was: Display the read word)
Declined	None	T133854 [Task] Audio player (Wikispeech)
Invalid	None	T134848 [Task] Apply code conventions to existing code (Wikispeech)
Invalid	None	T134750 [Task] Generate tags with time information (Wikispeech)
Resolved	Sebastian_Berlin-WMSE	T148623 Highlight recited word
Resolved	Sebastian_Berlin-WMSE	T148622 Highlight recited sentence
Resolved	Sebastian_Berlin-WMSE	T133687 Skip back in recitation (was: Re-listening to recitation)
Resolved	Sebastian_Berlin-WMSE	T133688 [Task] Skip ahead (Wikispeech)
Resolved	Sebastian_Berlin-WMSE	T140089 Skip ahead (word)
Resolved	Sebastian_Berlin-WMSE	T140105 Map TTS response to page HTML

Event Timeline

Lokal_Profil created this task. Jul 12 2016, 2:45 PM

Lokal_Profil moved this task from Incoming to Proposed for next sprint on the Wikispeech board.

Jopparn added a project: Wikispeech-WMSE. Jul 22 2016, 9:46 AM

Lokal_Profil added a parent task: T140089: Skip ahead (word). Aug 10 2016, 10:12 AM

Sebastian_Berlin-WMSE moved this task from Proposed for next sprint to Sprint 2016-08-10 on the Wikispeech board. Aug 10 2016, 12:24 PM

Sebastian_Berlin-WMSE edited projects, added Wikispeech (Sprint 2016-08-10); removed Wikispeech.

Sebastian_Berlin-WMSE claimed this task. Aug 23 2016, 11:15 AM

Sebastian_Berlin-WMSE moved this task from Backlog to In progress on the Wikispeech (Sprint 2016-08-10) board.

Lokal_Profil mentioned this in T133680: [Task] Functionality: Play selected text (Wikispeech). Aug 24 2016, 9:26 AM

Worked on in: 2016-08-10:

Initial investigation

To do in: Sprint 2016-08-24:

Remainder

There is a problem when a token in the response from the TTS server doesn't match the request, e.g. the input "1965" gives the token ["nineteen sixty five", 1.43] (there are also a bunch of newlines, but I'm assuming they aren't supposed to be there). Wikispeech-STTS: Would it be possible to also return the string that is the input for a token, i.e. ["nineteen sixty five", 1.43, "1965"]?

Sebastian_Berlin-WMSE added a subscriber: Wikispeech-STTS. Aug 25 2016, 12:15 PM

Yes.
https://morf.se/wikispeech/?lang=en&input=1965
now returns:

{
   audio: "http://morf.se//wikispeech_mockup/tmp/tmpciiev6_n.opus",
   tokens: [
      {
         endtime: 1.43,
         expanded: "nineteen sixty five",
         orth: "1965"
      },
      {
         endtime: 1.645,
         orth: ""
      }
   ]
}

So the player can use "orth" or "expanded", if it exists, depending on the use.
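
As a rough illustration of how a player might consume this response, the sketch below locates each token in the original text via "orth" and keeps "expanded" as the spoken form when it is present. It assumes that a token starts where the previous one ends and that a token with an empty "orth" is a pause with nothing to map; neither assumption is confirmed in this task. `locate_tokens` is a hypothetical helper, not existing Wikispeech code.

```python
def locate_tokens(text, tokens):
    """Attach a text offset and a time span to every token that has an "orth".

    Assumptions (not confirmed by the TTS documentation): a token starts at
    the previous token's endtime, and tokens with an empty "orth" are pauses.
    """
    located = []
    cursor = 0
    start_time = 0.0
    for token in tokens:
        orth = token["orth"]
        if orth:
            offset = text.find(orth, cursor)
            if offset == -1:
                raise ValueError(f"orth {orth!r} not found after offset {cursor}")
            located.append({
                "orth": orth,
                # Fall back to the written form when no expansion was returned.
                "spoken": token.get("expanded", orth),
                "offset": offset,
                "start": start_time,
                "end": token["endtime"],
            })
            cursor = offset + len(orth)
        start_time = token["endtime"]
    return located


# The token list from the response above, for the input "1965".
response_tokens = [
    {"endtime": 1.43, "expanded": "nineteen sixty five", "orth": "1965"},
    {"endtime": 1.645, "orth": ""},
]
located = locate_tokens("1965", response_tokens)
```

Matching on "orth" rather than "expanded" is what makes the lookup work for inputs like "1965", where the spoken form never appears in the page text.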

That was quick. Thanks.

Lokal_Profil edited projects, added Wikispeech (Sprint 2016-08-24); removed Wikispeech (Sprint 2016-08-10). Aug 26 2016, 12:26 PM

Lokal_Profil moved this task from Backlog to In progress on the Wikispeech (Sprint 2016-08-24) board.