It would be desirable if any request that has been synthesized were also cached, so that subsequent identical requests don't need to be resynthesized.
Description
| Status | Subtype | Assigned | Task |
|---|---|---|---|
| Open | | None | T152430 Run Wikispeech offline |
| Resolved | | HaraldBerthelsen | T143644 Multiple requests to TTS server should not cause delay |
| Resolved | | Jopparn | T151786 Publically accessible demo (player) [Stage 1+2] |
| Declined | | None | T151880 Caching on TTS server |
Event Timeline
Caching currently works as expected in the browser: if the page isn't reloaded, the audio elements are still there and play without resynthesising. Fair enough. But this issue refers more to caching either the generated sound files for an entire Wikipedia article, or the sound file for a unique sentence. This could certainly be done, keyed on the id of the article or the text of the sentence, but it will be a bit problematic to know when to resynthesise if the lexicon, the markup, the text processing or the synthesis has changed in any way.
A note from our previous discussions:
One solution is that the cache could be time limited, so that resynthesis would happen regularly enough anyway.
The other is that you could somehow delete from the cache any pronunciations containing a given word when that word is changed in the lexicon (or when some other part of the server-side logic that affects it changes). New markup on the wiki side would send a different request to the server, so it should not match the cached result anyway.
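The time-limited variant above could be sketched as follows. This is a minimal in-memory sketch, not the actual Wikispeech server code; the class name and TTL value are illustrative only:

```python
import time


class TtlCache:
    """Minimal time-limited cache: entries expire after ttl seconds,
    which forces periodic resynthesis even if nothing was explicitly
    invalidated."""

    def __init__(self, ttl_seconds):
        self.ttl = ttl_seconds
        self._store = {}  # key -> (value, stored_at)

    def put(self, key, value):
        self._store[key] = (value, time.monotonic())

    def get(self, key):
        entry = self._store.get(key)
        if entry is None:
            return None
        value, stored_at = entry
        if time.monotonic() - stored_at > self.ttl:
            # Expired: drop the entry so the caller resynthesises.
            del self._store[key]
            return None
        return value
```

A real deployment would pick a TTL long enough to keep the synthesis load down but short enough that lexicon changes propagate acceptably fast.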
Using a simple key-value store, we could cache responses on the server side too. The TTS result would be stored in a file and indexed by the page and text coordinates plus the utterance itself.
The parameters for the synthesis (like the ones in the HTTP requests) could be used as keys. This would avoid generating identical audio when the utterance string, language, voice etc. are the same.
Yes, but note that the "page" indicates the language but not the voice, so the key should include at least:
- language
- voice
- utterance
That could be sufficient. And maybe:
- page
- text coordinates on page
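Deriving one deterministic key from those parameters could look like this. A sketch only: the function name is hypothetical, and hashing canonical JSON with SHA-256 is just one reasonable choice:

```python
import hashlib
import json


def cache_key(language, voice, utterance, page=None, coordinates=None):
    """Derive a deterministic cache key from the synthesis parameters.

    language, voice and utterance form the minimal key; page and
    text coordinates are the optional extras discussed above.
    """
    params = {
        "language": language,
        "voice": voice,
        "utterance": utterance,
        "page": page,
        "coordinates": coordinates,
    }
    # Canonical JSON (sorted keys) so identical parameters always
    # hash to the same key regardless of argument order.
    canonical = json.dumps(params, sort_keys=True, ensure_ascii=False)
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()
```

Two requests with the same language, voice and utterance then map to the same key and reuse the same audio, while any change to the parameters yields a different key.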
A simple key-value store such as Redis or Memcached could be used to store the indexes and the paths to the files.
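The lookup-or-synthesize flow with such a store could be sketched as below. A plain dict stands in for Redis/Memcached here, and the cache directory path is made up for illustration:

```python
# A plain dict stands in for the key-value store; in production the
# same get/set calls would go to Redis or Memcached instead.
index = {}


def get_audio(key, synthesize):
    """Return the cached audio file path, calling the expensive TTS
    backend only on a cache miss."""
    path = index.get(key)                  # e.g. redis.get(key)
    if path is None:
        audio_bytes = synthesize()         # expensive TTS call
        path = "/var/cache/wikispeech/%s.ogg" % key  # hypothetical layout
        # Writing audio_bytes to path is omitted in this sketch.
        index[key] = path                  # e.g. redis.set(key, path)
    return path
```

The store only holds small strings (key to file path), so the audio itself stays on disk and the index stays cheap to keep in memory.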
I have a feeling that some kind of caching will be required when Wikispeech goes live. @HannaLindgren, @HaraldBerthelsen, @NikolajLindberg: is there any caching on the server currently?