
Avoid duplicate recording : compare list to speaker's previous records in target language so to hide words previously recorded
Open, High, Public

Description

Ideally, in Record wizard > Studio, when the speaker is facing a word list and ready to record, the on-screen list would be compared with the words that speaker has already recorded. Via a checkbox, already-recorded words could either be greyed out and skipped, or hidden completely. This feature request emerged recently as users are coming back <3 to record more word lists.
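The comparison described above can be sketched as follows. This is only an illustration, not RecordWizard code: `ListEntry`, `mark_recorded` and `visible_words` are hypothetical names, and it assumes the speaker's previous records in the target language are already available as plain strings, with two words counting as the same when they match exactly after trimming whitespace.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ListEntry:
    """One word of the on-screen list (hypothetical structure)."""
    word: str
    already_recorded: bool


def _key(word: str) -> str:
    # Assumption: exact match after trimming surrounding whitespace.
    return word.strip()


def mark_recorded(word_list, previous_records):
    """Flag each word of the list that the speaker has already recorded."""
    seen = {_key(w) for w in previous_records}
    return [ListEntry(w, _key(w) in seen) for w in word_list]


def visible_words(entries, hide_recorded):
    """Return the words to display; the checkbox state is `hide_recorded`."""
    return [e.word for e in entries
            if not (hide_recorded and e.already_recorded)]
```

With the checkbox off, `visible_words(entries, False)` returns the full list and the `already_recorded` flag can drive the grey styling; with it on, flagged words are dropped from the list.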

Note also:

I had recorded 20 words "around me". Just now I started another 20... and they are the same ones.
It could be worthwhile to add an option to avoid recording the same thing several times (my accent doesn't change from one day to the next).
Exilexi (talk) 05:36, 11 October 2018 (UTC)

Event Timeline

Yug created this task. Dec 23 2018, 6:27 PM
Yug moved this task from Backlog to RecordWizard on the Lingua Libre board. Dec 23 2018, 6:29 PM
Yug updated the task description. Dec 23 2018, 9:52 PM
Yug renamed this task from Compare list to speaker's previous records in target language so to hide words previously recorded to Avoid duplicate recording : compare list to speaker's previous records in target language so to hide words previously recorded.
0x010C updated the task description. Dec 26 2018, 8:21 PM
0x010C added a subscriber: 0x010C. Edited Dec 26 2018, 8:30 PM

This could be a cool feature, but I fear every technical implementation will be resource-intensive and slow. So before choosing a solution, we should test its scalability.

  1. A SPARQL request could be a good idea, but when I tried to fetch all records of Davidgrosclaude (currently the biggest contributor to Lingua Libre, with ~20 000 audio recordings), it took up to 9 s, which is far too long.
  2. Fetching the current user's contributions via the MediaWiki API could be even more expensive, because the API is limited to 500 pages per request and cannot filter by speaker or by language.
  3. Saving a static list somewhere would create many edge cases and desyncs to manage.
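To put a rough number on option 2, here is a back-of-the-envelope sketch using only the figures quoted above (~20 000 recordings, 500 pages per request). `min_api_requests` is a hypothetical helper, and the result is a lower bound: since the API cannot filter by speaker or language, the real number of requests could be higher.

```python
def min_api_requests(record_count: int, page_limit: int = 500) -> int:
    """Minimum number of paged requests needed to list `record_count` pages.

    Assumption: every request returns a full page of `page_limit` results.
    """
    if record_count < 0 or page_limit <= 0:
        raise ValueError("record_count must be >= 0 and page_limit > 0")
    # Integer ceiling division, avoids floating-point rounding.
    return -(-record_count // page_limit)


# Biggest contributor mentioned above: ~20 000 recordings.
requests_needed = min_api_requests(20_000)
```

For ~20 000 recordings this gives at least 40 sequential round trips before the word list can be filtered.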
0x010C triaged this task as High priority. Fri, Mar 15, 11:03 AM