Page MenuHomePhabricator

Audio processing: Identify corrupt audio files, tag them, reject them
Open, HighPublic

Description

  • Identify corrupted audio recordings
  • Tag those recording's Qid page with proper property P33 & value
  • Bot code so corrupted audio files are not added.

It think that Lingua Libre Bot does not consider the P33 property when it adds pronunciation file on Wikimedia projects (Wiktionary and Wikidata). To avoid adding bad quality files on projects, the bot should check the presence of P33 and do not add the pronunciations that have this statement (whatever is the value).

In addition, it would be interesting the bot remove pronunciation file for which P33 has been added after the file were imported on Wikimedia project.

Related Objects

Mentioned Here
P33 New loop

Event Timeline

Yug renamed this task from Do not add pronunciation files on Wikimedia projects if file has a problem to Audio processing: Identify corrupt audio files, tag them, reject them.Jul 7 2022, 2:30 PM
Yug updated the task description. (Show Details)
Yug triaged this task as High priority.Jul 21 2022, 8:14 AM