Page MenuHomePhabricator

Create system to identify abbreviations made with punctuation
Open, Needs TriagePublic

Description

Lots of languages use points to make abbr. (e. g. English, French, Basque...). TTS system must handle this abbreviations as part of the same sentence and give the correct pronunciation (abbreviation instead of abbr) instead of handling them as separate sentences. "E.g." and "i.e." must be read as "In example" and not as two sentences with one only work.

Event Timeline

The correct pronunciation of abbreviations will have to be handled by each of the TTS engines (they usually do). The extension itself w ill have to detect abbreviations properly so that it does not incorrectly split a sentence into two when it finds the dot in an abbreviation.

Change 439844 had a related patch set uploaded (by Elhuyar Fundazioa; owner: Elhuyar Fundazioa):
[mediawiki/extensions/Wikispeech@master] Fix correct handling of abbreviations when splitting sentences

https://gerrit.wikimedia.org/r/439844

Vvjjkkii renamed this task from Create system to identify abbreviations made with punctuation to spcaaaaaaa.Jul 1 2018, 1:09 AM
Vvjjkkii triaged this task as High priority.
Vvjjkkii updated the task description. (Show Details)
Vvjjkkii removed a subscriber: gerritbot.
CommunityTechBot renamed this task from spcaaaaaaa to Create system to identify abbreviations made with punctuation.Jul 1 2018, 6:51 PM
CommunityTechBot raised the priority of this task from High to Needs Triage.
CommunityTechBot updated the task description. (Show Details)
CommunityTechBot added a subscriber: gerritbot.

Change 458478 had a related patch set uploaded (by Elhuyar Fundazioa; owner: Elhuyar Fundazioa):
[mediawiki/extensions/Wikispeech@master] Fix correct handling of abbreviations when splitting sentences

https://gerrit.wikimedia.org/r/458478

Change 439844 abandoned by Elhuyar Fundazioa:
Fix correct handling of abbreviations when splitting sentences

https://gerrit.wikimedia.org/r/439844