Page MenuHomePhabricator

Expose translated messages from the Translate extension in a parallel corpora
Open, MediumPublic

Description

The Translate extension deals with a relevant set of translations that could help those creating open source translation services. This ticket proposes to (a) define way to export the translations in an appropriate format and (b) integrate the data with an open repository such as OPUS (the open parallel corpus).

In a similar case, Content translation has integrated the data on published translations in the OPUS repository and explored how to integrate MarianMT to use such data for machine translation (T234194).


This was proposed during discussions at the Language team offsite 2019

Event Timeline

Pginer-WMF renamed this task from Expose translations in a parallel corpora to Expose translated messages from the Translate extension in a parallel corpora.Dec 3 2019, 11:03 AM
Pginer-WMF triaged this task as Medium priority.
Pginer-WMF added a subscriber: Nikerabbit.