As an improvement for usability for translation service front-end, detect the language of source content automatically.
This is based on Compact Language Detector 2 library which is able to detect 83 languages.
Description
Details
Status | Subtype | Assigned | Task | ||
---|---|---|---|---|---|
Open | None | T363306 Wish and focus area pages translation | |||
Open | None | T295862 View translated messages in Talk pages | |||
Declined | None | T98728 View translated messages in Flow | |||
Resolved | santhosh | T99666 Provide a service to detect which language the user is writing on | |||
Resolved | santhosh | T334465 MinT: Detect language of source content automatically |
Event Timeline
Change 905782 had a related patch set uploaded (by Santhosh; author: Santhosh):
[mediawiki/services/machinetranslation@master] Automatic language detection for source content
Change 905782 merged by jenkins-bot:
[mediawiki/services/machinetranslation@master] Automatic language detection for source content
Hi @santhosh, can you please associate one or more active project tags with this task (via the Add Action... → Change Project Tags dropdown)? That will allows to see a task when looking at project workboards or searching for tasks in certain projects, and get notified about a task when watching a related project tag. Thanks!
I have WIP patch to use fasttext to increase the coverage of languages, but on low priority now .
For long term solution discussion, follow https://phabricator.wikimedia.org/T99666
Change 929148 had a related patch set uploaded (by Santhosh; author: Santhosh):
[mediawiki/services/machinetranslation@master] Do not change current target selection when detecting language
Change 929148 merged by jenkins-bot:
[mediawiki/services/machinetranslation@master] Do not change current target selection when detecting language
Change 929439 had a related patch set uploaded (by KartikMistry; author: KartikMistry):
[operations/deployment-charts@master] Update MinT to 2023-06-12-125157-production
Change 929439 merged by jenkins-bot:
[operations/deployment-charts@master] Update MinT to 2023-06-13-061519-production
Mentioned in SAL (#wikimedia-operations) [2023-06-13T07:09:24Z] <kart_> Updated MinT to 2023-06-13-061519-production (T337656, T334465)
Initial support for 83 languages seems good-enough for the immediate use cases. Further developments in the space can be captured in follow-up tickets as sub-tasks of T99666: Provide a service to detect which language the user is writing on