The machine translation models used by MinT operate on plain text. However, Wikipedia content contains links, references, and styling such as bold or italics that we want to preserve when content is translated. As part of the Content Translation work, approaches to reapply styling were developed, and these have been ported to MinT (T341478).
With the current approach, some rich text elements may be misplaced or may disappear. A reliable system for translating rich text content becomes more relevant as we consider exposing MinT to Wikipedia readers. This ticket captures work on exploring better approaches to support rich text translation, and collects issues that can be used as test cases to check that support has improved.
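As a rough illustration of how a reported issue could be turned into an automated test case, the sketch below compares the inline elements of a source fragment with those of its translation and reports any that went missing. It is not part of MinT: the names `TagCounter`, `count_tags`, and `missing_tags` are hypothetical, the sample HTML is made up, and the check only detects elements that disappear, not elements that end up in the wrong place.

```python
from collections import Counter
from html.parser import HTMLParser


class TagCounter(HTMLParser):
    """Counts the opening tags seen in an HTML fragment (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.tags = Counter()

    def handle_starttag(self, tag, attrs):
        self.tags[tag] += 1


def count_tags(html):
    """Return a Counter mapping tag name to number of occurrences."""
    parser = TagCounter()
    parser.feed(html)
    parser.close()
    return parser.tags


def missing_tags(source_html, translated_html):
    """Return tags that occur fewer times in the translation than in the source."""
    # Counter subtraction keeps only the positive differences.
    return dict(count_tags(source_html) - count_tags(translated_html))


# Made-up example: the bold element is lost in the translation.
source = '<p><b>Paris</b> is the capital of <a href="./France">France</a>.</p>'
translated = '<p>París es la capital de <a href="./Francia">Francia</a>.</p>'
print(missing_tags(source, translated))  # {'b': 1}
```

A check like this would flag regressions where markup disappears. Detecting misplaced elements would additionally require comparing which words each element wraps, which depends on alignment between source and translated text.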
Related tickets:
- General support for rich text (not MinT-specific): T314127: Improve approach to re-apply rich text elements on content from plain-text translation services
- Support for Wikitext markup: T347018: MinT support for Wikitext