
Server-side caching for MinT
Open, Medium, Public

Description

MinT supports machine translation for different products. In the initial integrations of MinT to support Content Translation and the Localization infrastructure, the requested contents to translate are quite unique. That is, an editor creating a new translation for a Wikipedia article will generate requests for MinT to translate the different sentences of that article, which will be very different from those translated by another editor working on a different article. In this context, there was no need for caching.

However, as MinT is applied in contexts where multiple users may be reading the same translated content, there may be a greater benefit in applying caching to reduce waiting times and the general workload for the service. This may be relevant for contexts such as the translation of wishes (T363306) for the new Wishlist process and MinT for Wikipedia readers (T359072).

This ticket proposes to consider ways in which the MinT service could use caching to provide better performance in the described contexts. Given the same request text, language pair, and model, the result should come from the caching system if the exact same request was made previously/recently.
We need to consider that language models are not updated frequently, but we also plan to allow users to provide community-verified translations (T351748), and the caching system should not get in the way of users being able to correct a bad translation (i.e., by taking a long time to reflect changes).
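One way to meet both requirements is to derive the cache key from all of the parameters that determine the output, including the model identifier, so that deploying a new model version naturally invalidates stale entries. The sketch below is a hypothetical scheme to illustrate the idea, not the actual MinT or cxserver implementation; the parameter names are assumptions.

```python
import hashlib

def cache_key(text: str, source_lang: str, target_lang: str, model: str) -> str:
    """Derive a deterministic cache key from the request parameters.

    Including the model identifier means a model update produces new
    keys, so old cached translations are never served for a new model.
    (Hypothetical scheme for illustration only.)
    """
    payload = "\n".join([model, source_lang, target_lang, text])
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()
```

Community-verified translations (T351748) could then bypass or overwrite the cached entry for the same key, so corrections show up without waiting for expiry.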

Event Timeline

Pginer-WMF added a project: MinT.

CXServer may be a better place for caching to keep MinT itself simple(r). One could even go one step further and make a caching proxy API in MediaWiki, given that MediaWiki has easy-to-use caching options.

A simple LRU cache with a maximum storage limit could be a good starting point, with monitoring of the cache hit rate to see whether the cache is effective at all.
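As a rough illustration of that starting point, the sketch below combines a bounded LRU cache with hit/miss counters for monitoring. It is a minimal in-memory sketch (class and attribute names are assumptions); a production deployment would more likely use an existing cache backend with metrics exported to the monitoring stack.

```python
from collections import OrderedDict

class LRUCache:
    """Bounded LRU cache that tracks its own hit rate (illustrative sketch)."""

    def __init__(self, max_entries: int):
        self.max_entries = max_entries
        self._data = OrderedDict()  # insertion order doubles as recency order
        self.hits = 0
        self.misses = 0

    def get(self, key):
        if key in self._data:
            self._data.move_to_end(key)  # mark as most recently used
            self.hits += 1
            return self._data[key]
        self.misses += 1
        return None

    def put(self, key, value):
        self._data[key] = value
        self._data.move_to_end(key)
        if len(self._data) > self.max_entries:
            self._data.popitem(last=False)  # evict least recently used entry

    @property
    def hit_rate(self) -> float:
        total = self.hits + self.misses
        return self.hits / total if total else 0.0
```

Watching `hit_rate` over time would show whether translation requests actually overlap enough for the cache to pay off.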

It may also be beneficial to support client-provided cache hints to determine which content should be cached and for how long.

Thanks for surfacing potential approaches @Nikerabbit!

I'd defer to the engineers to identify the most promising directions. From my perspective, starting simple and reusing existing capabilities from the platform sounds like a great approach.

This way we can avoid premature optimization: we anticipate that caching may be beneficial, but we do not yet have a sense of how often requests will actually overlap in practice.