
Lua error: too many language codes requested
Closed, Resolved, Public

Assigned To: Anomie
Authored By: Elitre, Dec 29 2014, 6:23 PM
Tokens: "Love" tokens awarded by Quiddity, Whatamidoing-WMF, and Elitre.

Description

When trying to assemble the December issue of the VE multilingual newsletter, I got this message after this edit. (I guess Lua can't actually handle so many languages. If I'm right, this needs to change: 20-something languages aren't "many" at all.)

More info:

Backtrace:
[C]: in function "isRTL"
mw.language.lua:99: in function "isRTL"
mw.language.lua:132: in function "getDir"
Module:Assemble_multilingual_message:46: in function "chunk"
mw.lua:490: ?
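
For context, a minimal sketch of how a module like Module:Assemble_multilingual_message can trip the limit (hypothetical module and arbitrary code list; as I understand it, mw.language.new() itself is cheap, and the error fires when a method such as getDir(), which calls isRTL() as in the backtrace above, first needs the underlying Language object for a code beyond the cache size):

-- Hypothetical reproduction: iterate over more than 20 distinct
-- language codes and ask each one for its directionality.
local p = {}

function p.demo()
    local codes = {
        'ar', 'cs', 'da', 'de', 'el', 'en', 'es', 'fa', 'fi', 'fr',
        'he', 'hu', 'it', 'ja', 'ko', 'nl', 'pl', 'pt', 'ru', 'sv',
        'zh', -- with a default cache size of 20, this 21st code errors out
    }
    local out = {}
    for _, code in ipairs( codes ) do
        -- getDir() is where "too many language codes requested" surfaces
        out[#out + 1] = code .. ': ' .. mw.language.new( code ):getDir()
    end
    return table.concat( out, ', ' )
end

return p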

Event Timeline

Elitre raised the priority of this task to Needs Triage.
Elitre updated the task description. (Show Details)
Elitre added subscribers: Elitre, gpaumier, Amire80.

Thanks, Guillaume. Splitting the page doesn't seem very feasible though - one would have to split recipients' lists as well, and maintaining them... doesn't really look sustainable. Has this never occurred for Tech News?

I note the restriction was added by Tim (see Gerrit change 36496, which was the original source of Gerrit change 42050, the one actually merged), so I'm adding him as a CC here.

> Has this never occurred for Tech News?

No, we've never had so many translations in Tech News.

Aklapper triaged this task as Medium priority. Dec 29 2014, 8:06 PM

Thank you, Anomie! The VE newsletter is quite popular, so I'd really like to get rid of the restriction somehow if possible (/bragging).

This limit is also why Commons' Template:Dir (https://commons.wikimedia.org/wiki/Template:Dir), the most transcluded template on Commons (with 32,721,515 pages transcluding it), does not use Lua's isRTL function.
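
For illustration, a hedged sketch of the kind of workaround that avoids the limit entirely: answer direction queries from a static lookup table instead of constructing Language objects (the real Template:Dir is implemented in wikitext; this table is illustrative and deliberately incomplete):

-- Hypothetical Lua equivalent of the workaround: no call to
-- mw.language.new( code ):isRTL(), so no Language objects are
-- constructed and the cache cap never applies.
local rtlCodes = {
    ar = true, arc = true, ckb = true, dv = true, fa = true,
    he = true, ps = true, ur = true, yi = true,
}

local function getDir( code )
    return rtlCodes[code] and 'rtl' or 'ltr'
end

The trade-off is that such a table has to be maintained by hand as languages are added, which is exactly the kind of duplication isRTL was meant to remove.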

@Johan, I gather this is a problem you may also have at some point.

This seems to go back to the old issue that constructing Language objects was slow and memory-consuming. The most likely culprit was preloading a big set of language data and some messages every time a Language object was constructed. But I was under the impression that this was improved long ago, so maybe the restriction in Lua is no longer necessary. Or, if it turns out there are still issues, Language object construction should be made lightweight.

Would it be feasible to raise the cap to, say, 30 languages (although that's a limit the VE newsletter has already exceeded at least once) and see how it goes?

IIRC my concern was that there was no eviction from the cache in LocalisationCache::$data, which continues to be the case. If recaching is required, memory usage is about 1.5MB per language, measured locally today. So a limit of 20 implies 30MB, which seems reasonable compared to the Lua memory limit of 50MB.

However, in the WMF production setup, l10n recaching is done in advance, so there's no way recaching would be done in response to a Lua request. It only needs to load the preloaded messages plus the data actually requested (isRTL). Using eval.php on a production server, I measured 145KB per language, which is not so concerning.
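
For scale (my arithmetic, extrapolating from the figures above): in the precached production setup, even a 200-language cache would cost roughly 200 × 145KB ≈ 28MB, comfortably under the 50MB Lua limit, whereas at the 1.5MB on-demand recaching cost the same cap would imply around 300MB, which is why the default has to stay conservative.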

So I think the options are:

  1. Fix LocalisationCache so that it doesn't use unlimited amounts of memory when recaching on demand. Implementation ideas can be found in LocalisationCacheBulkLoad.
  2. Make MAX_LANG_CACHE_SIZE configurable: set it to, say, 200 on WMF wikis and leave it at 20 by default. We would probably need to add ScribuntoEngineBase::getOptions() so that Scribunto_LuaLanguageLibrary::register() can access the configuration options, which are stored in a protected variable.
  3. Increase the default MAX_LANG_CACHE_SIZE very modestly, say to 30. This seems like a bad choice since in a month or two someone will reopen the same bug, having found a really important use case for loading 31 languages.

Somewhat off topic, but speaking of lightweight Language object construction: I think having $wgLangObjCacheSize = 10 by default is actually bit rot; I don't think there's any reason for it anymore. Language objects used to hold message arrays, which is presumably why the cache was set so low.

Thanks Tim. What needs to be done here? Do we need to involve someone in particular in this conversation? (I'd really love to be able to deliver the next visual editor newsletter in more than 20 languages.)

Change 343590 had a related patch set uploaded (by Tim Starling):
[mediawiki/extensions/Scribunto] Make the maximum language cache size configurable

https://gerrit.wikimedia.org/r/343590

Change 343590 merged by jenkins-bot:
[mediawiki/extensions/Scribunto@master] Make the maximum language cache size configurable

https://gerrit.wikimedia.org/r/343590

I just ran into this issue again at https://commons.wikimedia.org/wiki/Module_talk:Name/testcases. Maybe some functions do not need to be tied to language objects, like mw.language:isRTL or mw.language:lc.

One consequence of this error is that Lua code writers avoid calling mw.language functions and use frame:callParserFunction instead. For example, c:Module:Date (used on 46M pages) uses datestr = mw.getCurrentFrame():callParserFunction( "#time", { dFormat, timeStamp, lang } ) instead of the more logical datestr = mw.language.new( lang ):formatDate( dFormat, timeStamp ). The output is the same, except there is no "Lua error: too many language codes requested". From a performance point of view, is there a difference between mw.language.new(lang) and mw.getCurrentFrame():callParserFunction calls?
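
For readability, here are the two call styles from that example side by side, with the trade-off as I understand it (dFormat, timeStamp, and lang are assumed to be defined by the surrounding module; the variable names are mine):

-- Parser-function route: no Language object is constructed on the
-- Lua side, so the language cache cap is never hit, but every call
-- round-trips through the parser's #time implementation.
local viaParser = mw.getCurrentFrame():callParserFunction(
    '#time', { dFormat, timeStamp, lang } )

-- Language-object route: more direct, but each distinct lang counts
-- toward Scribunto's language cache and can raise the error above.
local viaLangObj = mw.language.new( lang ):formatDate( dFormat, timeStamp )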

Change 430068 had a related patch set uploaded (by Anomie; owner: Anomie):
[operations/mediawiki-config@master] Raise Scribunto maxLangCacheSize to 200

https://gerrit.wikimedia.org/r/430068

Change 430068 merged by Tim Starling:
[operations/mediawiki-config@master] Raise Scribunto maxLangCacheSize to 200

https://gerrit.wikimedia.org/r/430068

Anomie claimed this task.

Wouldn't it be nice to set the value automatically, depending on whether manual recache is enabled or not? Hopefully things will be faster by the time we start hitting the 200-language limit :)