[Task] UI should get list of supported languages from the backend, via dedicated resource loader module
Closed, DeclinedPublic
Actions

Description

We should have a single source for available languages. The backend should determine them (see T78006), and provide them to the frontend via a dedicated ResourceLoader module, similar to the SitesModule.

It should be possible to both add and remove available language codes via configuration (see T86182). Sources to be considered for available language codes:

MediaWiki's list of UI languages
$wgExtraLanguageNames (for additional codes)
$wgDummyLanguageCodes (for alias codes)
UniversalLanguageSelector / CLDR (if available)
Babel (if it has support for extra codes)

Related Objects
Search...

Status	Assigned	Task
Resolved	Lydia_Pintscher	T74126 [Story] Monolingual text does not accept sr-cyrl and a number of other language codes
Declined	None	T74590 [Bug] Monolingual code is missing for Romani (rom) and Scandoromani (rmg-variant)
Resolved	adrianheine	T72205 [Story] Mono-lingual text datatype should support "no linguistic content" and "undetermined language"
Resolved	Smalyshev	T85385 Implement "Monolingual text" in Pywikibot
Resolved	adrianheine	T95286 [Story] Monolingual text should support macrolanguages
Open	None	T124286 [Epic] Wikidata language support
Resolved	Addshore	T78006 [Story] Determine list of available languages in a uniform way
Declined	adrianheine	T78007 [Task] UI should get list of supported languages from the backend, via dedicated resource loader module

Event Timeline

daniel created this task.Dec 9 2014, 3:06 PM

daniel raised the priority of this task from to Medium.

daniel updated the task description. (Show Details)

daniel added projects: Wikidata, MediaWiki-extensions-WikibaseRepository.

daniel changed Security from none to None.

daniel added subscribers: thiemowmde, JanZerebecki, Lydia_Pintscher and 3 others.

Note that there need to be at least two sets of languages:
one for languages that is supported for the UI (Names.php, pretty much),
and one for input (labels, description, monolingual text, etc).

The user interface languages may already be available as a resource (at least if ULS is present), so we could reduce the size of the data to load by only including the additional languages in the custom module.

Lydia_Pintscher moved this task from incoming to ready to go on the Wikidata board.Dec 15 2014, 1:26 PM

Terms currently are silently limited to UI languages, since you only get the input controls for UI languages. Internally, the views wrongly work with $.uls.data.languages, though.
With https://github.com/wmde/ValueView/pull/144, monolingual text uses UI languages consistently (i. e., frontend and backend).

Actually, the views work with wgULSLanguages (UI languages), but fall back on $.uls.data.languages if the requested language is not in wgULSLanguages. We could probably just remove this fallback. @thiemowmde

adrianheine added a subtask: T78006: [Story] Determine list of available languages in a uniform way.Jan 13 2015, 2:03 PM

adrianheine removed a subtask: T78006: [Story] Determine list of available languages in a uniform way.Jan 13 2015, 2:09 PM

Lydia_Pintscher added a project: § Wikidata-Sprint-2015-02-03.Feb 3 2015, 1:54 PM

I'm wondering how to best implement this. One goal would be to not duplicate ext.uls.languagenames (which currently has 3.8kb gzipped / 7.3kb uncompressed for en). I'd also like to have something quite flexible: It should support at least ULS, ULS + static language list, ULS - static language list, static language list as values. I think this issue mirrors the question of how to configure content languages. I suggest to take a similar approach for both problems, if not exactly the same.

What I'm currently thinking about is basically giving a class hierarchy as config and serialization:

$monolingualTextValueLanguages = FilteringContentLanguages::spec(
  MergingContentLanguages::spec(
    UlsContentLanguages::spec(),
    ListContentLanguages::spec( array( 'zxx', 'und' ) )
  ),
  ListContentLanguages::spec( array( 'en', 'fr', 'de' ) )
); /* => array(
  'type' => 'Filtering',
  '_left' => array(
    'type' => 'Merging',
    '_left' => array( 'type' => 'Uls' ),
    '_right' => array( 'type' => 'List', '_list' => array( 'zxx', 'und' ) )
   ),
   '_right' => array( 'type' => 'List', '_list' => array( 'en', 'fr', 'de' ) )
) */

mw.config.set( 'wbMonolingualTextValueLanguages', {
  type: 'Filtering',
  _left: {
    type: 'Merging',
    _left: { type: 'Uls' },
    _right: { type: 'List', _list: [ 'zxx', 'und' ] }
   },
  _right: { type: 'List', _list: [ 'en', 'fr', 'de' ] }
}
} );

@JeroenDeDauw @daniel @thiemowmde What do you think?

Lydia_Pintscher assigned this task to adrianheine.Feb 25 2015, 12:16 PM

Tobi_WMDE_SW added a project: § Wikidata-Sprint-2015-02-25.Feb 25 2015, 12:16 PM

Tobi_WMDE_SW moved this task from Backlog to Review on the § Wikidata-Sprint-2015-02-25 board.Feb 25 2015, 12:20 PM

Lydia_Pintscher moved this task from Backlog to Review on the § Wikidata-Sprint-2015-02-03 board.Feb 25 2015, 12:23 PM

Tobi_WMDE_SW added a project: § Wikidata-Sprint-2015-03-11.Mar 11 2015, 12:12 PM

Needs feedback and confirmation. If confirmed implementation needs to be done.

Tobi_WMDE_SW moved this task from Backlog to Review on the § Wikidata-Sprint-2015-03-11 board.Mar 11 2015, 12:14 PM

adrianheine mentioned this in T86182: [Story] Allow list of available languages to be configurable.Mar 12 2015, 10:27 AM

The proposal in T78007#1049400 looks good to me.

Tobi_WMDE_SW added a project: § Wikidata-Sprint-2015-03-24.Mar 24 2015, 3:06 PM

Tobi_WMDE_SW moved this task from Backlog to Review on the § Wikidata-Sprint-2015-03-24 board.Mar 24 2015, 3:11 PM

Tobi_WMDE_SW removed a project: § Wikidata-Sprint-2015-03-24.

Ricordisamoa subscribed.Jul 27 2015, 5:51 AM

• Jonas renamed this task from UI should get list of supported languages from the backend, via dedicated resource loader module to [Task] UI should get list of supported languages from the backend, via dedicated resource loader module.Sep 10 2015, 7:36 PM

daniel updated the task description. (Show Details)Oct 8 2015, 11:19 AM

@adrianheine, the example is confusing me a bit. Why are you filtering en, fr and de from a list that was just composed from ULS plus a few extra languages?
One problem I see with the serialization format is that it doesn't give the individual parts names. For example, the merged sub-list in the example can not be addressed and reused.
Why not do something like:

mw.config.set( 'wbMonolingualTextValueLanguages', [
    { type: 'Uls' },
    { type: 'Merging', list: [ 'zxx', 'und' ] },
    { type: 'Filtering', list: [ 'en', 'fr', 'de' ] }
] );

To me it looks like this could do exactly the same while being way easier to read, parse and understand.

The example allows all languages known to ULS plus zxx and und, but minus en, fr and `de. It's obviously artificial :)
I don't see why you would need to reuse parts in the serialization. Can you give me a use-case for that?
Merging and filtering are operations with two operands. The tree structure makes this structure very explicit. Your proposal seems to be a postfix notation, which makes it difficult to follow which operand belongs to which operation. Also you're breaking polymorphism by only allowing plain lists as second operand. A true postfix notation would in my opinion look like this:

mw.config.set( 'wbMonolingualTextValueLanguages', [
  { type: 'Uls' },
  { type: 'List', data: [ 'zxx', 'und' ] },
  { type: 'Merging' },
  { type: 'List', data: [ 'en', 'fr', 'de' ] },
  { type: 'Filtering' }
] );

I would probably switch to the mathematical operation names: ›Union‹ instead of ›Merging‹, ›Difference‹ instead of ›Filtering‹.

daniel added a project: Wikidata-Sprint-2015-11-03.Nov 2 2015, 7:17 PM

I agree that it would be nice to have a flexible system like this. Whether it is given in the form of nested constructors, or nested object literals, I don't care much. But it should be possible to combine the bits freely. In Thiemo's version, I'm not sure how I would add the ULS languages to a fixed list (instead of vice versa).

In addition to fixed list, ULS, union, and difference, I think may also want a "core" languages list. Or would "ULS" just use the core languages, if ULS is not there? We need a sane fallback for the case that ULS is not installed.

For the structural representation, I would probably go for something very simple:

[ { op: "add", source: "uls" }, { op: "add", data: ['a', 'b'] }, { op: "remove", data: ['x', 'y'] } ]

However, when adding explicit languages, just specifying the language code is not enough. For each of those languages, we would need to provide at least a localized name, and maybe some additional info, like rtl/ltr. So we'd need something like:

{ op: "add", data: { a: { name: "Aaarg" }, b: { name: "Böörk", dir: "rtl" } } }

adrianheine added a project: MediaWiki-extensions-WikibaseView.Nov 4 2015, 8:11 AM

Can be even simpler with the same flexibility:

Adrians original example: [ { add: "uls" }, { add: ["zxx", "und"] }, { remove: ["en", "fr", "de"] } ]
Daniels example: [ { add: "uls" }, { add: ["a", "b"] }, { remove: ["x", "y"] } ]
[ { add: { a: { name: "Aaarg" }, b: { name: "Böörk", dir: "rtl" } } } ]

This is almost identical to Daniels suggestion, the only difference is that I ditched the keys "op", "source" and "data". I can see that this is a bit more readable, but technically it's not necessary. Strings describe a source, arrays and objects are data. The op is a key instead of a string.

I'm strongly against putting information in keys, and I'm against data and source keys. They break extensibility and composability.

Seems like we all broadly agree on what information and structure is needed. I think we can leave the color of the bike shed to whoever builds it, now :)

"add" and "remove" is not more or less information than "type" or "data".
Not sure what's wrong with Daniels suggestion. I'm fine with it, including "source" and "data", which by the way was also in the original proposal.

In T78007#1782201, @thiemowmde wrote:

"add" and "remove" is not more or less information than "type" or "data".

Not sure what's wrong with Daniels suggestion. I'm fine with it, including "source" and "data", which by the way was also in the original proposal.

type should be the only key that's known to the factory/deserializer/whatever. data is internal to ListContentLanguages. The class responsible for performing a union does not know about sources and data literals, it just knows about two (or more) ContentLanguages. source was not in my original proposal.

Yellow! it should be yellow!

Liuxinyu970226 subscribed.Nov 16 2015, 5:54 AM

We decided in T124758 to not pass the complete list(s) of language codes to the UI, but instead implement a language suggestion API endpoint.

Restricted Application removed a subscriber: Liuxinyu970226. · View Herald TranscriptFeb 25 2016, 12:41 PM

[Task] UI should get list of supported languages from the backend, via dedicated resource loader moduleClosed, DeclinedPublicActions

Description

Related ObjectsSearch...

Event Timeline

[Task] UI should get list of supported languages from the backend, via dedicated resource loader module
Closed, DeclinedPublic
Actions

Related Objects
Search...