Motivation
There are two cases, where the Wikibase interface is tempted to show fingerprints in the termbox for languages that are not supported anymore:
Case 1
- The ULS extension sends user preferred languages, which are not all included in the language set allowed for the wiki. This can happen, e.g. because someone has a not accepted language in their babel box. In this case, there is never any content for the language.
Example of current buggy behavior
A user has fil as a language in their babel box, resulting in a weird looking and behaving first line of the in more languages section. Note the red text for missing label and description is shifted to the left because the language is totally missing.
Case 2
- A language was supported at some point, but it was then removed from the list of expected languages. As of now, there is no content for current revisions in such languages. Thus, the language only appears in the most recent revision with case 1, and it may appear in previous revisions back when the language was still supported and there was content for it.
Example (and the only one so far) of language that is not anymore supported
Support for Toki Pona stopped quite a while ago. In Spring 2019, all of its contents was deleted from items where some existed.
As a Wikidata reader or editor
I want to only see languages that are actually supported by Wikidata
so that I can edit everything I see
Acceptance Criteria
- Item and property pages (including diff views) do not show labels, descriptions, nor aliases for language codes currently not recognized by Wikibase
- Data returned by action API (wbgetentities, wbsearchentities) , and by Special:EntityData API for items and properties do not include labels, descriptions, nor aliases for language codes currently not recognized by Wikibase, for all revisions of item/property
- Dumps in all formats (JSON, RDF, all flavours) do not include item and property labels, descriptions, nor aliases for language codes currently not recognized by Wikibase
Notes
- Please note, that this means that for a few revisions (the ones were Toki Pona items were changed), diffs between 2 revisions look super weird. As a next step, one might want to indicate what happened in those revisions. The task for this is T225789: Indicate when there were changes on not anymore supported languages
- T222790: getFrequentLanguageList: expected behavior for 'redirected' languages has been raised to clarify if an upstream change to the ULS data source is feasible.
- In scope of this story are only items and properties. Lexemes, MediaInfo entities, Entity Schemas are not considered there
- The behavior with regards to the monolingual text values in statements is not in scope here, and should not change.
More info about how the selection of languages for users currently work in Wikibase
Termbox is showing languages considered "preferred by the user" (T213720) in the "more languages" sections.
These languages are sourced from their babel box (config.get( 'wbUserSpecifiedLanguages' )) or ULS (uls.getFrequentLanguageList()) - the latter being influenced by e.g. the country you are surfing the web from, user agent languages, languages previously used on mediawiki projects. They can, apparently, contain language codes that do exist but are not considered to be full fledged MediaWikiContentLanguages (wb terminology) but delegate to another language code instead.


