Ensure correct information about Wikimedia sites in the Sites facility on the Wikimedia cluster.
Open, Medium, Public

Description

Currently, SiteLookup is backed by the information in the sites table. The information in this table is incorrect for many wikis in the Wikimedia cluster. In particular:

  • The site language is always set to the interwiki prefix. This is incorrect for sites like simple.wikipedia.org, which should have the language code "en". This could be solved by applying the $wgDummyLanguageCodes mapping when importing (or when reading, as proposed by T137534: Map dummy language codes in sites).
  • The language-code interwiki prefixes (e.g. "fr" or "en") always point to Wikipedia. This is incorrect for sites like Wiktionary, where "fr" should point to fr.wiktionary.org, not fr.wikipedia.org.
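
The two problems above can be illustrated with a minimal Python sketch. It is not MediaWiki code: the mapping contents and the helper names `site_language` and `interwiki_target` are hypothetical, and the only facts taken from this task are that "simple" should map to "en" and that "fr" on a Wiktionary should resolve to fr.wiktionary.org.

```python
# Hypothetical subset of a dummy-language-code mapping (assumption: a real
# deployment would read this from the wiki configuration, not hard-code it).
DUMMY_LANGUAGE_CODES = {"simple": "en"}


def site_language(interwiki_prefix: str) -> str:
    """Return the language code for a site, mapping dummy codes to real ones."""
    return DUMMY_LANGUAGE_CODES.get(interwiki_prefix, interwiki_prefix)


def interwiki_target(prefix: str, family: str) -> str:
    """Resolve a language prefix relative to the current site's family."""
    return f"{prefix}.{family}.org"


print(site_language("simple"))
print(interwiki_target("fr", "wiktionary"))
```

The point of the sketch is that the language code and the interwiki prefix are separate values, and that resolving a prefix needs the family of the wiki doing the lookup.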

Possible solutions:

Event Timeline

daniel renamed this task from "Ensure currect information about Wikimedia sites in the Sites facility on the Wikimedia cluster." to "Ensure correct information about Wikimedia sites in the Sites facility on the Wikimedia cluster." Jun 10 2016, 10:42 AM
daniel created this task.

Currently, SiteLinksRdfBuilder and OtherProjectsSitesGenerator depend on the wrong language codes.

thiemowmde triaged this task as Medium priority. Sep 6 2016, 10:41 AM
thiemowmde moved this task from in progress to hold on the Wikidata board.