Page MenuHomePhabricator

Wikidata Languages Landscape
Open, Needs TriagePublic

Description

For WikidataCon 2019, develop

  • an extensive study of how different languages are used and re-used in Wikidata and Wikimedia products;
  • deliver a dashboard encompassing an overview of the most significant findings.

Event Timeline

Restricted Application added a subscriber: Aklapper. · View Herald TranscriptApr 26 2019, 1:20 PM
  • Fundamental dataset (language code x language code similarity/distance matrix) produced from WD JSON dump, hdfs copy w. Pyspark;
  • PoC completed; can be done;
  • details forthcoming.