
Develop Metrics for the Language Gap: Develop metrics for language coverage on Wiki Commons (descriptions)
Open, Needs Triage, Public

Description

One proposed metric facet for the State of Languages Metrics is Wikimedia Commons coverage, comprising the following proposed metrics:

Overall coverage:

  • Number of languages with captions on Commons (see T372641)
  • Number of languages with descriptions on Commons (this ticket)

Per language:

  • Status of Commons captions in the language (e.g., present, absent) (see T372641)
  • Status of Commons descriptions in the language (e.g., present, absent) (this ticket)
  • Number of captions in the language (see T372641)
  • Number of descriptions in the language (this ticket)
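
As a rough sketch of how the description metrics above could be derived once per-language description counts exist: the function name `coverage_metrics`, its inputs, and the output keys are all hypothetical, chosen only for illustration, and "present" is assumed to mean at least one description in the language.

```python
def coverage_metrics(description_counts, languages):
    """Derive overall and per-language description metrics.

    description_counts: dict mapping language code -> number of descriptions
                        (hypothetical input, e.g. produced by a scraping notebook).
    languages: iterable of language codes to report on (assumed reference list).
    """
    per_language = {}
    for code in languages:
        n = description_counts.get(code, 0)
        per_language[code] = {
            # Assumption: "present" means at least one description exists.
            "status": "present" if n > 0 else "absent",
            "descriptions": n,
        }
    languages_with_descriptions = sum(
        1 for entry in per_language.values() if entry["status"] == "present"
    )
    return {
        "languages_with_descriptions": languages_with_descriptions,
        "per_language": per_language,
    }


metrics = coverage_metrics({"en": 3, "de": 1}, ["en", "de", "fr"])
```

Languages in the reference list that have no descriptions are reported as "absent" with a count of 0 rather than being dropped, so the per-language table stays complete.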

Tasks:

  • Build a notebook to scrape and aggregate file description languages from Commons wikitext
  • Build one or more notebooks for metrics calculation and visualization
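
A minimal sketch of the scraping and aggregation step, assuming file descriptions are marked up with language templates such as `{{en|...}}` or `{{de|1=...}}`. The helper names and the `known_codes` parameter are hypothetical, and the regex is a heuristic that scans the whole page rather than only the description field, so a real notebook would likely want a proper wikitext parser.

```python
import re
from collections import Counter

# Heuristic: matches the opening of a template whose name looks like a
# language code, e.g. {{en|...}} or {{pt-br|1=...}}.
LANG_TEMPLATE = re.compile(
    r"\{\{\s*([a-z]{2,3}(?:-[a-z0-9]+)*)\s*\|", re.IGNORECASE
)


def description_languages(wikitext, known_codes):
    """Return the set of known language codes used as templates in wikitext."""
    found = {m.group(1).lower() for m in LANG_TEMPLATE.finditer(wikitext)}
    # Filter against a reference list to drop non-language templates.
    return found & set(known_codes)


def count_by_language(pages, known_codes):
    """Count, per language, how many pages carry a description template."""
    counts = Counter()
    for text in pages:
        counts.update(description_languages(text, known_codes))
    return counts


sample = "{{Information\n|description={{en|1=A cat}}{{de|Eine Katze}}\n|source={{own}}\n}}"
counts = count_by_language([sample, "{{en|Another file}}"], {"en", "de", "fr"})
```

Each page contributes at most one count per language here; whether the metric should count files or individual description strings is left open by the ticket.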