A proposed metric facet for the State of Languages Metrics is Wikimedia Commons coverage, which would include the following metrics:
Overall coverage:
- Number of languages with captions on Commons (this ticket)
- Number of languages with descriptions on Commons (see T374279)
Per language:
- Status of Commons captions in the language (e.g., present, absent) (this ticket)
- Status of Commons descriptions in the language (e.g., present, absent) (see T374279)
- Number of captions in the language (this ticket)
- Number of descriptions in the language (see T374279)
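
As a rough illustration of how the caption metrics above could be derived once per-language caption counts exist, here is a minimal Python sketch. The function name `coverage_metrics`, the output keys, and the choice to count only languages in a supplied list are assumptions for illustration, not a decided design.

```python
def coverage_metrics(caption_counts, languages):
    """Derive the proposed caption metrics from per-language counts.

    caption_counts: mapping of language code -> number of captions (assumed input).
    languages: language codes to report on, e.g. wiki project languages (assumed input).
    """
    per_language = {}
    for code in languages:
        n = caption_counts.get(code, 0)
        per_language[code] = {
            # Status is "present" when at least one caption exists in the language.
            "status": "present" if n > 0 else "absent",
            "captions": n,
        }
    # Overall coverage: how many of the reported languages have any caption.
    with_captions = sum(
        1 for row in per_language.values() if row["status"] == "present"
    )
    return {"languages_with_captions": with_captions, "per_language": per_language}
```

The same shape would apply to descriptions (T374279) with a different counts input.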
Tasks:
- Build a notebook to aggregate caption language counts from the Commons data dump
  - Wrangle the data
  - Standardize language codes for joining with wiki project languages
- Build notebook(s) for metrics calculation and visualization
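
The aggregation and language-code standardization tasks could be sketched roughly as follows. This assumes the dump follows the Wikibase JSON layout (one entity per line inside a JSON array, with captions stored under each entity's `labels`); that should be verified against the actual dump. The names `normalize_code` and `count_captions`, and the alias mapping shown in the usage note, are hypothetical.

```python
import json
from collections import Counter


def normalize_code(code, aliases=None):
    """Lowercase, trim, and hyphenate a language code, then apply optional aliases."""
    cleaned = code.strip().lower().replace("_", "-")
    # `aliases` is a caller-supplied mapping (hypothetical) to wiki project codes.
    return (aliases or {}).get(cleaned, cleaned)


def count_captions(lines, aliases=None):
    """Count captions per language from an iterable of dump lines."""
    counts = Counter()
    for line in lines:
        # Strip the trailing comma and skip the array brackets and blank lines.
        text = line.strip().rstrip(",")
        if text in ("", "[", "]"):
            continue
        entity = json.loads(text)
        # Treat a missing or empty `labels` value as "no captions".
        labels = entity.get("labels") or {}
        for code, label in labels.items():
            if label.get("value"):
                counts[normalize_code(code, aliases)] += 1
    return counts
```

In a notebook this might be called as `count_captions(open_dump(), aliases={"be-x-old": "be-tarask"})`, where `open_dump` stands in for whatever reads the decompressed dump line by line and the alias entry is only an example of the kind of mapping needed for the join.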