A proposed metric facet for the State of Languages Metrics is script coverage, to include the following metrics:
- Number of world's languages with script(s) supported on Wikimedia projects
- Number of scripts represented across hosted content projects
- Number of scripts represented across hosted and pre-hosted (i.e. test) content projects
Currently, no structured data exists connecting our public scripts data with Wikimedia projects.
Tasks:
- Build notebook to
- scrape public scripts data
- join with public ISO 15925 data
- wrangle for use in metrics calculations
- Build a notebooks for metrics calculation and visualization