As a PM of Wikidata Analytics @Manuel needs to know what proportion of Wikidata is a //“knowledge-base of general statements about the world”//?
**Indicator**
- Does an Item have a link to an other Wikimedia project?
- i.e. We will be looking into the number of Sitelinks per item
**Notes on analytical systems that might be of help here **
- We track some data on Sitelinks on Grafana: https://grafana.wikimedia.org/d/000000167/wikidata-datamodel?orgId=1&refresh=30m
- The Wikidata Analytics Portal has [[ https://wikidata-analytics.wmcloud.org/app/WD_percentUsageDashboard | Wikidata Usage and Coverage Dashboard ]]
- The Wikidata Analytics Portal has [[ https://wikidata-analytics.wmcloud.org/app/WDCM_SitelinksDashboard | WDCM (S)itelinks Dashboard ]]
**Segments**
- all Items excluding astronomical data and citation data
- astronomical data
- citation data
**Hypotheses**
- The different Wikimedia projects do not really overlap so in sum Wikidata consists mainly of general-purpose information (e.g. https://iccl.inf.tu-dresden.de/web/Wikidata/Maps-06-2015/en)
**The fundamental dataset for this task**
- Rows: Wikidata Classes
- Columns: Wikimedia projects
- Cells: number of items in that class that link to the Wikimedia project in the respective column
- Comment: this is essentially a distribution of the number of Sitelinks across items in per class
- Additional:
- Number of items per class
- Number of items with Sitelinks per class
- Total number of Sitelinks per class
- Proportions (% of items that have a Sitelink towards a specific project)