Notes from discussion with Lydia!
- **Number of entities by entity type**
- Can probably be done by just counting pages in each namespace
- Count properties with the SPARQL query below (all other entities are items)
- SELECT (count(?cs) as ?count) WHERE { ?cs a wikibase:Property }
- **Number of redirects by entity type**
- As above, redirects are recorded in the page table and we can filter by namespace
- In SPARQL, look for ?x owl:sameAs ?y to find redirects
- **Number of Item talk pages**
- **Average blob size of items**
- Easy db query
- **Max blob size of items**
- Easy db query
- **Number of properties by datatype**
- Use the wb_property_info table
- **Number of statements by datatype**
- Could probably run a SPARQL query for each property?
- **Number of ranks by type?**
- Can be done in SPARQL
- SELECT (count(distinct(?s)) AS ?scount) WHERE {?s wikibase:rank wikibase:PreferredRank}
- For normal rank we just assume: normal rank = statement count - preferred rank - deprecated rank
- **Number of uses of NoValue and SomeValue in mainsnaks?**
- **Number of qualifiers**
- ??????????????????????????????????????????
- **References to Wikipedia**
- SELECT (count(distinct(?s)) AS ?scount) WHERE {?s prov:wasDerivedFrom wdref:004ec6fbee857649acdbdbad4f97b2c8571df97b}
- **Referenced statements**
- ??????????????????????????????????????????
- **Number of labels, descriptions & aliases per lang**
- Can be done using the wb_terms table
- **Number of sitelinks per site**
- Can be done using wb_items_per_site
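
The rank counting idea above could be sketched roughly as follows. This is only an illustration under assumptions: the helper names `rank_count_query` and `normal_rank_count` are made up for this note, the query text simply parameterises the SPARQL already quoted above, and it relies on the query service predefining the `wikibase:` prefix.

```python
# Hypothetical helpers; none of these names come from Wikibase itself.
RANKS = ("PreferredRank", "DeprecatedRank", "NormalRank")


def rank_count_query(rank: str) -> str:
    """Build the SPARQL count query from the notes for one rank value."""
    if rank not in RANKS:
        raise ValueError(f"unknown rank: {rank}")
    return (
        "SELECT (count(distinct(?s)) AS ?scount) "
        f"WHERE {{?s wikibase:rank wikibase:{rank}}}"
    )


def normal_rank_count(statement_count: int, preferred: int, deprecated: int) -> int:
    """Derive the normal rank count by subtraction, as assumed in the notes."""
    normal = statement_count - preferred - deprecated
    if normal < 0:
        raise ValueError("preferred + deprecated exceeds the statement count")
    return normal
```

For example, `normal_rank_count(100, 7, 3)` gives 90. The subtraction is only as accurate as the three input counts, so all of them should be taken from the same snapshot of the data.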
**More complex / long? :/**
- **Count grouped by Number of statements per item**
- **Count grouped by Number of labels, descriptions & aliases per item**
- Should be possible through the wb_terms table
- **Count grouped by Number of sitelinks per item**
- Should be possible through the wb_items_per_site table
- **Count grouped by Number of qualifiers per statement?**
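
As a rough sketch of the "count grouped by number of sitelinks per item" idea, the query below first counts rows per item in wb_items_per_site and then counts how many items share each total. It runs against an in-memory SQLite stand-in with made-up rows, not the real database; the function names are hypothetical, and the column names (`ips_item_id`, `ips_site_id`, `ips_site_page`) are assumed to match the production table.

```python
import sqlite3

# Inner query: sitelinks per item. Outer query: how many items have each total.
HISTOGRAM_SQL = """
SELECT sitelinks, COUNT(*) AS items
FROM (
    SELECT ips_item_id, COUNT(*) AS sitelinks
    FROM wb_items_per_site
    GROUP BY ips_item_id
) AS per_item
GROUP BY sitelinks
ORDER BY sitelinks
"""


def demo_connection() -> sqlite3.Connection:
    """Create a tiny stand-in table with invented rows (not real data)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE wb_items_per_site "
        "(ips_item_id INTEGER, ips_site_id TEXT, ips_site_page TEXT)"
    )
    rows = [
        (1, "enwiki", "A"), (1, "dewiki", "A"),
        (2, "enwiki", "B"),
        (3, "frwiki", "C"), (3, "enwiki", "C"),
        (4, "enwiki", "D"),
    ]
    conn.executemany("INSERT INTO wb_items_per_site VALUES (?, ?, ?)", rows)
    return conn


def sitelink_histogram(conn: sqlite3.Connection) -> list:
    """Return (sitelinks, items) pairs, ordered by number of sitelinks."""
    return conn.execute(HISTOGRAM_SQL).fetchall()
```

Items without any sitelinks have no rows in the table, so this approach produces no bucket for zero; that count would have to come from the total number of items minus the items seen here.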