Page MenuHomePhabricator

Better understand the makeup of specific Wikidata object types that probably can't be dropped
Open, Needs TriagePublic

Description

As a user, I want to better understand the size of crucial elements of Wikidata that can't be dropped in the case of catastrophic data loss, so I can make informed decisions about data curation where universally dropping a data type is not possible.

What is the data size/proportion, connectedness for:

  • All Properties
  • All sitelinks
  • All classifying statements (aka the ontology) - this would be all statements using the Property "subclass of", "instance of", "part of", "has part" or "parent taxon"
  • All humans