The number of unreferenced statements in Wikidata should decrease. One way to do that is to automatically add references to existing statements based on schema.org markup. @Addshore has written scripts to start this work. These scripts should be expanded.
Existing work:
- https://addshore.com/2015/12/wikidata-references-from-microdata
- https://github.com/addwiki/wikimedia-commands/tree/master/src/WikidataReferencer
What needs to be done next:
- Expand it to more types (Right now it only supports people and a few others.)
- Expand the kind of information it can extract for each type
- Take into account the links in the item itself as well as external IDs (So far it only considers the links found in the Wikipedia articles connected to the item.)
- Potentially create an auto-populated blacklist of sites that don't contain microdata, in order to decrease the number of sites that need to be checked (Right now every linked page is checked, even if its domain has already been checked unsuccessfully several times.)
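
A rough sketch of how the auto-populated blacklist could work, written in Python purely for illustration and independent of the existing scripts. The class name `DomainBlacklist`, its methods, and the `max_failures` threshold are all hypothetical. The assumption is that a domain gets skipped once a set number of its pages have been checked without finding microdata, and that a single successful page keeps the domain permanently eligible.

```python
from urllib.parse import urlparse


class DomainBlacklist:
    """Hypothetical helper: skip domains that repeatedly yield no microdata."""

    def __init__(self, max_failures=3):
        # Assumed threshold: failed checks allowed before a domain is skipped.
        self.max_failures = max_failures
        self.failures = {}          # domain -> count of pages without microdata
        self.has_microdata = set()  # domains where microdata was found at least once

    @staticmethod
    def domain_of(url):
        # Normalise to a lower-case hostname; empty string if the URL has none.
        return (urlparse(url).hostname or "").lower()

    def should_check(self, url):
        """Return True if the page is still worth fetching."""
        domain = self.domain_of(url)
        if domain in self.has_microdata:
            return True
        return self.failures.get(domain, 0) < self.max_failures

    def record(self, url, found_microdata):
        """Store the outcome of checking one page."""
        domain = self.domain_of(url)
        if found_microdata:
            self.has_microdata.add(domain)
            self.failures.pop(domain, None)
        elif domain not in self.has_microdata:
            self.failures[domain] = self.failures.get(domain, 0) + 1


blacklist = DomainBlacklist(max_failures=2)
blacklist.record("https://example.org/a", found_microdata=False)
blacklist.record("https://example.org/b", found_microdata=False)
skip_example_org = not blacklist.should_check("https://example.org/c")
```

Counting per domain rather than per URL is what saves requests, but it is a trade-off: a site that only marks up some of its pages could be blacklisted too early, so the threshold would probably need tuning against real data.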