Use absolute URIs in DCAT-AP document
Open, Low, Public

Description

The description of the Wikidata dumps, given in RDF as DCAT-AP, only refers to the Wikimedia Foundation by name. I created a patch to use an additional URI.
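To illustrate the difference between naming a publisher and identifying it, here is a minimal sketch in Python that emits N-Triples by hand. It is not the code from the patch. The catalog URI is a hypothetical placeholder, and the use of `dct:publisher` with a `foaf:Agent` follows common DCAT-AP practice rather than anything stated in this task. The Wikimedia Foundation URI is the Wikidata entity URI named in the related change.

```python
# Well-known vocabulary URIs.
DCT_PUBLISHER = "http://purl.org/dc/terms/publisher"
FOAF_NAME = "http://xmlns.com/foaf/0.1/name"
FOAF_AGENT = "http://xmlns.com/foaf/0.1/Agent"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"

# Wikidata entity URI for the Wikimedia Foundation.
WMF = "http://www.wikidata.org/entity/Q180"

# Hypothetical catalog URI, used only for illustration.
CATALOG = "https://example.org/wikidata-dumps/catalog"


def publisher_triples(subject, publisher_uri, name):
    """Return N-Triples lines that identify a publisher by URI and by name.

    The name is written as a plain literal without escaping, so it must not
    contain quotes, backslashes, or line breaks.
    """
    return [
        f"<{subject}> <{DCT_PUBLISHER}> <{publisher_uri}> .",
        f"<{publisher_uri}> <{RDF_TYPE}> <{FOAF_AGENT}> .",
        f'<{publisher_uri}> <{FOAF_NAME}> "{name}" .',
    ]


lines = publisher_triples(CATALOG, WMF, "Wikimedia Foundation")
print("\n".join(lines))
```

With a URI as the object of `dct:publisher`, consumers can match the publisher against other datasets without comparing name strings.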

Moreover, there should be a link between the described dataset and Wikidata as a project (http://www.wikidata.org/entity/Q2013). The data model is as follows:

dcat:Catalog (list of Wikidata dumps)
  dcat:dataset
    dcat:Dataset (Wikidata dump of a specific date)
      dcat:distribution
        dcat:Distribution (particular dump file)

A dcat:Dataset could be connected to the Wikidata URI via http://www.wikidata.org/entity/P629 (edition or translation of), because each Wikidata dump is a published edition of Wikidata. Shall I create another pull request to implement this, or is there a better way to model the connection?
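The proposed model can be sketched as a small set of triples. The following Python example is only an illustration under stated assumptions: the base URI, the dump date, and the file name are hypothetical, and the predicate is the P629 URI exactly as given above. It builds the catalog, dataset, and distribution chain and adds the link from the dataset to Wikidata.

```python
DCAT = "http://www.w3.org/ns/dcat#"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"

# URIs taken from the task description.
WIKIDATA = "http://www.wikidata.org/entity/Q2013"
EDITION_OF = "http://www.wikidata.org/entity/P629"

# Hypothetical base URI, used only for illustration.
BASE = "https://example.org/wikidata-dumps/"


def dump_triples(date, files):
    """Build (subject, predicate, object) triples for one dump date."""
    catalog = BASE + "catalog"
    dataset = f"{BASE}dataset/{date}"
    triples = [
        (catalog, RDF_TYPE, DCAT + "Catalog"),
        (catalog, DCAT + "dataset", dataset),
        (dataset, RDF_TYPE, DCAT + "Dataset"),
        # Proposed link: each dump is a published edition of Wikidata.
        (dataset, EDITION_OF, WIKIDATA),
    ]
    for name in files:
        distribution = f"{dataset}/{name}"
        triples.append((dataset, DCAT + "distribution", distribution))
        triples.append((distribution, RDF_TYPE, DCAT + "Distribution"))
    return triples


def to_ntriples(triples):
    """Serialize triples whose terms are all URIs as N-Triples."""
    return "\n".join(f"<{s}> <{p}> <{o}> ." for s, p, o in triples)


# Hypothetical dump date and file name.
triples = dump_triples("20180101", ["all.json.gz"])
print(to_ntriples(triples))
```

Wikidata's own RDF export states direct claims with predicates from the http://www.wikidata.org/prop/direct/ namespace, so http://www.wikidata.org/prop/direct/P629 may be worth considering as the predicate. The sketch keeps the URI from the proposal.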

Event Timeline

Change 386366 had a related patch set uploaded (by Zoranzoki21; owner: JakobVoss):
[operations/dumps/dcat@master] Identify publisher with URI

https://gerrit.wikimedia.org/r/386366

Change 424288 had a related patch set uploaded (by Lokal Profil; owner: Lokal Profil):
[operations/puppet@production] Identify publisher with URI
A publisher should be identified by a URI. For the Wikimedia Foundation we can use the Wikidata entity URI http://www.wikidata.org/entity/Q180.

https://gerrit.wikimedia.org/r/424288

@nichtich So I just spotted that this was never merged. I've updated your patch and made a companion patch for the config.

Feel free to make more patches for the other missing URIs, and I'll try to follow up on them more quickly.