Create RDF export for structured data stored for files
Closed, Resolved (Public)

Description

Structured data on Commons should have an RDF representation, just as the data on Wikidata does. We need to write code that produces an RDF representation of that data.

The URL should be like this:
https://commons.wikimedia.org/wiki/Special:EntityData/M77666670.ttl (just as the JSON one works now).
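
As a rough illustration of the URL scheme above, here is a minimal Python sketch that builds the Special:EntityData URL for a MediaInfo ID and, optionally, downloads it. The function names, the ID check, and the User-Agent string are assumptions made for this example; only the URL pattern itself comes from this task.

```python
import re
import urllib.request

# Base URL taken from the example in this task.
ENTITY_DATA_BASE = "https://commons.wikimedia.org/wiki/Special:EntityData/"


def entity_data_url(entity_id: str, fmt: str = "ttl") -> str:
    """Build the Special:EntityData URL for a MediaInfo entity.

    Assumption: MediaInfo IDs look like "M" followed by a positive integer.
    """
    if not re.fullmatch(r"M[1-9][0-9]*", entity_id):
        raise ValueError(f"not a MediaInfo ID: {entity_id!r}")
    return f"{ENTITY_DATA_BASE}{entity_id}.{fmt}"


def fetch_entity_data(entity_id: str, fmt: str = "ttl", timeout: float = 10.0) -> str:
    """Download the serialized entity (hypothetical helper, needs network access)."""
    request = urllib.request.Request(
        entity_data_url(entity_id, fmt),
        # Placeholder User-Agent; replace with something identifying your tool.
        headers={"User-Agent": "entitydata-example/0.1"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read().decode("utf-8")


if __name__ == "__main__":
    print(entity_data_url("M77666670"))          # Turtle
    print(entity_data_url("M77666670", "json"))  # JSON, which already works
```

Calling `fetch_entity_data("M77666670")` would return the Turtle text once the export is enabled; what it contains depends on the final RDF mapping.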

One idea of what the RDF could look like is here: https://www.mediawiki.org/wiki/User:Tpt/WikibaseMediaInfo_RDF_Dump_Format

Open questions:

  • What should the main prefix be? wd: is Wikidata, so we can't use that. Maybe sdc:?
  • How do we ensure that the URLs used by the Wikidata commonsMedia type match what we use here? Right now commonsMedia uses the Special:FilePath URL, which is different from the sitelink. Should we add another triple? Which URL should we use for schema:contentUrl: the current full URL, or the FilePath redirect?
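
To make the second question concrete, the sketch below builds both candidate URLs for one file name. It assumes the usual MediaWiki conventions (spaces become underscores, and the upload path is derived from the MD5 hash of the underscored name) and does not handle title normalization such as first-letter capitalization. The function names and the https scheme are choices made for this example, not something decided in this task.

```python
import hashlib
from urllib.parse import quote


def normalize_file_name(name: str) -> str:
    """Replace spaces with underscores (assumed title convention)."""
    return name.strip().replace(" ", "_")


def file_path_url(name: str) -> str:
    """Special:FilePath redirect URL, the form commonsMedia values use."""
    title = normalize_file_name(name)
    return "https://commons.wikimedia.org/wiki/Special:FilePath/" + quote(title)


def full_upload_url(name: str) -> str:
    """Direct file URL, assuming the hashed upload directory layout."""
    title = normalize_file_name(name)
    digest = hashlib.md5(title.encode("utf-8")).hexdigest()
    # The layout uses the first hex digit, then the first two hex digits.
    return (
        "https://upload.wikimedia.org/wikipedia/commons/"
        f"{digest[0]}/{digest[:2]}/{quote(title)}"
    )


if __name__ == "__main__":
    for candidate in (file_path_url, full_upload_url):
        print(candidate.__name__, candidate("My file.jpg"))
```

The FilePath form stays stable if the storage layout changes, while the full URL points directly at the file; whichever one schema:contentUrl uses, a query joining against Wikidata commonsMedia values needs the two sides to agree.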

Related Objects

Status      Subtype   Assigned    Task
Declined              dchen
Open                  None
Open                  None
Duplicate             None
Open        Feature   None
Open        Feature   None
Duplicate             None
Resolved              None
Resolved              None
Resolved              None
Duplicate             None
Resolved              ArielGlenn
Resolved              Smalyshev
Resolved              Smalyshev
Resolved              Smalyshev
Resolved              Smalyshev
Resolved              Gehel
Resolved              Smalyshev

Event Timeline

Smalyshev triaged this task as Medium priority.
Smalyshev moved this task from Backlog to Doing on the User-Smalyshev board.

Change 507627 had a related patch set uploaded (by Smalyshev; owner: Smalyshev):
[mediawiki/extensions/WikibaseMediaInfo@master] RDF export of MediaInfo entities

https://gerrit.wikimedia.org/r/507627

Change 507627 merged by jenkins-bot:
[mediawiki/extensions/WikibaseMediaInfo@master] RDF export of MediaInfo entities

https://gerrit.wikimedia.org/r/507627

Change 520078 had a related patch set uploaded (by Smalyshev; owner: Smalyshev):
[operations/mediawiki-config@master] Enable RDF output for MediaInfo

https://gerrit.wikimedia.org/r/520078

Change 520078 merged by jenkins-bot:
[operations/mediawiki-config@master] Enable RDF output for MediaInfo

https://gerrit.wikimedia.org/r/520078

Mentioned in SAL (#wikimedia-operations) [2019-07-08T18:04:14Z] <urbanecm@deploy1001> Synchronized wmf-config/InitialiseSettings.php: SWAT: [[:gerrit:520078|Enable RDF output for MediaInfo]] (T221916) (duration: 00m 49s)

Change 522141 had a related patch set uploaded (by Smalyshev; owner: Tpt):
[mediawiki/extensions/WikibaseMediaInfo@master] Adds some RDF triples from File metadata

https://gerrit.wikimedia.org/r/522141

I think this is done, since the export now exists and is enabled; we can tweak the specifics in follow-up tasks.

Change 522141 merged by jenkins-bot:
[mediawiki/extensions/WikibaseMediaInfo@master] Adds some RDF triples from File metadata

https://gerrit.wikimedia.org/r/522141

Change 533077 had a related patch set uploaded (by Tpt; owner: Tpt):
[mediawiki/extensions/WikibaseMediaInfo@master] Adds more File metadata to RDF output

https://gerrit.wikimedia.org/r/533077

Change 533077 merged by jenkins-bot:
[mediawiki/extensions/WikibaseMediaInfo@master] Adds more File metadata to RDF output

https://gerrit.wikimedia.org/r/533077

Hi,

Is this export (dump) available for download somewhere? Is that planned? Should I file a separate ticket for it?

Thank you
D063520