[Story] switch default rdf format to full (include statements)
Closed, ResolvedPublic
Actions

Assigned To

Authored By

	Lydia_Pintscher
	Jun 9 2015, 2:37 PM

Description

Currently the RDF output that includes statements is only available via a format switch: https://www.wikidata.org/wiki/Special:EntityData/Q42.ttl?flavor=full
It should be the default.

Details

	Subject	Repo	Branch	Lines +/-
	Set the default flavor to full in EntityDataSerializationService	mediawiki/extensions/Wikibase	master	+3 -4

Customize query in gerrit

Related Objects
Search...

View Standalone Graph

This task is connected to more than 200 other tasks. Only direct parents and subtasks are shown here. Use View Standalone Graph to show more of the graph.

Status	Assigned	Task
		· · ·
Open	None	T44063 [Epic] Provide a plain linked data interface for accessing entities
Resolved	hoo	T101837 [Story] switch default rdf format to full (include statements)
Open	None	T50143 Implement complete RDF mapping for entities (tracking)
		· · ·

Event Timeline

Lydia_Pintscher created this task.Jun 9 2015, 2:37 PM

Lydia_Pintscher raised the priority of this task from to High.

Lydia_Pintscher updated the task description. (Show Details)

Lydia_Pintscher added projects: Wikidata, MediaWiki-extensions-WikibaseRepository.

Lydia_Pintscher added subscribers: Lydia_Pintscher, daniel.

Restricted Application added a subscriber: Aklapper. · View Herald TranscriptJun 9 2015, 2:37 PM

Lydia_Pintscher added a subtask: T50143: Implement complete RDF mapping for entities (tracking).Jun 9 2015, 2:38 PM

Lydia_Pintscher added a parent task: T44063: [Epic] Provide a plain linked data interface for accessing entities.

Hi Lydia (and all), it's Chiara from the BBC here. I post here to allow everyone working on this issue to comment back to me. To make it brief, any idea of when this task is going to be implemented? Thanks!

Lydia_Pintscher moved this task from incoming to ready to go on the Wikidata board.Jun 10 2015, 6:35 AM

We're working on the blockers but it's not clear when we'll finish them, sorry.

We want to have the mapping mostly stable when we turn this on per default. For this, it would be very helpful to get feedback about the RDF data you can already get with flavor=full.

Thanks Daniel, my colleague Alex and I will come up with some tests to evaluate the RDF data. Is there any question you might want to investigate?

Jimkont subscribed.Jul 9 2015, 8:27 PM

Dear all,
I've just sent an email with our comments to Lydia, thanks for your patience!
Cheers,
Chiara

Lydia_Pintscher renamed this task from switch default rdf format to full (include statements) to [Story] switch default rdf format to full (include statements).Aug 13 2015, 8:11 PM

Lydia_Pintscher set Security to None.

Rybesh subscribed.Aug 21 2015, 2:31 PM

Smalyshev subscribed.Sep 9 2015, 6:30 AM

Lydia_Pintscher moved this task from ready to go to consider for next sprint on the Wikidata board.Sep 9 2015, 6:46 AM

One the mailing list, Stas brought up the question "which RDF" should be delivered by the linked data URIs by default. Our dumps contain data in multiple encodings (simple and complex), and the PHP code can create several variants of RDF based on parameters now.

I think the default should be to simply return all data that is in the dumps. This would address the BBC's use case of building a linked data crawler that fetches live data rather than using dumps. Such a crawler would not have any way to specify which part of RDF is needed, since linked data is such an extremely simple, parameter-free API.

Dump format however does not contain the data on the referenced entities (due to the fact that dump has all entities anyway, so no reason to repeat), while full one does. Not sure if that fits the use case mentioned or not.

Data on the referenced entities does not have to be included as long as one can get this data by resolving these entities' URIs. However, some basic data (ontology header, license information) should be in each single entity export.

I believe that "stub" data on referenced entities should be included per default, for convenience. That's also how the feature was originally speced with Denny.

Including more data (within reason) will not be a problem (other than a performance/bandwidth problem for your servers).

However, if there are further ideas and small improvements that will take time to implement, it would be good to switch to "dump" as the default right now. It is already a big improvement over the current (statement-free) default. Further improvements can then be done in small steps.

JanZerebecki added a project: Story.Sep 25 2015, 8:27 PM

Lydia_Pintscher added a project: Wikidata-Sprint-2015-09-29.Sep 28 2015, 1:55 PM

Tobi_WMDE_SW moved this task from Proposed to Backlog on the Wikidata-Sprint-2015-09-29 board.Sep 29 2015, 1:30 PM

Lydia_Pintscher moved this task from consider for next sprint to in progress on the Wikidata board.Sep 29 2015, 3:27 PM

Change 242492 had a related patch set uploaded (by Hoo man):
Set the default flavor to full in EntityDataSerializationService

https://gerrit.wikimedia.org/r/242492

gerritbot added a project: Patch-For-Review.Sep 30 2015, 8:58 AM

hoo moved this task from Backlog to Review on the Wikidata-Sprint-2015-09-29 board.Sep 30 2015, 9:05 AM

@mkroetzsch IIRC, the "dump" mode does not include information about referenced entities, which makes it inconvenient for third parties. And it doesn't resolve redirects, which violates the same-as semantics. "dump" mode should really only be used for dumps.

Change 242492 merged by jenkins-bot:
Set the default flavor to full in EntityDataSerializationService

https://gerrit.wikimedia.org/r/242492

hoo mentioned this in rEWBAf1a6ac2398dd: Set the default flavor to full in EntityDataSerializationService.Sep 30 2015, 9:43 AM

Diffusion mentioned this in rMEXT89721a8e0c6f: Updated mediawiki/extensions Project: mediawiki/extensions/Wikibase….Sep 30 2015, 9:44 AM

hoo closed this task as Resolved.Sep 30 2015, 9:46 AM

hoo claimed this task.

hoo removed a project: Patch-For-Review.

Please note that this is not going to be deployed before October 14 (but possibly later).

hoo moved this task from Review to Done on the Wikidata-Sprint-2015-09-29 board.Sep 30 2015, 10:02 AM

Smalyshev awarded a token.Sep 30 2015, 12:51 PM

[Story] switch default rdf format to full (include statements)Closed, ResolvedPublicActions

Description

Details

Related ObjectsSearch...

Event Timeline

[Story] switch default rdf format to full (include statements)
Closed, ResolvedPublic
Actions

Related Objects
Search...