
Wikidata qid of articles is not present in export/dump
Open, Needs Triage, Public

Description

When exporting articles from a Wikipedia (through [[Special:Export]]), or in the XML dump, there is no mention of the Wikidata qid corresponding to each article.

It would be great to have it added, both to save the extra API queries needed to look it up and to avoid having to download a full Wikidata dump on top of the Wikipedia one(s).
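
For context, the per-article API lookup that this request would make unnecessary can be sketched roughly as follows. This is a minimal illustration, not part of the original report: it assumes English Wikipedia as the target wiki, uses the `pageprops` query module with the `wikibase_item` property, and the helper names `build_qid_query` and `extract_qids` are hypothetical. The network call itself is left out; only the request URL and the response parsing are shown.

```python
from urllib.parse import urlencode

# Assumption: English Wikipedia; any other wiki's api.php endpoint works the same way.
API_URL = "https://en.wikipedia.org/w/api.php"


def build_qid_query(titles):
    """Build the URL of a pageprops query asking for the Wikidata item of each title."""
    params = {
        "action": "query",
        "prop": "pageprops",
        "ppprop": "wikibase_item",   # page property that holds the QID
        "titles": "|".join(titles),  # several titles can be batched into one request
        "format": "json",
        "formatversion": "2",        # version 2 returns "pages" as a list
    }
    return API_URL + "?" + urlencode(params)


def extract_qids(response):
    """Map each page title in a decoded API response to its QID, or None if absent."""
    pages = response.get("query", {}).get("pages", [])
    return {
        page["title"]: page.get("pageprops", {}).get("wikibase_item")
        for page in pages
    }
```

Since titles have to be sent in batches, a wiki with millions of articles still needs a very large number of requests, which is the cost this task is trying to avoid.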

Event Timeline

TTO added a subscriber: TTO.

I suppose the WikibaseClient would need to attach to one of the hooks in XmlDumpWriter or something similar, like XmlDumpWriterOpenPage.

Restricted Application added a project: Wikidata. · Jun 13 2018, 10:42 AM
Vvjjkkii renamed this task from Wikidata qid of articles is not present in export/dump to l4aaaaaaaa. · Jul 1 2018, 1:04 AM
Vvjjkkii triaged this task as High priority.
Vvjjkkii updated the task description.
ArielGlenn renamed this task from l4aaaaaaaa to Wikidata qid of articles is not present in export/dump. · Jul 2 2018, 12:30 PM
ArielGlenn raised the priority of this task from High to Needs Triage.
ArielGlenn updated the task description.

I worry that this will make the dumps that much slower. There ought to be some sort of entity dump for Wikidata that could be used in tandem with the regular dumps. Adding @hoo for comments.
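
To make the "in tandem" idea concrete, here is a rough sketch of how a consumer might join the two data sources on their side. It assumes the Wikidata entities are available as decoded JSON objects with an `id` and a `sitelinks` map keyed by site ID (as in the Wikidata JSON entity dumps), and that the Wikipedia export is small enough to parse in memory. The function names `qid_map_from_entities` and `pages_with_qids` are hypothetical, and a full dump would need streaming parsing rather than `ET.fromstring`.

```python
import xml.etree.ElementTree as ET


def qid_map_from_entities(entities, site="enwiki"):
    """Build a title -> QID map from decoded Wikidata entity objects.

    Assumption: each entity looks like
    {"id": "Q42", "sitelinks": {"enwiki": {"title": "Douglas Adams"}}}.
    """
    mapping = {}
    for entity in entities:
        link = entity.get("sitelinks", {}).get(site)
        if link is not None:
            mapping[link["title"]] = entity["id"]
    return mapping


def _local(tag):
    """Strip the XML namespace, which differs between export schema versions."""
    return tag.rsplit("}", 1)[-1]


def pages_with_qids(xml_text, title_to_qid):
    """Yield (title, qid) for every <page> in an export; qid is None if unmapped."""
    root = ET.fromstring(xml_text)
    for elem in root.iter():
        if _local(elem.tag) != "page":
            continue
        title = next((c.text for c in elem if _local(c.tag) == "title"), None)
        yield title, title_to_qid.get(title)
```

The join is keyed on page title, so it only works reliably when both dumps were taken at about the same time; pages renamed between the two snapshots would come back with no QID.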