Page MenuHomePhabricator

[Story] Provide test data for development installations
Open, LowPublic

Description

We should provide a set of test data for people to play with. This should include:

EDIT: struck through as these are out of date....

  • An XML dump of all properties from wikidata.org
  • An XML dump of items about chemical elements (plus a few associated items)
  • An XML dump of wikidata's sites table

These should come with instructions about how to import these, and some configuration snippets needed for setting up the wiki in a way that is compatible with the dumps (items in the main namespace, etc)

Event Timeline

daniel raised the priority of this task from to Low.
daniel updated the task description. (Show Details)
daniel added projects: Wikidata, Documentation.
daniel added a subscriber: daniel.

I would make this a maintenance script that pulls the items via the wikidata api.

if there are id conflicts for the properties, they can be renumbered and any usages of the properties (or referenced items) can be remapped during "import" (via api)

actually have done this to have all wikidata properties in my dev wiki (and I think some items), although it was a hacky script and should be made into something nice.

and sites table contents should be available via api, although populateSitesTable script generally works fine as well to replicate the contents of wikidata's sites table.

For generating the dump, it would be nice to have a script that can generate a dump based on a list of entity IDs, automatically pulling in all referenced entities (one level, not recursively).

@aude Is it possible to get that script you wrote? A hacky version that works is better than nothing so far.

I would like to have the properties XML dump :)

aude set Security to None.

https://github.com/filbertkm/WikibaseImport is an initial version so far.

my approach is to fetch a list of properties from the wikidata (or other wikibase api) and import them via maintenance script. property and item ids get renumbered (as needed) and tracked, so links from one entity to another can work.

it doesn't import site links yet (because badge items need to be imported) nor stuff like qualifiers.

and i'll be making the code a bit nicer and add tests. also, want to put this on gerrit.

it might be possible to make a xml or json dump mode, but not sure it's necessary yet.

Jonas renamed this task from Provide test data for development installations to [Story] Provide test data for development installations.Sep 10 2015, 1:29 PM
Addshore updated the task description. (Show Details)