This script will be useful to generate lots of data so that we can run some performance tests and get some measurements re different implementations of anything we have performance questions on. Also remember to allow high degree of duplication in term texts across languages, types and entity type.
|mediawiki/extensions/Wikibase : master||Add random entities and terms generator maintenance script to repo.|
|Open||None||T208425 [EPIC] Kill the wb_terms table|
|Open||ArielGlenn||T226167 audit public tables and make sure we dump them all|
|Open||None||T219175 [Mega] - Migrate data from wb_terms to new schema|
|Resolved||Addshore||T219120 [Checkpoint 1] Create Schema, Migration plan and Doctrine DBAL connection|
|Resolved||• alaa_wmde||T220210 Create a script to generate lots of Items/Properties with lots of Terms|
Yesterday while thinking about design stuff I randomly realized that we might not need a script like this. Can't we just use https://github.com/Wikidata/WikibaseImport to important a bunch of real entities? If that is too slow, then perhaps we can use https://github.com/JeroenDeDauw/Replicator to import JSON dumps.
WikibaseImport contains a limited number of items and properties
What does this mean? I thought WikibaseImport gets items and properties from Wikidata. How does it contain a limited number?
@JeroenDeDauw I'm testing WikbaseImport in the meanwhile .. it isn't really limited technically and one seem to be able to import all properties and entities in given ranges (from it's readme) .. though it seems to be: 1) quite slow, and 2) importing everything (incl. statements and linked entities) with no way to disable it (adding to the slowness) and 3) requires a separate wikibase instance to import from (a dependency that one might not want to have, esp. locally).
This script generates entities with only terms attached to them (no statements yet, but could be added later with an option, say --with-statements). Those generated random entities can be used for stress tests, and maybe as fixtures for integration tests.