Investigation: Measure load times when the complete lexeme data is loaded to display lexeme in the statement
Closed, Resolved (Public)

Assigned To

Authored By

	WMDE-leszek
	Feb 23 2018, 4:14 PM

Description

Experiment plan:

Create 500 lexemes (with random lemmas etc.)
- each lexeme should contain 3-5 forms (with random representations)
- each lexeme should have between 5 and 100 statements, each referencing some of those 500 lexemes (self-references should probably be limited)
For each lexeme, do the following:
- do action=purge on its page
- start the timer
- load the page
- stop the timer

The test result would be the page load time per lexeme statement count class (e.g. when referencing 1-3 lexemes per page the load takes x secs, when referencing 10-15 lexemes it takes y secs).
If it seems to make the test more "reliable", the experiment could be repeated and the average of the results reported.
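The purge/time/load loop above could be sketched roughly as follows. This is only an illustration, not the script that was actually used: the `api.php` path, the helper names (`purge`, `fetch_html`, `timed_load`, `average_load`) and the default of 3 repeats are assumptions. The purge and fetch steps are passed in as callables so the timing logic can be tried without touching a live wiki.

```python
import time
import urllib.parse
import urllib.request
from statistics import mean

# Test wiki mentioned in this task; the /api.php location is an assumption.
BASE = "https://wikidata-lexeme.wmflabs.org"


def purge(title: str) -> None:
    """POST action=purge for the page so the next load is not served from cache."""
    data = urllib.parse.urlencode(
        {"action": "purge", "titles": title, "format": "json"}
    ).encode()
    with urllib.request.urlopen(f"{BASE}/api.php", data=data) as resp:
        resp.read()


def fetch_html(title: str) -> bytes:
    """Fetch only the page HTML (no JS or other assets)."""
    url = f"{BASE}/index.php/{urllib.parse.quote(title, safe='/:')}"
    with urllib.request.urlopen(url) as resp:
        return resp.read()


def timed_load(title, purge_fn=purge, fetch_fn=fetch_html, clock=time.perf_counter):
    """Purge first, then time a single page load; returns elapsed seconds."""
    purge_fn(title)
    start = clock()
    fetch_fn(title)
    return clock() - start


def average_load(title, repeats=3, **kwargs):
    """Repeat the purge + timed load and report the mean, as suggested above."""
    return mean(timed_load(title, **kwargs) for _ in range(repeats))
```

Something like `timed_load("Lexeme:L546")` would then time a single lexeme page, and `average_load` covers the "repeat and average" variant.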

Related Objects

Status	Assigned	Task
Open	None	T194253 Configure the CI job that runs WikibaseLexeme's browser tests against test wikidata
Resolved	Addshore	T168260 Deploy WikibaseLexeme extension on Wikimedia cluster
Resolved	Addshore	T191457 Deploy WikibaseLexeme on www.wikidata.org
Resolved	Addshore	T191458 Deploy WikibaseLexeme on test.wikidata.org
Resolved	Lydia_Pintscher	T168263 WikibaseLexeme functional baseline
Resolved	Lydia_Pintscher	T175030 labels and descriptions for Lexemes for display in listings and search (derived labels, virtual labels)
Resolved	Lydia_Pintscher	T184997 Representation of a Lexeme in a statement value (L)
Resolved	WMDE-leszek	T188108 Investigation: Measure load times when the complete lexeme data is loaded to display lexeme in the statement
Resolved	None	T187316 Create a formatter for displaying a Lexeme as a statement value

Event Timeline

WMDE-leszek triaged this task as Medium priority. Feb 23 2018, 4:14 PM

WMDE-leszek created this task.

Lydia_Pintscher moved this task from incoming to in progress on the Wikidata board. Feb 24 2018, 5:22 PM

WMDE-leszek mentioned this in T187775: Investigation: Constraints for a database schema to store representations of a Lexeme. Feb 26 2018, 9:11 AM

daniel added a subtask: T187316: Create a formatter for displaying a Lexeme as a statement value. Feb 26 2018, 2:53 PM

Lydia_Pintscher closed subtask T187316: Create a formatter for displaying a Lexeme as a statement value as Resolved. Feb 27 2018, 1:23 PM

WMDE-leszek added a project: Wikidata-Sprint-2018-02-28. Feb 27 2018, 1:41 PM

WMDE-leszek mentioned this in T187323: Store data needed for presenting the lexeme in the index(es) allowing efficient lookup.

The actual test went as follows (it diverged slightly from the description above, but the test set is still fine IMO):

473 lexemes
each of them having between 1 and 28 statements referencing lexemes
each of them having between 3 and 5 random functions.

Bonus: created one lexeme with 1000 statements as an edge case: https://wikidata-lexeme.wmflabs.org/index.php/Lexeme:L546.

Measured page load times for each of the above test lexemes. Results are collected in the table below.
Only the time to get the HTML was measured, i.e. the time needed to load JS was not measured (it does not seem related to the issue at hand, nor does it seem to have a significant performance impact).

Statements on Lexeme	Metric	Load time
1-28	AVG	0.747
1-28	MIN	0.548
1-28	MAX	1.925
1000	single load	4.947

For the record, the full data (load time for each lexeme, including its size etc) is published as P6755
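For anyone wanting to break the per-lexeme data down by statement count class, as the task description proposed, a minimal sketch could look like this. The function name `summarize`, the input shape (pairs of statement count and load time) and the class boundaries are assumptions for illustration; the numbers in the usage note are made up and are not taken from P6755.

```python
from statistics import mean


def summarize(samples, classes):
    """Group (statement_count, load_time) samples into inclusive count classes.

    Returns {(low, high): {"avg": ..., "min": ..., "max": ...}} for every
    class that has at least one sample; samples outside all classes are ignored.
    """
    summary = {}
    for low, high in classes:
        times = [t for n, t in samples if low <= n <= high]
        if times:
            summary[(low, high)] = {
                "avg": mean(times),
                "min": min(times),
                "max": max(times),
            }
    return summary
```

For example, `summarize([(1, 0.5), (3, 1.5), (12, 2.0)], [(1, 3), (10, 15)])` gives one entry for the 1-3 class and one for the 10-15 class.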

Pinging @Jonas, @thiemowmde, @Lydia_Pintscher, @daniel to have a look at the results. It seems to me the current load times are acceptable. Of course, when the load on the system is higher (more users and more data), we would have to implement a more performant way of displaying lexemes (see T187323). For now the existing approach seems good enough to me.

WMDE-leszek updated the task description. Feb 28 2018, 1:17 PM

Looks good, thanks! Since this was brought up, here are some numbers from the Wikidata Item "Germany" for comparison:

622 statements
684 qualifiers
579 non-empty references, containing 1789 snaks (that's an average of 3 snaks per reference)
3095 value snaks, including qualifiers and references snaks

thiemowmde moved this task from Review to Done on the Wikidata-Sprint-2018-02-28 board. Mar 1 2018, 11:08 AM

WMDE-leszek closed this task as Resolved. Mar 2 2018, 9:01 AM
