Unify various wikidata description consumption
Open, Needs Triage, Public

Assigned To

None

Authored By

	Ladsgroup
	Sun, May 19, 10:20 PM

Description

Term lookups in Wikidata currently account for 15% of all of our database resources: https://performance.wikimedia.org/arclamp/svgs/daily/2024-05-18.excimer-wall.all.reversed.svgz?x=895.1&y=837 That is potentially up to half of s8. We can't reduce that to zero, as the lookups serve important functionality, but they can certainly be improved.

One major problem I see is that there are three major consumers of descriptions:

Mobile Frontend adding the tagline (via MobileFrontendHooks::onOutputPageParserOutput())
WikibaseClient adding the JSON-LD schema to pages via SkinAfterBottomScriptsHandler
A lot of API calls, mostly coming from the apps: https://github.com/wikimedia/apps-android-wikipedia/blob/d6e160120af5872c2620029a7392d737b9d3b160/app/src/main/java/org/wikipedia/dataclient/Service.kt#L44 and https://github.com/wikimedia/wikipedia-ios/blob/08f5693881a6d0fac23e649643f673a3fb4e724e/Wikipedia/Code/WMFSearchFetcher.m#L167
- Maybe search in new Vector does it too? If so, then we should definitely cache the value of descriptions in memcached.

We can deduplicate their work and reduce the load on our databases drastically. Assuming we only cut it in half, that's 25% of the replicas in s8, which can translate to a ~$20,000 cost reduction every year in hardware purchases alone. One simple fix is for WikibaseClient to put the description into the ParserOutput object and read it back from there, cached (which it doesn't currently do), and for MF to read from that value as well.
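A minimal sketch of that deduplication, written in Python rather than MediaWiki's PHP. Every name in it is hypothetical (`ParsedPage`, `fetch_description_from_db`, the `"description"` key), so treat it as an illustration of the idea rather than the actual hook code: the description is looked up once at parse time, stored on the cached parse result, and both consumers read the stored value instead of issuing their own term lookup.

```python
from dataclasses import dataclass, field

# Counts simulated term lookups so the saving is visible.
db_lookups = 0


def fetch_description_from_db(item_id: str, lang: str) -> str:
    """Hypothetical stand-in for the expensive term lookup against s8."""
    global db_lookups
    db_lookups += 1
    return f"description of {item_id} in {lang}"


@dataclass
class ParsedPage:
    """Hypothetical stand-in for a cached parser output with extension data."""
    item_id: str
    lang: str
    extension_data: dict = field(default_factory=dict)


def on_parse(page: ParsedPage) -> None:
    # Producer: runs once per parse; the value is cached along with the page.
    page.extension_data["description"] = fetch_description_from_db(
        page.item_id, page.lang
    )


def mobile_tagline(page: ParsedPage) -> str:
    # Consumer 1: reads the stored value, no database access.
    return page.extension_data["description"]


def json_ld_schema(page: ParsedPage) -> dict:
    # Consumer 2: reads the same stored value, again no database access.
    return {"@type": "Thing", "description": page.extension_data["description"]}


page = ParsedPage(item_id="Q42", lang="en")
on_parse(page)
tagline = mobile_tagline(page)
schema = json_ld_schema(page)
```

In this sketch the two consumers together cost one lookup per parse instead of one lookup each per page view. How the stored value would be invalidated when a description is edited is left out.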

Related Objects

Mentioned In: T362560: Find a way to get description information from Wikibase
Mentioned Here: T282170: Move "Short Descriptions" feature outside of main Wikibase.git code

Event Timeline

Ladsgroup created this task. Sun, May 19, 10:20 PM

Restricted Application added a project: Wikipedia-iOS-App-Backlog. Sun, May 19, 10:20 PM

Restricted Application added a subscriber: Aklapper.

Note: There are currently two types of description. One is the Wikibase/Wikidata item description; the other is a locally defined short description (which falls back to the Wikidata one). The latter currently lives in the Wikibase extension, but is planned to be moved to a dedicated extension independent of Wikibase (T282170: Move "Short Descriptions" feature outside of main Wikibase.git code).

Current usages of the description differ, and we may want to reconsider which one to use:

Mobile Frontend adding tagline: uses Wikibase description only
WikibaseClient adding JSON-LD schema to pages: uses Wikidata description only
New Vector: uses local description
API: has the option to choose between local and central description, local by default. Note this may need a breaking change when the local description is moved to a dedicated extension
iOS app: uses local description

Another thing to note is that the local (but currently not the central) description is stored as a page property on the local wiki, so we do not need requests to Wikidata to fetch it.
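To illustrate why the local description is cheaper, here is a rough Python sketch of the resolution order described above. The function name, the `"shortdesc"` page-property key and the `prefer` parameter are all made up for this example, and the behaviour when the central description is preferred (no fallback to local) is an assumption: when a local description is present in the page properties, the central lookup is never invoked.

```python
from typing import Callable, Optional


def resolve_description(
    page_props: dict,
    fetch_central: Callable[[], Optional[str]],
    prefer: str = "local",
) -> Optional[str]:
    """Pick a description; only call fetch_central when it is really needed."""
    if prefer == "local":
        # Hypothetical page-property key; read from the local wiki only.
        local = page_props.get("shortdesc")
        if local is not None:
            return local
    # Either the central description was requested, or there is no local
    # one and we fall back to it. This is the expensive Wikidata lookup.
    return fetch_central()


central_calls = []


def fetch_central_stub() -> Optional[str]:
    # Records each simulated Wikidata request.
    central_calls.append(1)
    return "central description"


with_local = resolve_description({"shortdesc": "local description"}, fetch_central_stub)
calls_after_local = len(central_calls)
without_local = resolve_description({}, fetch_central_stub)
```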

bvibber subscribed. Tue, May 21, 6:10 PM

Bugreporter mentioned this in T362560: Find a way to get description information from Wikibase. Wed, May 22, 2:41 PM
