Understand or mitigate duplicate ParserCache fetches in the same request
Open, MediumPublic
Actions

Assigned To

None

Authored By

	Krinkle
	Mar 19 2021, 1:03 AM

Description

Follows-up from T269593: ParserCache should use CachedBagOStuff

Details

	Subject	Repo	Branch	Lines +/-
	ParserCache: only use in-process caching for metadata	mediawiki/core	master	+53 -21

Customize query in gerrit

Related Objects

Mentioned Here: T269593: ParserCache should use CachedBagOStuff

Event Timeline

Krinkle created this task.Mar 19 2021, 1:03 AM

Restricted Application added a subscriber: Aklapper. · View Herald TranscriptMar 19 2021, 1:03 AM

In T269593#6914601, @gerritbot wrote:

Change 671357 merged by jenkins-bot:
[mediawiki/core@master] ParserCache: Instrument CachedBagOStuff to understand dupe fetches

https://gerrit.wikimedia.org/r/671357

Krinkle added subscribers: • Pchelolo, daniel.Mar 19 2021, 1:03 AM

The results are in, and quite volumous in Grafana: Production logging:

Screenshot 2021-03-19 at 01.04.46.png (1×1 px, 601 KB)

You can dig in on Logstash: mediawiki where there's 300,000 entries per hour.

The vast majority (295,000/hour) are idoptions keys, and happen predominently on regular page views, across all wikis:

Duplicate get(): "<some wiki>:pcache:idoptions:<some id>" fetched 2 times"

The second most common (2,500/hour) are idhash keys with canonical and responsiveimages=0 set.

Duplicate get(): "<some wiki>:pcache:idhash:<some id>-0!canonical!<sometimes a language>!responsiveimages=0" fetched 2 times

These seem to mostly happen on User_talk and File_talk pages for some reason. For example:

Duplicate get(): "commonswiki:pcache:idhash:41461911-0!canonical!responsiveimages=0" fetched 2 times

url: https://commons.wikimedia.org/wiki/File_talk:Cruiser_Mk2.jpg

(page ID 41461911)

Krinkle moved this task from Limbo to Watching on the Performance-Team (Radar) board.Mar 19 2021, 1:12 AM

I think I know the reason for idoptions.

During a normal page view, we reuse the cached parse metadata to construct the PoolCounter work key. That's needed cause we want the PoolCounter work key to differentiate by parser options just like ParserCache does, and the used options are stored in parser cache.

Then, when we actually do the pool work, it looks up the cache entry in the parser cache - that internally looks up metadata first.

So we basically always fetch from the metadata twice.

In T277829#6927502, @Pchelolo wrote:

I think I know the reason for idoptions.

During a normal page view, we reuse the cached parse metadata to construct the PoolCounter work key. That's needed cause we want the PoolCounter work key to differentiate by parser options just like ParserCache does, and the used options are stored in parser cache.

Then, when we actually do the pool work, it looks up the cache entry in the parser cache - that internally looks up metadata first.

So we basically always fetch from the metadata twice.

IIRC, by extension, this also explains why the parser output itself (idhash keys) also need to be fetched twice. A ParserCache miss will trigger a PoolCounter-protected parse, and once one of the threads has completed the work, waiting threads will re-fetch the parser output from the cache via getCachedWork(), which should, if everything went well, now be a hit rather than a miss.

Nikerabbit subscribed.Mar 20 2021, 10:48 AM

BPirkle triaged this task as Medium priority.Mar 23 2021, 8:14 PM

BPirkle edited projects, added Platform Team Workboards (Clinic Duty Team); removed Platform Engineering.

BPirkle moved this task from Inbox to Later on the Platform Team Workboards (Clinic Duty Team) board.

Krinkle moved this task from Backlog to ParserCache/ParserOutputAccess on the MediaWiki-Parser board.Apr 2 2021, 8:58 PM

Krinkle removed a project: MediaWiki-libs-BagOStuff.

In T269593#6914601, @gerritbot wrote:

Change 671357 merged by jenkins-bot:
[mediawiki/core@master] ParserCache: Instrument CachedBagOStuff to understand dupe fetches

https://gerrit.wikimedia.org/r/671357

In T277829#6927502, @Pchelolo wrote:

I think I know the reason for idoptions.

During a normal page view, we reuse the cached parse metadata to construct the PoolCounter work key. That's needed cause we want the PoolCounter work key to differentiate by parser options just like ParserCache does, and the used options are stored in parser cache.

Then, when we actually do the pool work, it looks up the cache entry in the parser cache - that internally looks up metadata first.

So we basically always fetch from the metadata twice.

It looks like CachedBagOStuff caches a cache-miss. Which, if I understand correctly, seems like it would break this PoolCounter logic as it would mean we can no longer do the "wait for lock and then fetch again" thing that this is supposed to do, right?

And this aspect of CachedBagOStuff appears to be intentional, because:

BagOStuff uses false to mean absence. In general, consumers use null, 0, or [] etc if they need to persist a bottom value.
CachedBagOStuff::get unconditionally calls Hash->set() after calling Backend->get(), so it will be calling set() for false.
CachedBagOStuff::get explicitly checks hashKey() and makes sure to short-circut and not check with the backend a second time,

Can you confirm that PoolWork is still working correctly for this use case?

The metadata seems like we can avoid a second fetch since the same business logic is responsible for both fetches and should be able to hold on to it and pass it in or something like that. Or alternatively, maybe the PoolWork side of things doesn't need to know about this at all and can perhaps let the ParserCache side of things worry about it e.g. by acknowleding the use case of "I call for this and if you tell me it's not there, trust that I will ask for it again". Or maybe we can make sure to use the wrapped one only for parser options and not the rest, if that would work correctly still as a way to declaratively remember it in the common case without any ad-hoc logic. But it might be easier to just enhance the code that performs the options fetch to have its own HashBag(maxKeys:2) object to consult.

Change 677299 had a related patch set uploaded (by Ppchelko; author: Ppchelko):

[mediawiki/core@master] ParserCache: only use in-process caching for metadata

https://gerrit.wikimedia.org/r/677299

gerritbot added a project: Patch-For-Review.Apr 6 2021, 4:13 PM

@Krinkle you're totally right, caching negatives is wrong in this case. The above patch implements in-process caching for metadata only.

Change 677299 merged by jenkins-bot:

[mediawiki/core@master] ParserCache: only use in-process caching for metadata

https://gerrit.wikimedia.org/r/677299

ReleaseTaggerBot added a project: MW-1.37-notes (1.37.0-wmf.1; 2021-04-13).Apr 7 2021, 1:00 AM

Maintenance_bot removed a project: Patch-For-Review.Apr 7 2021, 1:10 AM

Krinkle moved this task from Watching to Perf recommendation on the Performance-Team (Radar) board.Aug 18 2023, 8:05 PM

Krinkle edited projects, added Wikimedia-Performance-recommendation; removed Performance-Team (Radar).Aug 18 2023, 8:42 PM

	F34171563: Screenshot 2021-03-19 at 01.04.46.png
	Mar 19 2021, 1:12 AM

Understand or mitigate duplicate ParserCache fetches in the same requestOpen, MediumPublicActions

Description

Details

Related Objects

Event Timeline

Understand or mitigate duplicate ParserCache fetches in the same request
Open, MediumPublic
Actions