
Incomplete deepcategory search results despite no warning message
Open, Needs Triage | Public | 3 Estimated Story Points | BUG REPORT

Description

Steps to replicate the issue (include links if applicable):

What happens?:
The result count differs and there is no warning.

What should have happened instead?:
The result count should be the same, and if the search really cannot return everything, a warning about incomplete results should be displayed.

Software version (on Special:Version page; skip for WMF-hosted wikis like Wikipedia):

Other information (browser name/version, screenshots, etc.):

I mentioned this in T414763: Wikimedia Commons deepcategory searches return unexpected results for categories with spaces in name but it looks like a separate problem.

(The example is the query used to populate 2020s maps of the world in unidentified languages, which is how world maps, starting with the most relevant ones, get categorized by language. That makes translations easier and will hopefully lead to better search results eventually, ones that don't show maps in some niche language I can't read at the top when that language isn't among my configured language(s).)

Event Timeline

pfischer set the point value for this task to 3. Jan 26 2026, 4:46 PM

I finally had a chance to dig into this one. As far as I can tell, English-language SVG maps is not excluded in the first query, but is explicitly added as an exclusion in the second query. So the result discrepancy is likely due to this addition.

We can look at the query dump and see that English-language SVG maps is not among the categories excluded in the first query; only English-language SVG maps of the world and English-language SVG maps of the world by Our World in Data are. I tried to look around and understand whether this is the correct behaviour, and as far as I can tell it is.

The parent categories of English-language SVG maps are: SVG maps by language, English-language maps, English-language SVG
The parent categories of SVG maps by language are: SVG maps, Maps by language, SVG by language
The parent categories of English-language maps are: Maps by language, Legend in English
The parent categories of English-language SVG are: SVG by language, English text
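To make the parent chain easier to follow, here is a minimal Python sketch that walks upward from English-language SVG maps. It assumes the category names split as read off the four lines above, and it only knows the parent links listed there, so categories further up are treated as having no known parents. The names `PARENTS` and `ancestors` are hypothetical and are not part of CirrusSearch or any deepcat code.

```python
# Parent links as listed in the comment above (assumed split of the names).
# Categories that do not appear as keys have unknown parents in this sketch.
PARENTS = {
    "English-language SVG maps": [
        "SVG maps by language",
        "English-language maps",
        "English-language SVG",
    ],
    "SVG maps by language": ["SVG maps", "Maps by language", "SVG by language"],
    "English-language maps": ["Maps by language", "Legend in English"],
    "English-language SVG": ["SVG by language", "English text"],
}


def ancestors(category, parents=PARENTS):
    """Return every category reachable by following parent links upward."""
    seen = set()
    stack = [category]
    while stack:
        current = stack.pop()
        for parent in parents.get(current, []):
            if parent not in seen:  # shared parents are visited only once
                seen.add(parent)
                stack.append(parent)
    return seen


result = ancestors("English-language SVG maps")
for name in sorted(result):
    print(name)
```

A category can only be pulled in by a deepcat search on some root if that root shows up among its ancestors, so a walk like this is one way to sanity-check whether an exclusion is expected.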

I put together a self-contained .html page that will request the queries that are executed for two different deepcat queries and report on the differences in categories that will be included/excluded:
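The attached page is not reproduced here. As a rough illustration of the kind of comparison it reports, the following Python sketch diffs two exclusion lists. The lists are abridged and illustrative: they contain only the three categories named in this comment, and it is an assumption that the second query keeps the first query's two exclusions. The function name `diff_categories` is hypothetical.

```python
def diff_categories(first, second):
    """Report which categories appear in only one of the two lists."""
    first, second = set(first), set(second)
    return {
        "only_in_first": sorted(first - second),
        "only_in_second": sorted(second - first),
    }


# Abridged, illustrative exclusion lists based on the categories named above.
first_excluded = [
    "English-language SVG maps of the world",
    "English-language SVG maps of the world by Our World in Data",
]
# Assumption: the second query excludes the same two plus one more.
second_excluded = first_excluded + ["English-language SVG maps"]

report = diff_categories(first_excluded, second_excluded)
print(report)
```

With these inputs the only difference is English-language SVG maps, which matches the explanation given for the differing result counts.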

TJones subscribed.

@EBernhardson's analysis looks good and explains the discrepancy between the original queries (modulo the original incomplete result counts).