Page MenuHomePhabricator

Searching by category (deepcat) is broken
Open, Needs TriagePublicBUG REPORT

Description

Steps to replicate the issue (include links if applicable):

I have a number of search URLs bookmarked to assist in patrolling pending AfC drafts. Up until today they were working fine. For example: https://en.wikipedia.org/w/index.php?title=Special%3ASearch&limit=500&offset=0&ns118=1&sort=create_timestamp_desc&search=coverage+-%22routine+coverage%22+-%22significant+coverage%22+deepcat%3A%22Pending+AfC+submissions%22&sort=create_timestamp_desc&advancedSearch-current={%22fields%22%3A{%22deepcategory%22%3A[%22Pending+AfC+submissions%22]}}&searchToken=1gp5lziima6jhyyrq24ko37gj

Alternatively, enter the search term coverage -"routine coverage" -"significant coverage" and select the category "Pending AfC submissions" in Advanced Search -> Pages in these categories.

What happens?:
No results. For some queries, it will show results for an unrelated word that shares some letters and with deepcat removed (e.g. draft deepcat:"Pending AfC submissions" -> draw)

What should have happened instead?:
I should get multiple pages containing the word "coverage". To be clear, I have tried this with other search queries as well, such as the "draft" example I mentioned above (which should *definitely* return results).

Event Timeline

I opened a similar ticket for Commons (looks like this issue is across the board):
https://phabricator.wikimedia.org/T414763

Change #1228158 had a related patch set uploaded (by Shivaansh Singh; author: Shivaansh Singh):

[mediawiki/extensions/CirrusSearch@master] Fix T414859: Restore deepcat category matching for tokenized fields

https://gerrit.wikimedia.org/r/1228158

I have uploaded a patch regarding this task!

Change #1228158 merged by jenkins-bot:

[mediawiki/extensions/CirrusSearch@master] Repair deepcat matching against spaced titles

https://gerrit.wikimedia.org/r/1228158

Change #1229177 had a related patch set uploaded (by Ebernhardson; author: Ebernhardson):

[mediawiki/extensions/CirrusSearch@wmf/1.46.0-wmf.11] Repair deepcat matching against spaced titles

https://gerrit.wikimedia.org/r/1229177

Change #1229178 had a related patch set uploaded (by Ebernhardson; author: Ebernhardson):

[mediawiki/extensions/CirrusSearch@wmf/1.46.0-wmf.12] Repair deepcat matching against spaced titles

https://gerrit.wikimedia.org/r/1229178

Change #1229177 merged by jenkins-bot:

[mediawiki/extensions/CirrusSearch@wmf/1.46.0-wmf.11] Repair deepcat matching against spaced titles

https://gerrit.wikimedia.org/r/1229177

Change #1229178 merged by jenkins-bot:

[mediawiki/extensions/CirrusSearch@wmf/1.46.0-wmf.12] Repair deepcat matching against spaced titles

https://gerrit.wikimedia.org/r/1229178

Mentioned in SAL (#wikimedia-operations) [2026-01-20T21:07:00Z] <ebernhardson@deploy2002> Started scap sync-world: Backport for [[gerrit:1229177|Repair deepcat matching against spaced titles (T414859)]], [[gerrit:1229178|Repair deepcat matching against spaced titles (T414859)]]

Mentioned in SAL (#wikimedia-operations) [2026-01-20T21:09:05Z] <ebernhardson@deploy2002> ebernhardson: Backport for [[gerrit:1229177|Repair deepcat matching against spaced titles (T414859)]], [[gerrit:1229178|Repair deepcat matching against spaced titles (T414859)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there.

Mentioned in SAL (#wikimedia-operations) [2026-01-20T21:19:51Z] <ebernhardson@deploy2002> Finished scap sync-world: Backport for [[gerrit:1229177|Repair deepcat matching against spaced titles (T414859)]], [[gerrit:1229178|Repair deepcat matching against spaced titles (T414859)]] (duration: 12m 51s)

Example query from description now works as expected