Improve search index text for Schema pages
Closed, Resolved · Public

Description

Problem:
Currently, the search index text contains the complete JSON, including the hidden fields and all the labels. As a result, searching for schematext or shexc brings up every Schema.

Acceptance criteria:

  • search index for Schema pages contains only the text from the labels, descriptions, aliases and the Schema text
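
A minimal Python sketch of the criterion above, for illustration only. It is not the code from the Gerrit change, and the extension itself is not written in Python. The function name and the JSON key names (labels, descriptions, aliases, schemaText, serializationVersion, type) are assumptions about how a Schema might be serialized.

```python
import json


def build_search_index_text(raw_json: str) -> str:
    """Collect only the human-readable text from a serialized Schema.

    Assumed (hypothetical) layout: labels and descriptions map a language
    code to a string, aliases map a language code to a list of strings,
    and the Schema text lives under the key "schemaText".
    """
    data = json.loads(raw_json)
    parts = []

    # Labels and descriptions: one string per language.
    for field in ("labels", "descriptions"):
        parts.extend(data.get(field, {}).values())

    # Aliases: a list of strings per language.
    for alias_list in data.get("aliases", {}).values():
        parts.extend(alias_list)

    # The Schema text itself.
    schema_text = data.get("schemaText", "")
    if schema_text:
        parts.append(schema_text)

    # Key names and hidden fields are never copied into the result.
    return "\n".join(part for part in parts if part)


# Hypothetical Schema content, including hidden fields.
example = json.dumps({
    "labels": {"en": "Human"},
    "descriptions": {"en": "Schema for humans"},
    "aliases": {"en": ["person"]},
    "schemaText": "<human> { wdt:P31 [wd:Q5] }",
    "serializationVersion": "3.0",
    "type": "ShExC",
})
index_text = build_search_index_text(example)
```

With this input, the key name schemaText and the hidden value ShExC stay out of index_text, so a search for either term would no longer match every Schema.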

Event Timeline

Restricted Application added a subscriber: Aklapper.

Change 501534 had a related patch set uploaded (by Michael Große; owner: Michael Große):
[mediawiki/extensions/WikibaseSchema@master] Improve the search index text for Schemas

https://gerrit.wikimedia.org/r/501534

Change 501534 merged by jenkins-bot:
[mediawiki/extensions/WikibaseSchema@master] Improve the search index text for Schemas

https://gerrit.wikimedia.org/r/501534

Note for verification: This might need a rebuild of the search index, and it may only work correctly on newly edited pages.

Looks like regular Wikitext pages have the same problem: https://wikidata-shex.wmflabs.org/w/index.php?search=Änderung&ns1=1

I assume it’ll behave differently on Wikidata anyway, due to CirrusSearch 🤷