Page MenuHomePhabricator

Regex search in labels, descriptions and statements
Open, LowPublic

Description

Originally reported at https://www.wikidata.org/w/index.php?title=Wikidata:Project_chat&oldid=1060149338#Problem_with_Q-items_without_any_P31

It should be able to use regex when searching labels and statement values. e.g. inlabel:/(John|Jack)/ will match all items with labels containing word John or Jack. haswbstatement:P214=/@&~([0-9]+)/ will match all items with VIAF ID that is not number.

Event Timeline

Restricted Application added a subscriber: Aklapper. · View Herald TranscriptNov 23 2019, 10:46 PM
Bugreporter renamed this task from Regex search in labels and statements to Regex search in labels, descriptions and statements.Nov 23 2019, 10:54 PM
EBernhardson triaged this task as Low priority.Nov 25 2019, 5:16 PM
EBernhardson moved this task from needs triage to Wikidata Search on the Discovery-Search board.
EBernhardson added a subscriber: EBernhardson.EditedNov 25 2019, 5:25 PM

labels and descriptions are covered with the existing insource:// functionality, statements might be considered but wikidata already has indices that are almost too big to manage, and use more fields than elasticsearch supports. It's not likely we will be adding many more indexing methods to wikidata unless we end up buying a wikidata search cluster.

I suppose my best suggestion would be for wikidata to improve what is reported as the source text so that it covers the actual data in the item.

So9q added a subscriber: So9q.Nov 29 2019, 11:17 AM