=== Purpose ===
{T370416}
* Identify SPARQL queries that should ideally go to [[ https://www.wikidata.org/wiki/Wikidata:Data_access#Search | Elasticsearch ]] instead of WDQS.
=== Scope ===
* Identify WDQS SPARQL queries that only retrieve Items based on a simple statement (e.g. Which Item has IMDb ID "tt0133093"?)
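As a rough illustration of what "a simple statement" could mean in practice, the following sketch matches queries whose whole body is a single `?item wdt:Pnnn <value>` triple. The regular expression, the function name `match_simple_statement`, and the restriction to string literals and `wd:Q…` values are assumptions made for this example. Real WDQS logs will contain prefixes, label services, and formatting variations that this does not handle.

```python
import re
from typing import Optional, Tuple

# Hypothetical pattern: the entire WHERE body is one triple "?var wdt:Pnnn <value>",
# where <value> is either a quoted string literal or a wd:Q... Item.
_SIMPLE_QUERY = re.compile(
    r"""^\s*(?i:SELECT)\s+(?:(?i:DISTINCT)\s+)?(\?\w+)\s+   # single projected variable
        (?i:WHERE)\s*\{\s*
        (\?\w+)\s+wdt:(P\d+)\s+                             # subject variable, direct property
        (?:"([^"\\]*)"|wd:(Q\d+))                           # string literal or Item value
        \s*\.?\s*\}\s*
        (?:(?i:LIMIT)\s+\d+\s*)?$""",
    re.VERBOSE,
)


def match_simple_statement(query: str) -> Optional[Tuple[str, str]]:
    """Return (property, value) if the query is a single simple statement lookup."""
    m = _SIMPLE_QUERY.match(query)
    # The projected variable must be the subject of the triple.
    if m is None or m.group(1) != m.group(2):
        return None
    prop = m.group(3)
    value = m.group(4) if m.group(4) is not None else m.group(5)
    return prop, value


if __name__ == "__main__":
    print(match_simple_statement('SELECT ?item WHERE { ?item wdt:P345 "tt0133093" }'))
```

Anything that does not match returns `None`, so the sketch errs on the side of leaving queries with WDQS.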
=== Desired output ===
[ ] Initial simple identification algorithm that can be used in T370854 (and iteratively improved on later).
=== Notes ===
* With Elasticsearch for Wikidata, this should currently work for all properties with the "external identifier", "string", "item", "property", "lexeme", "form" and "sense" datatypes, except published in (P1433) and cites (P2860), which are currently omitted for performance reasons. Also, Elasticsearch cannot make use of the class hierarchy.
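To make the exclusions above concrete, here is a minimal sketch that turns a property and value into a `haswbstatement` search string and skips the two omitted properties. The names `EXCLUDED_PROPERTIES` and `to_haswbstatement` are hypothetical. The sketch does not check the property's datatype, which would need a separate lookup, and it assumes the value needs no extra quoting or escaping.

```python
from typing import Optional

# Properties the notes above list as currently omitted for performance reasons.
EXCLUDED_PROPERTIES = {"P1433", "P2860"}  # published in, cites


def to_haswbstatement(prop: str, value: str) -> Optional[str]:
    """Build a haswbstatement keyword, or return None for excluded properties.

    Assumption: the caller has already confirmed that the property has one of
    the supported datatypes; this function does not verify that.
    """
    if prop in EXCLUDED_PROPERTIES:
        return None
    return f"haswbstatement:{prop}={value}"


if __name__ == "__main__":
    print(to_haswbstatement("P345", "tt0133093"))
    print(to_haswbstatement("P2860", "Q42"))
```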
=== Open questions ===
* Should we only focus on the SPARQL equivalents of `haswbstatement`, or should we also go for `inlabel`, `wbstatementquantity`, `hasdescription`, or `haslabel`?
=== Urgency ===
When this task should be completed by. If this task is time sensitive then please make this clear. Please also provide the date when the output will be used if there is a specific meeting or event, for example.
DD.MM.YYYY
---
**Information below this point is filled out by the Wikidata Analytics team.**
== General Planning ==
Information is filled out by the analytics product manager.
== Assignee Planning ==
Information is filled out by the assignee of this task.
=== Estimation ===
Estimate:
Actual:
=== Sub Tasks ===
Full breakdown of the steps to complete this task:
[ ] subtask
=== Data to be used ===
See [Analytics/Data_Lake](https://wikitech.wikimedia.org/wiki/Analytics/Data_Lake) for the breakdown of the data lake databases and tables.
The following tables will be referenced in this task:
- link_to_table
=== Notes and Questions ===
Things that came up during the completion of this task, questions to be answered, and follow-up tasks:
- Note