As a first step towards building a benchmark dataset for search, we want to figure out which types of queries we should include. For example, we will want to include, both, keyword queries and natural language questions (see for example Questions vs. Queries in Informational Search Tasks). There are other many other potentially relevant groupings of queries such as the classic taxonomy of web search (navigational, informational, transactional) or whether they are closed/open-ended (Quriosity: Analyzing Human Questioning Behavior and Causal Inquiry through Curiosity-Driven Queries). It is important to identify the a set of relevant types of queries in order to make sure that the benchmark dataset will contain a representative sample.
The goal of this task is to identify a (small) set of query types that we believe are relevant for Wikipedia search. At the minimum, we will have 2 groups (keyword queries vs natural language queries). Ideally, we would like to align these types of queries with the different use-cases of Wikipedia readers from the Readers Foundational research.