Acceptance criteria:
- For the top 10 most frequently visited Wikipedias (by user pageviews) get 50 fulltext search session abandonments apiece (approach for sampling probably includes a mix of fulltext head queries and some form of random sampling)
- As a first step after figuring out sampling routine, generate queries from the abandoned sessions
- Target namespace 0 article searches
Probable deliverable: a structured file and its re-runnable Jupyter notebook