Page MenuHomePhabricator

Classify fulltext search abandonment: English, French, Spanish
Closed, ResolvedPublic13 Estimated Story Points

Description

It's not the first time, and it probably won't be the last: the goal of this task is to better understand the sources of fulltext search session abandonment, in hopes of finding opportunities for improvement where possible.

As a search stakeholder, when determining enhancement options it would be helpful to understand what things are possibly within the control of the search platform and which things are actually not easily treated by the search platform.

Currently, fulltext search session abandonment appears to hover near 48% on desktop.

Screenshot from 2024-09-24 14-42-39.png (633×1 px, 58 KB)

Acceptance criteria:

  • As this is exploratory in part, identify apparent component(s) leading to fulltext search session abandonments
  • From the samples, classify the component(s) of abandonment
  • If possible, indicate possible alternative search or presentation strategy (or strategies) that could be used (this may pertain to both zero results and non-zero results cases) (BONUS if there's any way to automate this)
  • Identify whether search could be satisfied with an external search engine / conversational agent with Wikipedia/Wikimedia content (be careful to not overwhelm them and beware of filter bubbles)
  • Identify whether search could be satisfied with an external search engine / conversational agent with not-Wikipedia/Wikimedia content (be careful to not overwhelm them and beware of filter bubbles)
  • Analyze and make a report
  • Document the approach
  • Describe potential next tickets to act on the data (or if data are fully inconclusive or nonactionable, describe why)
  • File the next task to pursue the next 7 top read languages

Some notes:

  • Commons, Wikidata, and other sister projects excluded due to different interaction patterns
  • The target for the analysis could be this task or a wiki page or both, and seems likely to be informed by an access controlled Jupyter notebook and hand-coding Sheet

Thinking out loud, possible components of this seemingly moderately high abandonment rate include:

  • Automata
  • Inadequate content for things that probably could realistically exist on the given Wikimedia project if notability criteria are someday met
  • Inadequate content for things that probably won't realistically ever exist on a given Wikimedia project given its content policies
  • Inadequate content for things that are likely already eligible for the given Wikimedia project (i.e., notability criteria are probably met and content policies would probably allow it, it's just that it hasn't been added yet)
  • Inadequate search terms
  • Overabundant search terms (e.g., natural language query too long)
  • Typing accidents
  • Copy-paste accidents
  • Bad spelling guesses
  • "Wrong keyboard" issue
  • Wrong language (e.g., searching for French on English Wikipedia)
  • "Non-encyclopedic" queries (e.g., advice, restaurant hours, food delivery, porn)
  • Referred search sessions that aren't same-site organic fulltext search (the traffic may be organic, or it may be organized activity)
  • Non-automata UA spurious calls
  • User power tools spawning lots of sessions likely to be less prone to clicks
  • Sister search results sidebar clicks (?)
  • Users who were actually satisfied by looking at the SERP. T375387: Include fulltext search results Page Previews of sufficient dwell time in Search Metrics dashboard intends to help identify one kind of "satisfied" but abandoned searcher behavior. Others can be harder to track, such as when search snippets satisfy search intent or when the UA configuration reduces well intentioned measurement instruments
  • Data collection that inflates the denominator of fulltext search (perhaps when a page is served but the user didn't/couldn't see the SERP; maybe there are also unanticipated redirects or some such thing)

There are probably more potential components at play, but those are some that came to mind.

There can be overlap between these components. And there can be "fixes" above and beyond current approach to mitigate these components sometimes, ranging from updating measurement/visualization approach to applying different search or presentation strategies. Some of the potential components are likely observable in "fulltext head queries", and some likely are not.

Event Timeline

dr0ptp4kt updated the task description. (Show Details)
dr0ptp4kt updated the task description. (Show Details)
dr0ptp4kt updated the task description. (Show Details)
dr0ptp4kt updated the task description. (Show Details)
dr0ptp4kt updated the task description. (Show Details)
dr0ptp4kt renamed this task from Classify fulltext search abandonment to Classify fulltext search abandonment: English, French, Spanish.Oct 1 2024, 3:56 PM
dr0ptp4kt set the point value for this task to 8.
dr0ptp4kt updated the task description. (Show Details)
TJones triaged this task as High priority.

Current write-up—including tags & stats for the English Wikipedia sample, and lots, lots more details—on MediaWiki.

I've finished the English analysis, which included coming up with an annotation scheme for grappling with the queries and the scant information we can glean from them. The full list of goals in this ticket is too ambitious: user intent and search "success" are, in general, notoriously hard to pin down, and in this data isn't always possible to discern the searcher's intent, let alone their reason for abandoning their query.

I struggled at first to even find any patterns in the data that I could identify and label, but retracing the search sessions helped me understand what searchers were probably thinking about and searching for. I settled on labelling fairly objective attributes of searches, suggestions, results, and searcher behavior.

There's no glaring, smack-you-in-the-face solution to query abandonment. Some features of abandoned queries and search sessions:

  • People are most often searching "well", that is, looking for something that looks like an article title—though questions, complex queries, etc. do certainly occur.
  • Sometimes the info they want just isn't in Wikipedia—there's no article on that thing. Sometimes, the best piece of info about a thing is on the page they are searching from (i.e., they are looking for more info, but there is no more info.)
  • Sometimes people ignore good autocomplete suggestions, which is sad when they then get poor fulltext results.
  • Unclear snippets can hide good results.
  • Google, without notability requirements or limits to encyclopedic content, often has more information.
    • Google's AI Overview does not help nearly as often its fulltext results. Mostly it helps on queries that look more like phrases or sentences than keywords.
    • I've noticed that DuckDuckGo has a smaller index than Google. I have not checked how often it gets results.

I've noted lots of potential improvements to search, autocomplete, snippets, and even query mining. Not all are directly related to abandoned queries. Some highlights:

  • Don't show DYM suggestion or results for DYM suggestion when there are no results!
  • Use page-specific stats for snippet selection.
  • Better formatting for lists in snippets.
  • Make section anchors work from autocomplete suggestions.
  • Create a "text-only" title index for the Go feature.
  • Think again about showing cross-language "round-trip" results.
  • Thesaurus!
  • Maybe mine abandoned queries—they seem to be a better source of mining for possible new articles or redirects than zero-results queries. (All the usual caveats about query mining still apply.)

I've finished reviewing the Spanish abandoned samples. Very broadly, the tags stats are roughly the same, with a few standouts:

  • Spanish searchers were more likely to search from an article page than the main or search page.
  • Spanish searchers were more likely to get no suggestions for their full query
  • Spanish searchers were more likely to abandon their query even though a page with the info they probably needed did exist
  • Google was less likely to have the answers they wanted

Observations and Ideas:

  • DYM suggestions in Spanish were more likely to look kinda dumb because they are ungrammatical, or they are essentially the same query because all that got changed was grammatical endings on words.
  • We could in theory do a little better surfacing better sister search results and results from the wikidata widget, but the math and/or UI is hard.

More details in the full write up (search for eswiki and Spanish or check the diffs).

TJones changed the point value for this task from 8 to 13.Apr 7 2025, 3:18 PM

Whew, done at last! I've finished the analysis of French, and did a super brief review of just the queries themselves for German, Italian, Portuguese, and Russian. (Our old friends the wrong-keyboard queries showed up in Russian!)

Abbreviated Summary

There's no obvious single smoking gun to explain why searchers abandon their fulltext queries, which is not really a surprise. English, Spanish, and French have very broadly similar patterns of queries and detectable searcher behavior. German, Italian, Portuguese, and Russian do not have obviously wildly different patterns of queries, though Russian does have more queries that are not just straightforward "human queries in the home language of the wiki".

There are a lot of well-formed queries by people who seem to understand how to search Wikipedia. Sometimes they get results, sometimes not. Some people don't know how to search particularly well, but sometimes these sub-optimal queries still get pretty good search results.

Our notion of "abandoning" a fulltext query may need some refinement, since our autocomplete is so good and handles so much of our search traffic. Some users clearly navigate away from "abandoned" fulltext queries by using autocomplete to take them to the next page (or several pages) of interest.

  • We might want to reconsider whether we should count a search session as abandoned if the searcher clicks on an autocomplete suggestion after a fulltext search.
  • If we look at abandoned queries again, we might want to try to get a more complete version of the searcher/reader's path through our pages.

General improvements to search quality, query handling, second-try searching, and presentation may help people find information more easily, but we are always going to have people just ending their wiki spree on the search page.


A few new thoughts:

  • Sometimes snippets can give "hints" to searchers who have typos or other problems in their queries, but it is happenstance and unreliable.
  • Wikidata (possibly via the Wikidata widget) is also really helpful when there is a lot of variation in a name. (Example of 19th century nobleman named Karl, Carl, Charles, Carlos, Carlo, Karel, or Karol, depending on the language... and most don't bother to list his name in his native language!)
    • What other cool ideas have been implemented as widgets or gadgets? French provides cross-language and cross-project links if you get zero results, for example!
  • Matching category titles (exact or near exact) might help for some category-like searches, e.g., Ukrainian musicians

More details in the full write up (search for frwiki and French, see the de/it/pt/ru section, and the summary.. or check the diffs).