Summarize what we know about the "zero results" queries
Closed, Duplicate · Public

Assigned To

None

Authored By

	• Jdouglas
	Jun 23 2015, 8:50 PM

Description

We'll be brainstorming how to reduce the "zero results" rate. We're probably going to think of lots of really good, interesting ideas, but we're shooting in the dark unless we have some insight into the queries that are currently yielding no results.

For this task, let's analyze the current data (i.e. what we used to come up with the 25% figure), and see what we can learn about queries that are failing to come up with results.

Stakeholder: The users who are currently failing to get search results
Benefit: Frame the team's planning for the next quarter
Estimate: Needs to be done before the brainstorming meeting

Related Objects

Mentioned In: T107035: Review elastic search logs for useful patterns of search activity that get no results.
T105184: Parallelize the theory-testing pipeline
Mentioned Here: T107035: Review elastic search logs for useful patterns of search activity that get no results.
T104505: EPIC: Improve results when users enter searches in other languages

Event Timeline

• Jdouglas created this task. Jun 23 2015, 8:50 PM

• Jdouglas raised the priority of this task to Needs Triage.

• Jdouglas updated the task description.

• Jdouglas added a project: CirrusSearch.

• Jdouglas subscribed.

Restricted Application added a project: Discovery-ARCHIVED. Jun 23 2015, 8:50 PM

Restricted Application added a subscriber: Aklapper.

• Jdouglas updated the task description. Jun 23 2015, 8:52 PM

• Jdouglas set Security to None.

• Jdouglas triaged this task as Unbreak Now! priority. Jun 23 2015, 8:54 PM

• Jdouglas added a project: Discovery-Search (Current work).

The zero-results logs have lots of prefix searches -- are these from typeahead searches run automatically from the search input box, or did folks search for these literal phrase fragments?

Jim_Carter subscribed. Jun 24 2015, 7:59 AM

In T103596#1394761, @Jdouglas wrote:

The zero-results logs have lots of prefix searches -- are these from typeahead searches run automatically from the search input box, or did folks search for these literal phrase fragments?

Prefix search is almost entirely typeahead. The requests come from Firefox plugins and from the site directly.

Did @Ironholds say something about analyzing these separately from the rest, and that both still came up with a 25% no-results rate?

Yeah, but it was a very ad-hoc job; we need to set up more structured and robust reporting before I'd rely on that number.
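For illustration only, here is a minimal sketch of the kind of split discussed above: computing the zero-results rate separately for prefix (typeahead) and full-text searches. The record layout `(search_type, hit_count)`, the labels `"prefix"` and `"full_text"`, and the sample rows are all assumptions made up for this sketch; they are not the actual CirrusSearch log format or real data.

```python
from collections import defaultdict


def zero_result_rates(records):
    """Return {search_type: fraction of queries that returned zero hits}.

    `records` is an iterable of (search_type, hit_count) pairs.
    This layout is hypothetical, not the real log schema.
    """
    totals = defaultdict(int)
    zeros = defaultdict(int)
    for search_type, hit_count in records:
        totals[search_type] += 1
        if hit_count == 0:
            zeros[search_type] += 1
    return {t: zeros[t] / totals[t] for t in totals}


# Hypothetical sample rows, invented for the example.
sample = [
    ("prefix", 0),
    ("prefix", 3),
    ("prefix", 7),
    ("full_text", 0),
    ("full_text", 12),
    ("full_text", 0),
]

rates = zero_result_rates(sample)
for search_type, rate in sorted(rates.items()):
    print(f"{search_type}: {rate:.1%} zero results")
```

Reporting the two rates side by side would show whether typeahead traffic is skewing the overall figure, which is the question raised in this thread.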

This would ordinarily be an Analysis task, but since Oliver's out, an engineer can do this one because we need it really soon!

• Deskana added a project: Discovery-Search (Current work). Jul 2 2015, 4:48 PM

I don't think anyone except Oliver currently has access to this data in a reasonable format.

• Jdouglas mentioned this in T105184: Parallelize the theory-testing pipeline. Jul 9 2015, 3:55 PM

• Deskana edited projects, added Discovery-Analysis (Current work); removed Discovery-Search (Current work), CirrusSearch. Jul 9 2015, 4:36 PM

TJones subscribed. Jul 9 2015, 4:39 PM

dcausse subscribed. Jul 10 2015, 3:51 PM

I'd like to have access to this data so I can start evaluating the performance of the Cybozu language detector on it (T104505).

Has the data been collected? (asked the newbie who didn't know.)

It would be really wonderful to have (appropriately anonymized, reasonably formatted) data sitting in a pile somewhere that everyone has access to. David and I were talking today, and it'd be great to do a quick-n-dirty name-detection pass over the query strings to see how many are likely names, for example, to gauge what kind of impact improved name indexing and searching might have on zero results.
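As a rough idea of what such a quick-n-dirty pass could look like, here is a small sketch that estimates the share of queries that look like personal names. The heuristic (two to four purely alphabetic tokens, with the first token found in a given-name list) and the tiny `GIVEN_NAMES` set are assumptions invented for this example; a real pass would need a much larger name list and would still miss many names.

```python
import re

# Hypothetical, tiny given-name list used only for this sketch.
GIVEN_NAMES = {"john", "maria", "david", "anna"}


def looks_like_name(query):
    """Crude heuristic: 2-4 alphabetic tokens, first one a known given name."""
    lowered = query.lower()
    # Runs of letters only (Unicode-aware); digits and punctuation are excluded.
    tokens = re.findall(r"[^\W\d_]+", lowered)
    if not 2 <= len(tokens) <= 4:
        return False
    # Reject queries that contained anything besides letters and whitespace.
    if " ".join(tokens) != " ".join(lowered.split()):
        return False
    return tokens[0] in GIVEN_NAMES


def name_share(queries):
    """Fraction of queries flagged as likely names (0.0 for an empty list)."""
    if not queries:
        return 0.0
    return sum(looks_like_name(q) for q in queries) / len(queries)


# Hypothetical zero-result queries, invented for the example.
sample_queries = ["John Smith", "history of rome", "maria 1999", "David  Lee"]
share = name_share(sample_queries)
print(f"{share:.0%} of sample queries look like names")
```

Even a crude count like this could give a first-order sense of whether improved name indexing is worth pursuing before investing in proper name detection.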

It hasn't, no. Or rather, we have a way of collecting it, but we're not using it consistently. If that's a desired thing, poke Dan; people have been asking for this a lot, but it's not highly prioritised.

I think this is a duplicate of T107035, which is where this analysis is actually taking place.

@Deskana, I created T107035 as a subtask of this one, since my analysis isn't necessarily the only analysis. I'm focusing mostly on the Wikipedia queries as a place to get started, and I wasn't sure what additional analysis this task would cover.

In T103596#1489544, @TJones wrote:

@Deskana, I created T107035 as a subtask of this one, since my analysis isn't necessarily the only analysis. I'm focusing mostly on the Wikipedia queries as a place to get started, and I wasn't sure what additional analysis this task would cover.

Not a problem. You're doing exactly the right thing. I think we can just merge this task into your task.

• Deskana closed this task as a duplicate of T107035: Review elastic search logs for useful patterns of search activity that get no results. Jul 28 2015, 10:20 PM

• Deskana removed a subtask: T107035: Review elastic search logs for useful patterns of search activity that get no results.

• Deskana mentioned this in T107035: Review elastic search logs for useful patterns of search activity that get no results.

• Deskana moved this task from Backlog to Done on the Discovery-Analysis (Current work) board. Aug 4 2015, 8:16 PM

• Deskana moved this task from Done to Resolved on the Discovery-Analysis (Current work) board. Aug 31 2015, 6:27 PM
