Evaluate DYM metrics available in current search satisfaction logging
Closed, ResolvedPublic
Actions

Assigned To

Authored By

	EBernhardson
	Jul 16 2019, 11:09 PM

Description

The search satisfaction schema records various information about DYM usage, but we've never used it for anything so don't have any metrics. Work with a sample of the data and see if we have useful metrics here.

The result of this ticket will be a jupyter notebook commited to the relforge repository to calculate and display metrics, and potentially minor patches to data collection as necessary.

Potential metrics. % of X refers to either per-search or per-session:

% of X shown a dym suggestion
% of X shown the search results of a dym suggestion
^ but excluding 'autorewrite'?
% of X shown a dym suggestion that clicked through to dym results
% of X shown dym results that clicked a result

Details

	Subject	Repo	Branch	Lines +/-
	Calculate DYM metrics for full text search	wikimedia/discovery/relevanceForge	master	+453 -0

Customize query in gerrit

Related Objects

Mentioned In: T229268: Build superset dashboard for search satisfaction did you mean metrics
Mentioned Here: T216058: Spike. Load search data into turnilo to test whether exploratory data can do away with some of the dashboards

Event Timeline

EBernhardson created this task.Jul 16 2019, 11:09 PM

Restricted Application added a subscriber: Aklapper. · View Herald TranscriptJul 16 2019, 11:09 PM

@dcausse @TJones Any other suggestions for metrics? For reference what I've put together so far is a transformation of the search satisfaction events into a simplified "dym search event" table. This table has one row per search performed and has a boolean indicating each of the following conditions. We can probably track more but this might be sufficient.

Is this query a suggested query?
Is DYM shown on top of SERP?
Was DYM clicked on top of SERP?
Were any results clicked?

Aklapper added a project: Discovery-ARCHIVED.Jul 17 2019, 10:55 AM

TJones updated the task description. (Show Details)Jul 17 2019, 1:59 PM

Looks good.

% of X shown dym results that interacted with the result list

Not 100% sure I get this—do you mean the user clicked on a DYM result?

^ but excluding 'autorewrite'?

It would definitely be nice to do one of three things: ignore autorewrite status, only include autorewrites, or exclude all autorewrites. (I'm curious to see how autorewrites compare to simple suggestions in terms of frequency, click through rates, etc.)

In T228226#5341102, @TJones wrote:

Looks good.

% of X shown dym results that interacted with the result list

Not 100% sure I get this—do you mean the user clicked on a DYM result

I mean that the user clicked on any result on the page, essentially a "success"

^ but excluding 'autorewrite'?

It would definitely be nice to do one of three things: ignore autorewrite status, only include autorewrites, or exclude all autorewrites. (I'm curious to see how autorewrites compare to simple suggestions in terms of frequency, click through rates, etc.)

ok i'll slice the metrics on that dimension as well

EBernhardson updated the task description. (Show Details)Jul 17 2019, 9:51 PM

EBernhardson claimed this task.Jul 20 2019, 12:22 AM

EBernhardson triaged this task as Medium priority.

EBernhardson added a project: Discovery-Search (Current work).

EBernhardson moved this task from Incoming to Needs review on the Discovery-Search (Current work) board.

Change 524814 had a related patch set uploaded (by EBernhardson; owner: EBernhardson):
[wikimedia/discovery/relevanceForge@master] Calculate DYM metrics for full text search

https://gerrit.wikimedia.org/r/524814

gerritbot added a project: Patch-For-Review.Jul 22 2019, 4:03 PM

Change 524814 merged by jenkins-bot:
[wikimedia/discovery/relevanceForge@master] Calculate DYM metrics for full text search

https://gerrit.wikimedia.org/r/524814

Maintenance_bot removed a project: Patch-For-Review.Jul 23 2019, 9:10 AM

EBernhardson moved this task from Needs review to Needs Reporting on the Discovery-Search (Current work) board.Jul 23 2019, 5:19 PM

Followup will be in T216058 to test import the backing data into druid and evaluate if one of the druid interfaces can visualize our metrics.

Metrics we should use moving forward

% of search shown a [auto / non-auto] dym
- Target: Increase % without significantly reducing the other metrics
% of people shown non-auto dym that click through to dym results
- Target: Increase % of clickthrough
% of searches shown dym search results [auto / non-auto] dym results that clicked a result
- Target: Increase % of clickthrough

debt closed this task as Resolved.Jul 24 2019, 5:32 PM

EBernhardson mentioned this in T229268: Build superset dashboard for search satisfaction did you mean metrics.Jul 29 2019, 8:18 PM

Evaluate DYM metrics available in current search satisfaction loggingClosed, ResolvedPublicActions

Description

Details

Related Objects

Event Timeline

Evaluate DYM metrics available in current search satisfaction logging
Closed, ResolvedPublic
Actions