Page MenuHomePhabricator

[SPIKE] Determine whether Reply Tool traffic warrants lowering sampling rate through wider deployment.
Closed, ResolvedPublic

Description

This task is about determining whether the number of events being emitted by the Reply Tool are high enough to warrant us lowering the sampling rate –currently set at 100% [i] – as it's scaled to be an opt-out feature at all Wikipedias (T262331).

We will couple what we learn in this task with what we learn in T284153 to determine how, and if, the DiscussionTools' sampling rate(s) need to be adjusted (T274471).

Question(s) this spike should answer

  • Do we anticipate the growth in Reply Tool traffic (read: the number of events it's emitting) to be high enough to warrant us lowering the sampling rate (currently set at 100% [i]) as it's scaled to be an opt-out feature at all Wikipedias (T262331)?

Done

  • Answers to all ===Open questions are documented in the task description
  • Followup tickets are created [if necessary]

i. Reference needed

Event Timeline

@ppelberg

Based on a review of daily traffic for the reply tool, I do not anticipate the increase in reply tool traffic to be high enough to warrant lowering the sampling rate. I recommend we can keep the current 100% reply tool sampling rate for wider deployment.

Reasoning: I reviewed the daily average number of reply tool init and page edit init events coming from wikis where the reply tool is currently deployed as opt-out. Even with oversampling, reply tool edits represent only about 4 to 6% of all init events logged on EditAttemptStep. I estimated a proportional increase in daily reply tool events based on the size (read: number of editors) of larger wikis and continued oversampling through wider deployment should not cause any significant increases in traffic logged on editattemptstep.

MNeisler triaged this task as Medium priority.
MNeisler moved this task from Next 2 weeks to Done on the Product-Analytics (Kanban) board.
MNeisler subscribed.

@ppelberg

Based on a review of daily traffic for the reply tool, I do not anticipate the increase in reply tool traffic to be high enough to warrant lowering the sampling rate. I recommend we can keep the current 100% reply tool sampling rate for wider deployment.

Understood and sounds good to me.

As such, I'm going to:

  1. Resolve this task
  2. De-prioritize T284153 considering we are going to continue sampling Reply and New Discussion Tool events at 100% until new information surfaces that causes us to reevaluate this approach.