Page MenuHomePhabricator

[SPIKE] Investigate volume of rejected Tone Checks
Closed, DeclinedPublic

Description

This task involves of the work of calculating how many "Improve Tone" edit suggestions we can expect to generate were we to use rejected Tone Checks as the backlog/basis for such a suggestion.

Background

In T397731, the Growth Team is exploring potential approaches for building a backlog of Improve Tone edit suggestions/suggested edits.

One such approach is building a backlog of rejected Tone Checks.

In addition to the engineering and design constraints that will inform the viability of such an approach, we also need to know how many Tone Checks are rejected and thereby the size of the queue we'd be building were we to pursue this approach. This last bit is what we will be investigating in this task.

Event Timeline

@ppelberg (+ @MNeisler?): I added T387918: [MILESTONE] Run an A/B test to evaluate impact of Tone Check as a parent to this because we (Growth) are blocked on crucial decisions on the outcome of this task.

Please let me know if there is anything we can do to support your team working on this and getting the scaffolding ready for the A/B test.

We're looking to learn both about the volume of the rejected Tone Checks, but also their quality. Would it be possible to record somehow also the paragraph (literal text) of the tone checks that were rejected / not acted upon?

Once the a/b test is running you'd be able to query recent changes for ones that're tagged with editcheck-tone ("this edit has tone issues") and editcheck-tone-shown ("the user was shown a tone check") which should mean edits which rejected a tone check. You'd need to do a bit of manual examination of the revision to guess the quality -- but most of them are going to be edits within a single paragraph so it'll be obvious.

Once the a/b test is running you'd be able to query recent changes for ones that're tagged with editcheck-tone ("this edit has tone issues") and editcheck-tone-shown ("the user was shown a tone check") which should mean edits which rejected a tone check. You'd need to do a bit of manual examination of the revision to guess the quality -- but most of them are going to be edits within a single paragraph so it'll be obvious.

That makes sense, thank you! Then we'll aim to be ready for this when the A/B tests starts.

@ppelberg (+ @MNeisler?): I added T387918: [MILESTONE] Run an A/B test to evaluate impact of Tone Check as a parent to this because we (Growth) are blocked on crucial decisions on the outcome of this task.

Thank you for letting us know, @Michael !

FYI: I made T387918 a sub-task of this one seeing as how, it seems to be, blocking y'alls (Growth) work her.

A long while back, we made the decision to get the backlog of tasks not from rejected Tone Checks in VE, but actively generated by the ML Team. So this task can be declined as the information that was supposed to be gathered in here is no longer relevant to us.

(Also removing T387918 as the subtask to declutter the task-graph of T396162: [EPIC] Revise Tone: Structured Task (WE1.1.2, FY25-26).)