This task involves conducting a usability test of the Peacock Check proof of concept, which integrates the first iteration of the peacock language detection model the ML Team is developing.
More broadly, this task supports WE 1.2.13:
If we conduct usability tests of an initial engineered version of Peacock Check with ≥10 newcomers and Junior Contributors and ≥80% of them describe the experience using terms like "helpful," "makes sense," and "clear," then we can be confident the proposed UX has the potential to lower the rate at which new content edits are reverted on the grounds of WP:WTW (and related policies).
Decision(s) to be made
Research Questions
- To what extent, if any, did people find the feedback Edit Check offered confusingly or unhelpfully generic?
- Context: this question responds to our decision that it is acceptable to flag non-neutral language even if the type of non-neutral language varies across cases. See Slack and comments from @jhsoby, @Strainu, and @matej_suchanek in T388215.