Page MenuHomePhabricator

How will communities configure Automoderator?
Closed, ResolvedPublicSpike

Description

We need to decide on the direction we will take to enable community administrators to configure Automoderator.

We may consider using Community Configuration, which is under active development by the Growth team. They have offered engineering support in Q3 if we go this route.

The kinds of things we expect communities to need to configure include:

  • Mode: Log-only or Reverting (toggle)
  • Caution level (number or selection)
    • Users should also be able to fine-tune the threshold by changing the number directly even if this is a selection in the UI.
  • Whether the bot should use the bot flag or not
  • What edit summary should be used when reverting
  • A link to a MediaWiki page to define the talk page message
  • How frequently the tool can revert the same editor on a page

User stories

  • As an administrator, I want to change Automoderator’s configuration, so that I can fine-tune the settings to reflect what behaviour my community is looking for.
  • As a patroller, I want to see what changes have been made to Automoderator's configuration, so that I can review how other editors changed the tool's behaviour in the past and have transparency into the tool.
  • As a patroller, I want to know when changes are made to Automoderator, so that I can understand changes in its behaviour.
  • As a patroller, I want to know what Automoderator’s current configuration is, so that I can understand how it will behave.

Technical
There are some notes in T345092, but we want to further think about:

  • Is Community Configuration a good fit?
    • In T349295, it was determined that the revert process will occur with a MediaWiki extension. Based on that premise, it seems like Community Configuration would be a good fit for Automoderator. These are the advantages of using Community Configuration:
      • We will be able to surface the relevant configuration options for admins to modify in different Wikis.
      • The configurations will have a history, making changes to them transparent for everyone.
      • As we discussed in the pre-mortem, having another user approve a configuration change is in the future work for Community Configuration, which for us is a mitigating factor against "purposefully harmful" configuration changes.
  • How will we access these configurations from the edit reverting component? (T349295)
    • Since the reverting component will be a MediaWiki extension, we should be able to access configurations easily.

Design goals - T349238
We want changes to Automoderator's configuration to be transparent, easily visible, and for there to be a clear log.

We may want to build novel illustrations or interactive elements into the configuration interface, for example to help users understand the impact of changing the caution level.

Determination

We have decided to use Community Configuration as a way for the Wikipedias to configure some of the Automoderator settings.

Event Timeline

Restricted Application changed the subtype of this task from "Task" to "Spike". · View Herald TranscriptOct 20 2023, 12:19 PM
Scardenasmolinar changed the task status from Open to In Progress.Nov 8 2023, 6:42 PM
Scardenasmolinar claimed this task.
Scardenasmolinar moved this task from Ready to In Progress on the Moderator-Tools-Team (Kanban) board.

@Scardenasmolinar - The Growth team is happy to meet and discuss any questions you may have about Community Configuration!

We don't have very thorough technical documentation yet, as the Community Configuration 2.0 project is still early on, but here's some info that might help: Community Configuration Product Requirements Document.

If you want to meet to chat through any technical questions, @Urbanecm_WMF or @Sgs could help.

Hi @KStoller-WMF, thanks for the help! While thinking about potential risks/failures of Automoderator, we found that if the revert risk range is changed drastically, the tool might be rendered useless or overly aggressive in removing edits. I see there is a potential feature (Feature 2) that could help mitigate that risk. Is there a plan to implement it, or are you still on the fence about it? We were also thinking it would be nice to be able to send an alert to admins when a configuration range changes drastically. Could that happen in the longer term?

@Scardenasmolinar do you think this would still be a problem if the config access would be limited to interface admins? For us on rowiki this works just fine for our revert bot, but we change the config very rarely.

While thinking about potential risks/failures of Automoderator, we found that if the revert risk range is changed drastically, the tool might be rendered useless or overly aggressive in removing edits.

That's definitely a legit concern. It's one of the risks we've discussed when conducting a Community Configuration pre-mortem. I don't think I have a perfect solution, but here are some ideas that might help:

  • As @Strainu mentions, you could consider limiting config access to a more priviledged user group. Currently the Growth team limits Configuration changes to admins. Anyone can view the configuration, but only admins can actually edit the form.
  • Admins (or any account holders) can watch the associated config pages, so there is some built-in oversight of changes. Example for English WIkipedia's GrowthExperimentsConfig.
  • We also plan to work with communities to set guidelines for Community Configuration use. For example, on larger wikis, there may be a norm in place that a discussion is started and consensus is reached before a configuration change is made. Admins may then add a link or explanation in their Edit summary to explain the change.
  • When setting up Automoderator configuration you might decide on which ranges are appropriate. Perhaps pre-determine some generally healthy ranges like rather than leave it totally open. Something like: High (catches the most vandalism, but will trigger more false positives), Medium, Low (will catch some vandalism and limit false positives).
  • The new Community configuration form designs are still underway, but will likely include options to link out to metrics that can help a community monitor and see the impact of a configuration change. The metrics / monitoring won't be built into Configuration, but for example, Growth's Suggested Edits configuration could link out to this metrics dashboard so communities have the ability to see current feature usage and the impact of a change after they make it.

Does that help?

While thinking about potential risks/failures of Automoderator, we found that if the revert risk range is changed drastically, the tool might be rendered useless or overly aggressive in removing edits.

That's definitely a legit concern. It's one of the risks we've discussed when conducting a Community Configuration pre-mortem. I don't think I have a perfect solution, but here are some ideas that might help:

  • As @Strainu mentions, you could consider limiting config access to a more priviledged user group. Currently the Growth team limits Configuration changes to admins. Anyone can view the configuration, but only admins can actually edit the form.
  • Admins (or any account holders) can watch the associated config pages, so there is some built-in oversight of changes. Example for English WIkipedia's GrowthExperimentsConfig.
  • We also plan to work with communities to set guidelines for Community Configuration use. For example, on larger wikis, there may be a norm in place that a discussion is started and consensus is reached before a configuration change is made. Admins may then add a link or explanation in their Edit summary to explain the change.
  • When setting up Automoderator configuration you might decide on which ranges are appropriate. Perhaps pre-determine some generally healthy ranges like rather than leave it totally open. Something like: High (catches the most vandalism, but will trigger more false positives), Medium, Low (will catch some vandalism and limit false positives).
  • The new Community configuration form designs are still underway, but will likely include options to link out to metrics that can help a community monitor and see the impact of a configuration change. The metrics / monitoring won't be built into Configuration, but for example, Growth's Suggested Edits configuration could link out to this metrics dashboard so communities have the ability to see current feature usage and the impact of a change after they make it.

Does that help?

Thanks for the ideas! Those sound great, including the ranges for the revert risk configuration. As we advance in building Automoderator, maybe we can touch base with the Growth team to see if Feature 2 might also be a possibility in a not-so-distant future.

Thinking about how this might work alongside an ores-enhanced recent changes page with language-agnostic-revert-risk, I'd suggest that we use a selection for caution level
We could align the language and values to ores when it's configured so that there we can leverage existing tooling for users to see the impact of changing the value. Want to see which edits would get caught at various settings? go to recent changes and filter by risk level. Here's an example of the filter options for another model that has multiple thresholds set:

image.png (289×631 px, 43 KB)
What do you think about this @aishwaryavardhana?

Thinking about how this might work alongside an ores-enhanced recent changes page with language-agnostic-revert-risk, I'd suggest that we use a selection for caution level
We could align the language and values to ores when it's configured so that there we can leverage existing tooling for users to see the impact of changing the value. Want to see which edits would get caught at various settings? go to recent changes and filter by risk level. Here's an example of the filter options for another model that has multiple thresholds set:

image.png (289×631 px, 43 KB)
What do you think about this @aishwaryavardhana?

This is a good point & nice idea. Do we know what the fate of the ORES extension is with regards to moving to revert risk for these scores (or not)?

EDIT: I see there's a Slack thread about the extension, so I've just caught up on that.