Page MenuHomePhabricator

Recommendation Equity: Best Practices
Closed, ResolvedPublic

Description

Develop best practices for data collection for measuring content equity in recommender systems based on the analyses I've conducted of SuggestedEdits, Newcomer Tasks, and various campaigns.

Related Objects

Event Timeline

Isaac moved this task from Backlog to In Progress on the Research board.
Isaac moved this task from In Progress to FY2021-22-Research-Oct-Dec on the Research board.
Isaac edited projects, added Research (FY2021-22-Research-Oct-Dec); removed Research.

Updates:

  • Spoke with MM and MW from Growth about my analysis of Newcomer Tasks but also the general framework that I use for evaluating recommender systems. MM very much appreciated the framework, in particular a few components;
    • Breaking down a recommender system into stages.
    • Callling out which stages are under whose 'control' -- e.g., this is where we can intervene, this part is on the editor side and we can only support, etc.
    • Relating the analyses back to the pipeline so they understand how equity shifts throughout it.

I will clean up the framework a bit to share more broadly but the broad stages are:

  1. Status Quo: Start with all (biased) content
  2. Algorithm Design: Filter down to just content "eligible" for recommendation
  3. Prioritization: Select individual pieces of content to recommend
  4. Impression: Editors see content
  5. Click-through: Editor choose whether or not to accept recommendation
  6. Edit: Editor does or does not make the edit
  7. Impact: What is cumulative effect of edits on content equity dimensions?

Updates:

Updates:

Updates: missed the update a while back that I discussed this work with MG and MR. Some good discussion came out of that, especially on the relationship between the Knowledge Gaps work and this work. We have a follow-up next week to discuss more, in particular what it means to have a "representative" sample of content on Wikipedia (sounds simple but quite difficult to define and very important question for a lot of our recommender systems)..

Update: reworking the framing around the slide deck right now based on feedback from MR/MG. Making explicit the choice around who to empower through our recommender systems and the role that different approaches might take. Plans to bring the recommendations back to Product in Q3. Next week I'll summarize the recommendations and then close out this task.

Summary of recommendations:

  • Short-term:
    • Empower editors: invest in more topic filters for users (Research / ML Platform / Product).
      • This means productionizing the language-agnostic topic models, a country-based geography model (already strong prototype available based on Wikidata), and potentially reworking the biography side of the topic models.
    • Measure impact: standardize evaluation of recommender systems for gender and geographic impact (Product Analytics)
      • This means having a process for keeping the gender/geography data up-to-date on HDFS so it's easy to join in with edit tag data for the various recommender systems
  • Long-term:
    • Empower organizers: connect campaigns ecosystem with recommender systems (Product)
      • I view this as the best long-term and most sustainable approach to aligning our recommender systems with Wikimedia's content equity goals.
    • Diversify editors: continue focus on recruiting and supporting greater geographic distribution of editors to address geographic / cultural gaps (WMF)
      • This is nothing new but the data clearly shows that editor geography matters as far as what content is improved