Recommendation Equity: Best Practices
Closed, ResolvedPublic

Assigned To

Authored By

	Isaac
	Oct 15 2021, 7:20 PM

Description

Develop best practices for data collection for measuring content equity in recommender systems based on the analyses I've conducted of SuggestedEdits, Newcomer Tasks, and various campaigns.

Related Objects

		Status	Subtype	Assigned	Task
		Open		Isaac	T293516 Recommender Systems + Content Equity
		Resolved		Isaac	T293517 Recommendation Equity: Best Practices

Event Timeline

Isaac created this task.Oct 15 2021, 7:20 PM

Isaac moved this task from Backlog to In Progress on the Research board.

Isaac moved this task from In Progress to FY2021-22-Research-Oct-Dec on the Research board.

Isaac edited projects, added Research (FY2021-22-Research-Oct-Dec); removed Research.

Updates:

Spoke with MM and MW from Growth about my analysis of Newcomer Tasks and also the general framework that I use for evaluating recommender systems. MM very much appreciated the framework, in particular a few components:
- Breaking down a recommender system into stages.
- Calling out which stages are under whose 'control' -- e.g., this is where we can intervene, this part is on the editor side and we can only support, etc.
- Relating the analyses back to the pipeline so they understand how equity shifts throughout it.

I will clean up the framework a bit to share more broadly but the broad stages are:

Status Quo: Start with all (biased) content
Algorithm Design: Filter down to just content "eligible" for recommendation
Prioritization: Select individual pieces of content to recommend
Impression: Editors see content
Click-through: Editors choose whether or not to accept the recommendation
Edit: Editor does or does not make the edit
Impact: What is the cumulative effect of edits on content equity dimensions?
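
A minimal sketch of how equity could be tracked across these stages, assuming each piece of content is tagged with the stages it reached and with one equity attribute. The stage names mirror the list above; the `region` field, the toy items, and the `share_by_stage` helper are hypothetical and not taken from the actual analyses. The Impact stage is left out because it is a cumulative measure rather than a per-item filter.

```python
# Stage names follow the framework above; all data below is made up.
STAGES = [
    "status_quo",
    "eligible",
    "prioritized",
    "impression",
    "click_through",
    "edit",
]


def share_by_stage(items, group_key, group_value):
    """For each stage, return the share of items reaching that stage
    whose `group_key` equals `group_value` (None if no item reached it)."""
    shares = {}
    for stage in STAGES:
        reached = [item for item in items if stage in item["stages"]]
        if not reached:
            shares[stage] = None
            continue
        matching = sum(1 for item in reached if item.get(group_key) == group_value)
        shares[stage] = matching / len(reached)
    return shares


# Hypothetical toy data: four articles and the stages each one reached.
items = [
    {"title": "A", "region": "Africa",
     "stages": {"status_quo", "eligible", "prioritized", "impression"}},
    {"title": "B", "region": "Europe",
     "stages": {"status_quo", "eligible", "prioritized", "impression",
                "click_through", "edit"}},
    {"title": "C", "region": "Europe", "stages": {"status_quo", "eligible"}},
    {"title": "D", "region": "Africa", "stages": {"status_quo"}},
]

shares = share_by_stage(items, "region", "Africa")
for stage in STAGES:
    print(stage, shares[stage])
```

Comparing the share at one stage with the share at the previous stage shows where in the pipeline a group's representation shifts, and therefore which stage owner could intervene.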

Updates:

Put together a clean version of the slide deck I presented to MM and MW from Growth: https://docs.google.com/presentation/d/1JPhzOiS5Xdsq0-EbN7m6PHalG9Uro-5X5M3egWB4sWk/edit?usp=sharing
Will be discussing more in-depth with team in the coming weeks too.
I still would like to clean up the Meta page so it's more approachable, but the slide deck is close to a place where I'd feel more comfortable sharing out.

Updates:

Prepping for discussion with Martin/Miriam about the current framework -- what are the big open RQs? Where do we want to go with this work? Do we agree on the strategy? etc.
Meta page (https://meta.wikimedia.org/wiki/Research:Prioritization_of_Wikipedia_Articles/Recommendation) is cleaned up to a better state.

Updates: I missed posting an update a while back: I discussed this work with MG and MR. Some good discussion came out of that, especially on the relationship between the Knowledge Gaps work and this work. We have a follow-up next week to discuss more, in particular what it means to have a "representative" sample of content on Wikipedia (it sounds simple but is quite difficult to define and is a very important question for a lot of our recommender systems).

Update: reworking the framing of the slide deck right now based on feedback from MR/MG, making explicit the choice around who to empower through our recommender systems and the role that different approaches might play. I plan to bring the recommendations back to Product in Q3. Next week I'll summarize the recommendations and then close out this task.

Summary of recommendations:

Short-term:
- Empower editors: invest in more topic filters for users (Research / ML Platform / Product).
  - This means productionizing the language-agnostic topic models, a country-based geography model (already strong prototype available based on Wikidata), and potentially reworking the biography side of the topic models.
- Measure impact: standardize evaluation of recommender systems for gender and geographic impact (Product Analytics)
  - This means having a process for keeping the gender/geography data up-to-date on HDFS so it's easy to join in with edit tag data for the various recommender systems
Long-term:
- Empower organizers: connect campaigns ecosystem with recommender systems (Product)
  - I view this as the best long-term and most sustainable approach to aligning our recommender systems with Wikimedia's content equity goals.
- Diversify editors: continue focus on recruiting and supporting greater geographic distribution of editors to address geographic / cultural gaps (WMF)
  - This is nothing new, but the data clearly show that editor geography affects which content is improved.
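
As a rough illustration of the "Measure impact" recommendation, the sketch below joins edits tagged by a recommender system with a per-article gender/geography lookup and counts edits per tag along one dimension. It is plain Python over in-memory records, not the HDFS tables themselves; the field names (`page_id`, `tag`, `gender`, `country`), the tag values, and the `equity_breakdown` helper are all assumptions made for the example.

```python
from collections import Counter, defaultdict


def equity_breakdown(tagged_edits, article_attrs, dimension):
    """Count edits per recommender tag, split by one equity dimension.

    Articles missing from the lookup, or lacking the dimension, are
    counted as "unknown" so coverage gaps stay visible.
    """
    counts = defaultdict(Counter)
    for edit in tagged_edits:
        attrs = article_attrs.get(edit["page_id"], {})
        value = attrs.get(dimension, "unknown")
        counts[edit["tag"]][value] += 1
    return {tag: dict(counter) for tag, counter in counts.items()}


# Hypothetical lookup keyed by page ID (stands in for the gender/geography data).
article_attrs = {
    1: {"gender": "female", "country": "Kenya"},
    2: {"gender": "male", "country": "France"},
    3: {"country": "Brazil"},  # not a biography, so no gender value
}

# Hypothetical tagged edits (stands in for the edit tag data).
tagged_edits = [
    {"page_id": 1, "tag": "recsys-a"},
    {"page_id": 2, "tag": "recsys-a"},
    {"page_id": 3, "tag": "recsys-b"},
    {"page_id": 4, "tag": "recsys-b"},  # page missing from the lookup
]

by_gender = equity_breakdown(tagged_edits, article_attrs, "gender")
by_country = equity_breakdown(tagged_edits, article_attrs, "country")
print(by_gender)
print(by_country)
```

Keeping an explicit "unknown" bucket is a deliberate choice here: if the lookup data goes stale, the bucket grows, which signals that the join needs refreshing before the numbers are compared across systems.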
