Page MenuHomePhabricator

Explore data about how many patrollers would qualify to see IPs for different thresholds across multiple wikis
Closed, ResolvedPublic

Description

Motivation

We want to set thresholds for account age and edit count which will determine who can qualify to opt-in to see IP addresses. This is an important limit so that enough patrollers who need access can qualify yet we can limit IP address exposure. Our goal for this analysis is to understand how many users will qualify across different wikis if the thresholds varied.

Spec
  • Calculate the following:
    • Number of users who qualify
    • Percent of total registered users who qualify
  • Given that the qualifying requirements are:
    • Scenario 1:
      • Account is 6 months or older on the project
      • 500 or more edits on the project
      • No active block on any project (this piece is optional. feel free to skip if it complicates data analysis)
    • Scenario 2:
      • Account is 6 months or older on the project
      • 400 or more edits on the project
      • No active block on any project (this piece is optional. feel free to skip if it complicates data analysis)
    • Scenario 3:
      • Account is 6 months or older on the project
      • 300 or more edits on the project
      • No active block on any project (this piece is optional. feel free to skip if it complicates data analysis)

Preferred wikis for the analysis (wikis with 300 or more monthly active editors):

  • English Wikipedia
  • Wikimedia Commons
  • Wikidata
  • Japanese Wikipedia
  • German Wikipedia
  • French Wikipedia
  • Spanish Wikipedia
  • Russian Wikipedia
  • Chinese Wikipedia
  • Italian Wikipedia
  • Portuguese Wikipedia
  • Persian Wikipedia
  • Polish Wikipedia
  • Dutch Wikipedia
  • Ukrainian Wikipedia
  • Indonesian Wikipedia
  • Hebrew Wikipedia
  • Turkish Wikipedia
  • Arabic Wikipedia
  • Meta-Wiki
  • English Wiktionary
  • Czech Wikipedia
  • Vietnamese Wikipedia
  • Korean Wikipedia
  • Swedish Wikipedia
  • Finnish Wikipedia
  • Hungarian Wikipedia
  • Catalan Wikipedia
  • Thai Wikipedia
  • Greek Wikipedia
  • Norwegian Bokmål Wikipedia
  • Bengali Wikipedia
Note
  • There will ways for communities to request exceptions to the process if there is a demonstrated need.
  • Communities will be able to self-impose tighter restrictions for who can qualify to see IP addresses beyond these limits if they want to do so.

Event Timeline

Niharika created this task.

@Niharika , here is the analysis summary. Let me know if you have any questions.

Methodology

Metrics we measured:

  • Number of users who qualify for three criteria scenarios
  • Percent of total registered users who qualify for three criteria scenarios

Users are pulled from wmf.mediawiki_history schema. The analysis focused on the editors who edited at least once on the given wiki between 2022-01-01 and 2022-12-31 and are 6 months or older on the project ( registered before 2022-07-01). Bot users and blocked users are excluded.

Users' total edits are pulled from the event_user_revision_count field. It’s a rough number of edits and edit-like actions the user has performed. (ref: https://www.mediawiki.org/wiki/Manual:User_table). We analyzed users in three scenarios:

  • 300 or more historical edits on the project
  • 400 or more historical edits on the project
  • 500 or more historical edits on the project

To be consistent with users’ total edits, editors of all types of edits are included in the analysis, instead of restricting to editors who have submitted content edits.

Total registered users are pulled from the 2022-12 snapshot of wmf_raw.mediawiki_user schema.

Analysis Result

wiki_dbwiki english nameeditors (edits>= 300)editors (edits>=400)editors (edits>=500)registered userseditors /registered users (edits>= 300)editors / registered users (edits>= 400)editors /registered users (edits>= 500)
enwikiEnglish Wikipedia524034532440520447345020.12%0.10%0.09%
wikidatawikiWikidata16114143361305757873990.28%0.25%0.23%
commonswikiWikimedia Commons161011414812793115492440.14%0.12%0.11%
dewikiGerman Wikipedia92208033723340557120.23%0.20%0.18%
frwikiFrench Wikipedia71156106548845464710.16%0.13%0.12%
jawikiJapanese Wikipedia65085529485520102640.32%0.28%0.24%
ruwikiRussian Wikipedia53734759437233055100.16%0.14%0.13%
eswikiSpanish Wikipedia55074741426867270650.08%0.07%0.06%
zhwikiChinese Wikipedia40913583323932967240.12%0.11%0.10%
itwikiItalian Wikipedia37913340301423297400.16%0.14%0.13%
ptwikiPortuguese Wikipedia23552019182128521580.08%0.07%0.06%
plwikiPolish Wikipedia21681946180212102600.18%0.16%0.15%
nlwikiDutch Wikipedia17581536140312486350.14%0.12%0.11%
metawikiMeta-Wiki179715071313369603380.00%0.00%0.00%
ukwikiUkrainian Wikipedia1501131812026620180.23%0.20%0.18%
hewikiHebrew Wikipedia14031228111710393520.14%0.12%0.11%
fawikiPersian Wikipedia13321143103411798480.11%0.10%0.09%
enwiktionaryEnglish Wiktionary105293185540124720.03%0.02%0.02%
kowikiKorean Wikipedia9428417737669730.12%0.11%0.10%
svwikiSwedish Wikipedia9368317608474830.11%0.10%0.09%
arwikiArabic Wikipedia94083675323605680.04%0.04%0.03%
cswikiCzech Wikipedia9208187486015760.15%0.14%0.12%
idwikiIndonesian Wikipedia94680270313749120.07%0.06%0.05%
trwikiTurkish Wikipedia82171666014564960.06%0.05%0.05%
huwikiHungarian Wikipedia7776976495206320.15%0.13%0.12%
viwikiVietnamese Wikipedia7176435858928000.08%0.07%0.07%
fiwikiFinnish Wikipedia6956255675316910.13%0.12%0.11%
cawikiCatalan Wikipedia5574854384376910.13%0.11%0.10%
thwikiThai Wikipedia4984344014386670.11%0.10%0.09%
nowikiNorwegian Bokmål Wikipedia4533963515696100.08%0.07%0.06%
elwikiGreek Wikipedia3753232943796200.10%0.09%0.08%
bnwikiBengali Wikipedia3483062784013440.09%0.08%0.07%
mpopov subscribed.

Thank you, Jennifer!

@Niharika: I'm resolving this task as it seems to be compete according to original scope. If you have any follow-up questions or additional requests, please create a new task and work with Jennifer to figure out its priority relative to existing planned work.