Page MenuHomePhabricator

[L] Count possible notifications to users for image suggestions
Closed, ResolvedPublic

Description

For the SD Image Suggestions MVP product we will be sending notifications to users who are watchers of articles for which we have a suggested image, or who have edited those articles in the last 30 days.

We want to get an idea of how many notifications might be sent, and how this compares to the notification burden users already have and the typical user interaction with the notifications the user receives currently

Relevant wikis:

  • pt
  • ru
  • id

Acceptance criteria:

  • measurements of how many notifications users are getting right now for all relevant wikis - maximum, average, median and, if it's not too difficult, median of the top 10% and top 25% (1 day & 30 day over 90 days )
  • number of articles users are watching for all relevant wikis - maximum, average, median
  • a count of users that will be notified of suggested images for unillustrated articles they are watching, for all relevant wikis
  • a count of users that will be notified of suggested images for unillustrated articles they have edited in the last 30 days, for all relevant wikis
  • a count of unillustrated articles that users are watching, for which there is a suggestion of an image, for all relevant wikis

Note that some of the notebook code for T280370 may be re-usable for this task

Event Timeline

CBogen renamed this task from Count possible notifications to users for image suggestions to [L] Count possible notifications to users for image suggestions.Oct 6 2021, 4:40 PM

measurements of how many notifications users are getting right now for all relevant wikis - maximum, average, median and, if it's not too difficult, median of the top 10% and top 25% (1 day & 30 day )
(based on mariadb queries covering Jan 1 - Jan 30)

ru

  • 1 day Maximums -- 28, 37, 123, 114, 32, 51, 26, 45, 19, 35, 36, 18, 38, 58, 63, 46, 35, 36, 80, 720, 51, 1882, 201, 46, 74, 154, 106, 59, 117
  • 1 day Averages -- 2.07, 2.09, 2.16, 2.26, 2.02, 2.37, 2.13, 2.2, 2.03, 2.03, 2.19, 1.99, 2.16, 2.18, 2.13, 2.09, 2.37, 2.08, 2.19, 2.9, 2.21, 4.92, 2.32, 2.33, 2.26, 2.36, 2.36, 1.75, 2.34
  • 1 day Medians -- 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0
  • 30 day Maximum -- 1999
  • 30 day Average -- 4.23
  • 30 day Median -- 1.0
  • 1 day Averages top 25% -- 4.87, 4.78, 5.06, 5.61, 4.6, 5.81, 4.91, 5.26, 4.56, 4.6, 5.2, 4.38, 4.9, 5.14, 5.03, 4.81, 5.85, 4.77, 5.14, 8.06, 5.3, 16.23, 5.78, 5.78, 5.5, 5.9, 5.78, 3.92, 5.69
  • 1 day Medians top 25% -- 3.0, 3.0, 3.0, 3.0, 3.0, 4.0, 4.0, 3.0, 3.5, 3.0, 3.0, 3.0, 3.0, 3.0, 3.0, 3.0, 4.0, 3.0, 3.0, 4.0, 3.0, 3.0, 4.0, 4.0, 4.0, 4.0, 4.0, 3.0, 4.0
  • 1 day Averages top 10% -- 7.63, 7.48, 7.99, 9.48, 6.99, 9.74, 7.47, 8.64, 6.64, 7.11, 8.45, 6.38, 7.59, 8.12, 8.0, 7.63, 9.77, 7.45, 8.22, 15.42, 8.61, 36.05, 9.68, 9.66, 8.94, 9.94, 9.64, 6.3, 9.44
  • 1 day Medians top 10% -- 5.0, 5.0, 5.0, 5.0, 5.0, 7.0, 5.0, 5.0, 5.0, 5.0, 5.0, 5.0, 6.0, 6.0, 5.0, 5.0, 6.0, 5.0, 5.0, 6.0, 5.0, 6.0, 6.0, 6.0, 6.0, 5.0, 6.0, 4.0, 6.0
  • 30 day Average top 25% -- 12.66
  • 30 day Median top 25% -- 4.0
  • 30 day Average top 10% -- 26.04
  • 30 day Median top 10% -- 10.0

pt

  • 1 day Maximums -- 23, 22, 46, 39, 78, 27, 15, 16, 52, 94, 115, 32, 53, 22, 56, 61, 28, 36, 16, 93, 26, 65, 60, 44, 81, 60, 55, 106, 34
  • 1 day Averages -- 1.7, 2.36, 2.27, 2.44, 2.42, 2.38, 1.73, 1.69, 2.74, 2.56, 2.55, 2.68, 2.27, 2.2, 2.59, 2.49, 1.63, 1.91, 2.36, 1.79, 2.21, 2.48, 2.26, 2.71, 2.81, 2.67, 1.82, 1.91, 2.21
  • 1 day Medians -- 1.0, 2.0, 2.0, 2.0, 2.0, 2.0, 1.0, 1.0, 3.0, 2.0, 2.0, 3.0, 2.0, 2.0, 2.0, 2.0, 1.0, 1.0, 2.0, 1.0, 2.0, 2.0, 2.0, 3.0, 2.0, 2.0, 1.0, 1.0, 2.0
  • 30 day Maximum -- 984
  • 30 day Average -- 4.7833830050156285
  • 30 day Median -- 2.0
  • 1 day Averages top 25% -- 3.53, 4.97, 3.97, 5.01, 4.4, 5.08, 3.49, 3.44, 4.23, 5.61, 5.45, 3.98, 3.83, 4.34, 5.87, 5.29, 3.33, 4.14, 4.61, 3.92, 4.61, 5.33, 4.71, 4.31, 6.63, 5.81, 3.87, 4.25, 4.37
  • 1 day Medians top 25% -- 3.0, 4.0, 3.0, 4.0, 3.0, 4.0, 3.0, 3.0, 3.0, 4.0, 3.0, 3.0, 3.0, 3.0, 4.0, 4.0, 3.0, 3.0, 4.0, 3.0, 3.0, 4.0, 4.0, 3.0, 4.0, 4.0, 3.0, 3.0, 3.0
  • 1 day Averages top 10% -- 5.5, 7.49, 6.22, 7.47, 7.23, 7.91, 5.16, 4.94, 6.1, 8.99, 8.91, 5.47, 5.87, 6.23, 9.72, 8.48, 4.98, 6.54, 6.76, 6.28, 6.94, 8.51, 6.99, 6.3, 11.69, 9.3, 6.09, 6.99, 6.33
  • 1 day Medians top 10% -- 4.0, 6.0, 5.0, 5.0, 4.5, 5.5, 4.0, 4.0, 5.0, 5.0, 5.0, 4.0, 4.0, 5.0, 7.0, 5.0, 4.0, 4.0, 6.0, 4.0, 5.0, 5.0, 5.0, 5.0, 6.0, 6.5, 4.0, 5.0, 5.0
  • 30 day Average top 25% -- 13.19
  • 30 day Median top 25% -- 5.0
  • 30 day Average top 10% -- 26.05
  • 30 day Median top 10% -- 24.0

id

  • 1 day Maximums --15, 49, 19, 114, 26, 36, 53, 14, 28, 17, 28, 39, 27, 13, 22, 10, 14, 9, 30, 25, 30, 21, 27, 17, 23, 25, 13, 28, 16
  • 1 day Averages -- 1.61, 1.8, 1.7, 2.57, 1.39, 1.79, 1.97, 1.7, 1.82, 1.84, 1.88, 1.99, 1.96, 1.61, 2.38, 1.79, 1.76, 1.75, 1.47, 1.88, 1.94, 1.87, 1.97, 1.87, 1.89, 1.79, 1.68, 1.9, 1.64
  • 1 day Medians -- 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 2.0, 1.0, 1.0, 2.0, 1.0, 2.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0
  • 30 day Maximum -- 390
  • 30 day Average -- 3.08
  • 30 day Median -- 2.0
  • 1 day Averages top 25% -- 3.01, 3.58, 3.38, 5.96, 2.58, 3.45, 3.5, 3.41, 3.69, 3.29, 3.85, 3.96, 3.89, 3.0, 4.69, 3.24, 3.14, 3.28, 2.86, 3.23, 3.8, 3.8, 4.05, 3.55, 3.81, 3.57, 3.24, 3.72, 3.14
  • 1 day Medians top 25% -- 2.0, 3.0, 3.0, 3.0, 2.0, 3.0, 3.0, 3.0, 3.0, 3.0, 3.0, 3.0, 3.0, 2.0, 4.0, 3.0, 3.0, 3.0, 2.0, 3.0, 3.0, 3.0, 3.0, 3.0, 3.0, 3.0, 3.0, 3.0, 2.0
  • 1 day Averages top 10% -- 4.3, 5.52, 5.05, 11.51, 3.76, 5.22, 5.38, 5.07, 5.65, 4.61, 5.78, 6.1, 5.82, 4.35, 5.73, 4.35, 4.12, 4.48, 4.53, 4.71, 5.48, 5.74, 6.35, 5.25, 5.53, 5.51, 4.73, 5.63, 4.64
  • 1 day Medians top 10% -- 3.0, 4.0, 4.0, 5.0, 3.0, 3.0, 4.0, 4.0, 4.0, 4.0, 4.0, 4.0, 4.0, 4.0, 4.0, 3.0, 3.0, 4.0, 3.0, 3.0, 5.0, 4.5, 4.0, 4.0, 4.0, 4.0, 4.0, 4.0, 3.0
  • 30 day Average top 25% -- 7.834972210346302
  • 30 day Median top 25% -- 3.0
  • 30 day Average top 10% -- 15.24385026737968
  • 30 day Median top 10% -- 7.0

number of articles users are watching for all relevant wikis - maximum, average, median
(based on mariadb queries)

ru

  • Maximum -- 111058
  • Average -- 27.35
  • Median -- 1

pt

  • Maximum -- 220943
  • Average -- 23.74
  • Median -- 1

id

  • Maximum -- 111058
  • Average -- 12.30
  • Median -- 1

a count of users that will be notified of suggested images for unillustrated articles they are watching, for all relevant wikis:
(based on mariadb queries and prior unillustrated article list dump )

ru -- 46666
pt -- 8967
id --

a count of users that will be notified of suggested images for unillustrated articles they have edited in the last 30 days, for all relevant wikis
(based on mariadb queries and prior unillustrated article list dump )

ru --25
pt -- 2
id --

a count of unillustrated articles that users are watching, for which there is a suggestion of an image, for all relevant wikis
(based on mariadb queries and prior unillustrated article list dump )

ru -- 113538
pt -- 19394
id --

The GatherAllSuggestions.php needs a quick merge so I can get updated lists but once that is done I can run through the outputs for ru and pt and ensure the the numbers are up to day. The pure mariadb queries are all up to date.

Seddon changed the task status from Open to In Progress.Feb 16 2022, 2:10 PM
Seddon changed the task status from In Progress to Stalled.Feb 18 2022, 11:39 AM

@Seddon is this still blocked, or can it move forward? If it's blocked, what do you need to move it forward?

Seddon changed the task status from Stalled to In Progress.Mar 22 2022, 11:16 PM

I need to add some additional documentation for the code used to generate the stats and it could be improved to require a little less human input, but it can be explored here:

@Seddon I just added Indonesian to the list of wikis for this task now that we're moving forward with that as our third launch language; please let me know if that adds significantly more work. Thanks!

Apologies @CBogen, just noticed this request! Will pull the additional numbers for id.wiki

Moving to blocked because idwiki wasn’t part of the suggestions data set that was generated last year. We’ll need to wait for the new production API before we can generate complete numbers for indonesia. The rest of the results are in a comment above T291403#7711276.

Closing because this was sufficient to get us started and we have analytics of the actual feature coming soon in T299667.