Measure the user responsiveness to notifications over time
Closed, Resolved · Public

Description

Notifications are meant to be read. Issues with notification systems tend to result in notifications being ignored or overlooked, or in delays before users notice them.

When considering improving the visibility of notifications (T113228), controlling their volume (T100528), or presenting them in a way that aligns with our users' priorities (T108190), we may want to measure the impact these changes have in reducing the general backlog of unread notifications relative to the previous baseline.

To support the above, some understanding of users' responsiveness to notifications will be helpful. Some aspects to measure:

  • Monthly production and consumption of notifications. Number of notifications created and number of notifications read per 30-day period. For example, "In August, 1000 notifications were sent and 500 were read. In September, 600 notifications were sent and 700 were read", etc.
  • Distribution of unread notifications. Percentage of notifications generated during a 30-day period that remain unread during the next 30 days. For example, "30% of the notifications from August remained unread at the end of September".
  • Distribution of response time. Number of notifications generated in a 30-day period that were consumed in 1-2 days, 3-5 days, ... 20-30 days, or remained unread after the next 30 days. For example, "500 of the 1000 notifications generated in August were consumed in less than 2 days".
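The three metrics above can be sketched in code. The following is a minimal illustration in Python over toy data; the record layout (sent date, read date or None) and the bucket boundaries are assumptions for the example, not the actual Echo schema:

```python
from datetime import date

# Toy notification records: (sent_date, read_date or None).
# Hypothetical layout for illustration, not the real Echo schema.
notifications = [
    (date(2015, 8, 3), date(2015, 8, 4)),    # read the next day
    (date(2015, 8, 10), date(2015, 8, 25)),  # read after 15 days
    (date(2015, 8, 20), None),               # never read
    (date(2015, 9, 1), date(2015, 9, 2)),
]

def monthly_counts(notifications, year, month):
    """Metric 1: notifications sent and read in a given month."""
    sent = sum(1 for s, _ in notifications if (s.year, s.month) == (year, month))
    read = sum(1 for _, r in notifications if r and (r.year, r.month) == (year, month))
    return sent, read

def unread_after(notifications, year, month, cutoff):
    """Metric 2: share of a month's notifications still unread at a cutoff date."""
    sent = [(s, r) for s, r in notifications if (s.year, s.month) == (year, month)]
    unread = sum(1 for _, r in sent if r is None or r > cutoff)
    return unread / len(sent) if sent else 0.0

def response_time_buckets(notifications, year, month):
    """Metric 3: distribution of days-to-read for a month's notifications."""
    buckets = {"0-2 days": 0, "3-30 days": 0, "unread": 0}
    for s, r in notifications:
        if (s.year, s.month) != (year, month):
            continue
        if r is None:
            buckets["unread"] += 1
        elif (r - s).days <= 2:
            buckets["0-2 days"] += 1
        else:
            buckets["3-30 days"] += 1
    return buckets
```

With the toy data, `monthly_counts(notifications, 2015, 8)` counts the three August notifications and the two reads that happened in August; the real implementation would run equivalent aggregations in SQL.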

Some possible visualisations are shown below (mockup images attached to the task, not reproduced here).

Pginer-WMF updated the task description. Aug 6 2015, 5:45 PM
Pginer-WMF renamed this task from "Measure the number of unread notifications over time" to "Measure the user responsiveness to notifications over time". Aug 7 2015, 12:16 PM
Pginer-WMF updated the task description.
Huji added a comment. Aug 7 2015, 2:21 PM

Defining the "norm" is going to be hard. It would be unrealistic to expect that most people (or even the "ideal" users so to speak) check all their notifications, and do so in a timely manner. Also, if a notification is unread, but another newer one is read, it can have different meanings (user ignored it, user knew about it some other way and checked the related page directly, etc).

I like the idea as a whole; however, I think it is more of an R&D-type request than a feature request for the production software. It would be reasonable for WMF to investigate this, but until clear conclusions are drawn from the data, and those conclusions justify a change in MediaWiki or its extensions (which, by the way, are also heavily used outside the WMF), I think a request to change the software would be premature.

Another way to put it is: I think WMF should use its own resources to collect and analyze those stats, not the resources of MediaWiki (e.g. volunteer programmers). YMMV.

Relevant schemas: https://meta.wikimedia.org/wiki/Schema:EchoInteraction (especially notification-impression and the *-link-click), and possibly https://meta.wikimedia.org/wiki/Schema:Echo .

Note, we should define exactly what we mean by 'responsiveness'.

In some cases (e.g. a mention notification), just seeing the notification is sufficient. In other cases, it may be worth tracking whether the user clicked it (e.g. a revert notification, to see the revert message, or a user talk notification, to see the user talk message).
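That distinction could be captured with a per-type rule. A toy sketch, where the notification type names and rules are made up for the example rather than taken from Echo:

```python
# Hypothetical mapping from notification type to the interaction that
# counts as a "response"; types and rules here are illustrative only.
RESPONSE_RULE = {
    "mention": "seen",       # seeing the notification is enough
    "revert": "clicked",     # user should click through to the revert
    "user-talk": "clicked",  # user should click through to the message
}

def responded(notification_type, was_seen, was_clicked):
    """Return True if the user's interaction counts as a response."""
    rule = RESPONSE_RULE.get(notification_type, "seen")
    return was_clicked if rule == "clicked" else was_seen
```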

Pginer-WMF updated the task description. Sep 24 2015, 3:56 PM

Defining the "norm" is going to be hard.

Yes. I agree that it is unrealistic to aim for a metric that perfectly reflects users' intents (notifications depend a lot on context and we cannot read users' minds), but I think we can find a measurable outcome that reflects an issue and shows whether it improves when we try a specific solution.

Note, we should define exactly what we mean by 'responsiveness'.

I added some more specific metric definitions. With those I expect we can get a clearer idea of the pace at which notifications are produced and consumed, whether unread notifications are accumulating, and how long it takes users to consume them on average. We can get results per wiki or per notification type, or even adjust the time periods as we iterate, but I think the questions I mentioned are quite basic and we don't have much insight into them right now.

Jay8g added a subscriber: Jay8g. Sep 25 2015, 3:30 AM
Catrope raised the priority of this task from Normal to High.
Catrope added a subscriber: Catrope.

This data should all be available from the database without any software modifications. However, we probably want to track this data on an ongoing basis, not just as a one-off query, so we should build a limn dashboard or whatever else we're supposed to use for this kind of thing (graphing the results of DB queries over time).

Responding here to an email from @matthiasmullie:

We have an existing limn dashboard that’s a couple of years old[2], and I’d like to add 3 charts to it.
It looks like the Limn config is hosted on GitHub[3] - is that where I submit the changes to, or is that a mirror?

That is not a mirror; it dates from before we moved our repositories to Gerrit. The dashboard configuration will need to be updated in that repo; I have push rights, and you can submit a pull request if you like. But I'll help you with that once the files are generated and ready on datasets (more below).

I have the queries[4] to generate the CSV files, but I’m unsure where to put those.

The best place is in this repository in gerrit: https://gerrit.wikimedia.org/r/#/admin/projects/analytics/limn-ee-data.

IIRC, they used to be generated from a cronjob that ran on stat1001, but I seem to remember that has changed.
Existing docs[5] for that dashboard aren’t really useful - can you point me in the right direction for how to get them on datasets.wikimedia.org?

So the process is:

  • create a folder called "ee" in the analytics/limn-ee-data repository
  • create a config.yaml in there that configures the reportupdater - a tool that will run your SQL on an on-going basis and make sure to re-run if it misses a day, re-order your columns if you change your query, etc. All the limn-*-data repositories in gerrit have examples, but the most straightforward one is probably limn-language-data. I'll link to github (easier to read code): https://github.com/wikimedia/analytics-limn-language-data/blob/master/language/config.yaml
  • put your SQL scripts in ee/name-of-graph.sql noting the convention that the keys in the config.yaml configuration need to match the name of the sql file (without .sql)
  • note that the SQL scripts can be templates that use special parameters like {from_timestamp} which lets the reportupdater run your query one day at a time, and like {wiki_db} which you can configure from config.yaml like the language team did.
  • submit that to gerrit and I'll work with you to make sure the scripts are correct in their template incarnation.
  • once that's merged, the reportupdater will run it (I have to enable it in puppet), the files will be generated and rsynced to a folder that serves them on datasets.wikimedia.org
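To illustrate the templated-SQL step above, a report file (hypothetical name ee/monthly_notifications.sql) could look roughly like the sketch below. The {from_timestamp} and {wiki_db} placeholders are the ones described in the steps; I'm also assuming a matching {to_timestamp} placeholder exists, and everything else here is an illustration rather than the actual scripts:

```sql
-- Hypothetical sketch of a reportupdater template, not the real query.
-- reportupdater substitutes the timestamp placeholders so the query
-- can run one day at a time against the configured {wiki_db}.
SELECT
    DATE('{from_timestamp}') AS date,
    COUNT(*) AS notifications_sent
FROM echo_notification
WHERE notification_timestamp BETWEEN '{from_timestamp}' AND '{to_timestamp}';
```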

Docs are hard because of the XKCD standards problem :) For more in-depth information about what we're going to do you can read here[6] (but don't unless you're curious, I'll walk you through it).

As always, you can ping me on IRC in #wikimedia-analytics.

1: https://phabricator.wikimedia.org/T108208
2: http://ee-dashboard.wmflabs.org/dashboards/enwiki-features
3: https://github.com/wikimedia/limn-editor-engagement-data
4: https://gist.github.com/matthiasmullie/bff1818c31d8c2762da9
5: https://wikitech.wikimedia.org/wiki/EE_Dashboard
6: https://wikitech.wikimedia.org/wiki/Analytics/Dashboards

Change 249394 had a related patch set uploaded (by Matthias Mullie):
Measure the user responsiveness to notifications over time

https://gerrit.wikimedia.org/r/249394

Thanks for the help, @Milimetric!

I've submitted https://gerrit.wikimedia.org/r/#/c/249394/ (data) & https://github.com/wikimedia/limn-editor-engagement-data/pull/6 (charts).
I've added some comments myself already about things I was unsure of, but I'm sure there's even more things I'm unaware of :)

(Note: I haven't tried running the charts myself yet; I haven't been able to get Limn running on my machine so far.)

Nice work, commented on both changes.

Milimetric moved this task from Next Up to In Progress on the Analytics-Kanban board.

Change 249394 merged by Milimetric:
Measure the user responsiveness to notifications over time

https://gerrit.wikimedia.org/r/249394

Checked https://gerrit.wikimedia.org/r/#/c/249394/5/ee/monthly_production_and_consumption_of_notifications.sql

It takes quite a long time to run; e.g. for BETWEEN '20151104005301' AND '20151104015401' it took 12 min 22.88 sec:

mysql:research@x1-analytics-slave [enwiki]> SELECT
    DATE('20151104') AS date,
    SUM(notifications_sent) AS notifications_sent,
    SUM(notifications_read) AS notifications_read
FROM (
    SELECT COUNT(*) AS notifications_sent, 0 AS notifications_read
    FROM echo_notification
    WHERE notification_timestamp BETWEEN '20151104005301' AND '20151104015401'
    UNION
    SELECT 0 AS notifications_sent, COUNT(*) AS notifications_read
    FROM echo_notification AS notification
    LEFT JOIN (
        SELECT notification_read_timestamp, notification_bundle_display_hash
        FROM echo_notification
        WHERE notification_bundle_base = 1
    ) bundle
        ON notification.notification_bundle_display_hash = bundle.notification_bundle_display_hash
        AND notification.notification_bundle_display_hash != ''
    WHERE notification.notification_read_timestamp BETWEEN '20151104005301' AND '20151104015401'
) AS temp;
+------------+--------------------+--------------------+
| date       | notifications_sent | notifications_read |
+------------+--------------------+--------------------+
| 2015-11-04 |                648 |                365 |
+------------+--------------------+--------------------+
1 row in set (12 min 22.88 sec)

For BETWEEN '20151104005301' AND '20151104005501' it took 21.69 sec:

mysql:research@x1-analytics-slave [enwiki]> SELECT
    DATE('20151104') AS date,
    SUM(notifications_sent) AS notifications_sent,
    SUM(notifications_read) AS notifications_read
FROM (
    SELECT COUNT(*) AS notifications_sent, 0 AS notifications_read
    FROM echo_notification
    WHERE notification_timestamp BETWEEN '20151104005301' AND '20151104005501'
    UNION
    SELECT 0 AS notifications_sent, COUNT(*) AS notifications_read
    FROM echo_notification AS notification
    LEFT JOIN (
        SELECT notification_read_timestamp, notification_bundle_display_hash
        FROM echo_notification
        WHERE notification_bundle_base = 1
    ) bundle
        ON notification.notification_bundle_display_hash = bundle.notification_bundle_display_hash
        AND notification.notification_bundle_display_hash != ''
    WHERE notification.notification_read_timestamp BETWEEN '20151104005301' AND '20151104005501'
) AS temp;

+------------+--------------------+--------------------+
| date       | notifications_sent | notifications_read |
+------------+--------------------+--------------------+
| 2015-11-04 |                 15 |                 12 |
+------------+--------------------+--------------------+
1 row in set (21.69 sec)

EXPLAIN for the above query shows a huge estimated number of rows scanned (rows: 172187973127722):

*************************** 1. row ***************************
           id: 1
  select_type: PRIMARY
        table: <derived2>
         type: ALL
possible_keys: NULL
          key: NULL
      key_len: NULL
          ref: NULL
         rows: 172187973127722
        Extra: 
*************************** 2. row ***************************
           id: 2
  select_type: DERIVED
        table: echo_notification
         type: index
possible_keys: NULL
          key: user_timestamp
      key_len: 18
          ref: NULL
         rows: 13122041
        Extra: Using where; Using index
*************************** 3. row ***************************
           id: 3
  select_type: UNION
        table: notification
         type: ALL
possible_keys: NULL
          key: NULL
      key_len: NULL
          ref: NULL
         rows: 13122041
        Extra: Using where
*************************** 4. row ***************************
           id: 3
  select_type: UNION
        table: echo_notification
         type: index
possible_keys: NULL
          key: echo_notification_user_hash_base_timestamp
      key_len: 53
          ref: NULL
         rows: 13122041
        Extra: Using where; Using index
*************************** 5. row ***************************
           id: NULL
  select_type: UNION RESULT
        table: <union2,3>
         type: ALL
possible_keys: NULL
          key: NULL
      key_len: NULL
          ref: NULL
         rows: NULL
        Extra: 
5 rows in set (0.00 sec)

Yeah, the main problem is that there's no index on the timestamp column there. One would help a lot, and a request has been filed.
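To illustrate why the missing index matters, here is a small sketch using Python's sqlite3 (a stand-in only: the real table lives in MariaDB, whose planner differs, and the schema here is simplified). Adding an index on the timestamp column turns the full table scan into an index range search:

```python
import sqlite3

# Minimal stand-in for the echo_notification table (illustrative,
# not the real Echo schema).
con = sqlite3.connect(":memory:")
con.execute("CREATE TABLE echo_notification ("
            "notification_event INTEGER, notification_timestamp TEXT)")
con.executemany(
    "INSERT INTO echo_notification VALUES (?, ?)",
    [(i, "20151104%06d" % i) for i in range(1000)],
)

query = ("SELECT COUNT(*) FROM echo_notification WHERE "
         "notification_timestamp BETWEEN '20151104000100' AND '20151104000500'")

def plan(sql):
    # The fourth column of EXPLAIN QUERY PLAN output describes the plan.
    return " ".join(row[3] for row in con.execute("EXPLAIN QUERY PLAN " + sql))

plan_before = plan(query)  # no usable index: full table scan

# The suggested fix: an index on the timestamp column.
con.execute("CREATE INDEX echo_notification_timestamp "
            "ON echo_notification (notification_timestamp)")

plan_after = plan(query)  # now a range search using the new index
print(plan_before)
print(plan_after)
```

The same BETWEEN predicate goes from scanning every row to a bounded range scan, which is the effect the requested MariaDB index would have on the dashboard queries above.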