EventLogging needs to enque events to avoid draining users' battery on mobile
Closed, ResolvedPublic8 Estimated Story Points
Actions

Description

Similar to how it is done in the statsv client we should enque EL events so we are not sending frequent beacons. See statsv code: https://gerrit.wikimedia.org/r/#/c/mediawiki/extensions/WikimediaEvents/+/500046/

This change is important in the light of baseline metrics like session length that might be sending pings every N seconds.
https://wikitech.wikimedia.org/wiki/Analytics/Data_Lake/Traffic/SessionLength

Note that we are not talking about batching events but rather enqueuing them, they are not batched into a single network call, but rather the network calls happen around the same time. This should improves battery usage on mobile as the times at which we wake up radio and use network should be reduced

Details

	Subject	Repo	Branch	Lines +/-
	statsd: Refactor queue handling to mw.eventLog	mediawiki/extensions/WikimediaEvents	master	+25 -73
	Add background queue with simple interface	mediawiki/extensions/EventLogging	master	+155 -15

Customize query in gerrit

Related Objects

Mentioned In: T246382: New EventLogging queue doesn't log events in window.unload
T240454: Consider how to best architect transmission of events from Browser Client
T228175: Event Platform Client Libraries

Event Timeline

• Nuria created this task.Jun 11 2019, 10:42 PM

Restricted Application added a subscriber: Aklapper. · View Herald TranscriptJun 11 2019, 10:42 PM

• fdans triaged this task as High priority.Jun 13 2019, 5:00 PM

• fdans moved this task from Incoming to Smart Tools for Better Data on the Analytics board.

• Nuria updated the task description. (Show Details)Jun 13 2019, 5:20 PM

• Gilles moved this task from Inbox, needs triage to Radar on the Performance-Team board.Jun 17 2019, 8:15 PM

• Gilles edited projects, added Performance-Team (Radar); removed Performance-Team.

Krinkle moved this task from Limbo to Perf recommendation on the Performance-Team (Radar) board.Jun 20 2019, 6:48 PM

Ping @Gilles: work on this to start second week of July

Milimetric moved this task from Next Up to In Progress on the Analytics-Kanban board.Jul 8 2019, 3:03 PM

cc @Krinkle

@Krinkle so I'm looking at this and feeling a little uncomfortable about basically duplicating the logic from WikimediaEvents.

What do you think about an EventQueue class in the eventLogging core module? It would do common things (managing the timer, schedule, and handle pagehide/visibilitychange). You would pass logic to it for dispatch. So we'd refactor WikimediaEvents to use it and also use it for Event Logging to address this task here.

Yes, a common interface for this would make sense. I'd go further and actually also make it a singleton so as to not have multiple instances tracking and reflecting the same in-browser state. Probably something generic callback-based, like (mw.)requestIdleCallback and (window.)requestAnimationCallback do already.

Hm, if it's a singleton then clients would have to namespace events. Because the statsv client wants to batch events and send them together while the EventLogging client just wants to send them one at a time. So dispatch would be:

dispatch ( namespace ) {
    // get events in { namespace } as a list
    // call a specific callback like { callbacks[ namespace ] } with the list
    // get back a list of URLs
    // loop over the list and call sendBeacon
}

That would allow the statsv client to squish the list of events into a list of one URL, and the EventLogging client to $.map it into another list.

I think it would be cleaner to have separate instances than have namespaces.

If it is a singleton there would be 1 queue but in this case the statsd queue and EL queue are different (and so is the dispatch method for either) so the queue management and dispatch would need to happen outside the singleton so it ends up being a bit of fake-singleton no? (totally could be missing some well -stablished mw conventions here)

Change 524575 had a related patch set uploaded (by Milimetric; owner: Milimetric):
[mediawiki/extensions/EventLogging@master] [WIP] Refactor queue/batching of events from statsv to eventLog module.

https://gerrit.wikimedia.org/r/524575

Change 524576 had a related patch set uploaded (by Milimetric; owner: Milimetric):
[mediawiki/extensions/WikimediaEvents@master] [WIP] Refactoring queue/batching of events to eventLog module

https://gerrit.wikimedia.org/r/524576

In T225578#5342527, @Nuria wrote:

If it is a singleton there would be 1 queue but in this case the statsd queue and EL queue are different (and so is the dispatch method for either) [..]

My thinking was that this mechanism is more low level than that. Purely callback based, like setTimeout or requestIdleCallback. E.g. a callback that (based on current logic) is debounced by 2 seconds, but resolves earlier based on certain events. What you then do with the callback is the caller's decision. So statsd would indeed still have its own array of metrics, and EventLogging core its array of beacon urls. The callback is then to prompt the dispatching call in whatever each each handles that.

• Nuria set the point value for this task to 8.Jul 22 2019, 8:48 PM

Milimetric moved this task from In Progress to In Code Review on the Analytics-Kanban board.Jul 23 2019, 3:33 AM

Is there any production instrumentation that relies on the time at which the beacon request arrived at the edge (the dt field in the Hive event.* tables IIRC)? My apologies if you've already looked into this.

@phuedx
no, there shouldn't be anything relying on that field

• Nuria moved this task from In Code Review to Paused on the Analytics-Kanban board.Sep 17 2019, 4:05 PM

Milimetric moved this task from Paused to In Progress on the Analytics-Kanban board.Sep 30 2019, 4:32 PM

Krinkle edited projects, added Performance-Team; removed Performance-Team (Radar).Sep 30 2019, 9:43 PM

Krinkle moved this task from Inbox, needs triage to To-do: Goals, prioritized next 4 Quarters on the Performance-Team board.

Krinkle moved this task from To-do: Goals, prioritized next 4 Quarters to To-do: Goals prioritized current Quarter on the Performance-Team board.Sep 30 2019, 9:56 PM

Milimetric moved this task from In Progress to In Code Review on the Analytics-Kanban board.Oct 17 2019, 3:56 PM

Krinkle moved this task from To-do: Goals prioritized current Quarter to Doing (old) on the Performance-Team board.Oct 18 2019, 3:37 AM

Change 524575 had a related patch set uploaded (by Milimetric; owner: Milimetric):
[mediawiki/extensions/EventLogging@master] Add background queue with simple interface

https://gerrit.wikimedia.org/r/524575

Change 524575 had a related patch set uploaded (by Milimetric; owner: Milimetric):
[mediawiki/extensions/EventLogging@master] Add background queue with simple interface

https://gerrit.wikimedia.org/r/524575

• jlinehan mentioned this in T228175: Event Platform Client Libraries.Oct 31 2019, 5:39 PM

Quarter has rolled over once or twice since we started, not currently in my goals any more and likely won't have time for this in what's left of December. Putting up first thing for FY2019–20 Q3 in January.

• jlinehan mentioned this in T240454: Consider how to best architect transmission of events from Browser Client.Dec 11 2019, 2:04 PM

Milimetric moved this task from In Code Review to Paused on the Analytics-Kanban board.Dec 11 2019, 5:03 PM

Happy New Year, can we work on getting this out this quarter? :)

Milimetric moved this task from Paused to In Progress on the Analytics-Kanban board.Jan 10 2020, 5:25 PM

Change 524575 merged by jenkins-bot:
[mediawiki/extensions/EventLogging@master] Add background queue with simple interface

https://gerrit.wikimedia.org/r/524575

ReleaseTaggerBot added a project: MW-1.35-notes (1.35.0-wmf.18; 2020-02-04).Jan 28 2020, 10:01 PM

Krinkle moved this task from To-do: Goals, prioritized next 4 Quarters to Doing (old) on the Performance-Team board.Feb 3 2020, 8:57 PM

Change 524576 merged by jenkins-bot:
[mediawiki/extensions/WikimediaEvents@master] statsd: Refactor queue handling to mw.eventLog

https://gerrit.wikimedia.org/r/524576

ReleaseTaggerBot edited projects, added MW-1.35-notes (1.35.0-wmf.19; 2020-02-11); removed MW-1.35-notes (1.35.0-wmf.18; 2020-02-04).Feb 10 2020, 4:01 PM

@Milimetric Is this task ready to resolve?

Krinkle moved this task from Doing (old) to Radar on the Performance-Team board.Feb 10 2020, 9:26 PM

Krinkle edited projects, added Performance-Team (Radar); removed Performance-Team.

Moved to done, Nuria likes to look these over before she resolves them. But yes, no more work as far as I know.

• Nuria closed this task as Resolved.Feb 27 2020, 7:19 PM

DLynch mentioned this in T246382: New EventLogging queue doesn't log events in window.unload.Feb 27 2020, 9:18 PM

• Nuria updated the task description. (Show Details)Jul 21 2020, 12:04 AM

Krinkle edited projects, added Wikimedia-Performance-publish; removed MW-1.35-notes (1.35.0-wmf.19; 2020-02-11).Apr 1 2022, 4:30 PM

Restricted Application added a project: Data-Engineering. · View Herald TranscriptApr 1 2022, 4:30 PM

Krinkle moved this task from Untriaged to Ready for write-up on the Wikimedia-Performance-publish board.Apr 1 2022, 4:30 PM

EventLogging needs to enque events to avoid draining users' battery on mobile Closed, ResolvedPublic8 Estimated Story PointsActions

Description

Details

Related Objects

Event Timeline

EventLogging needs to enque events to avoid draining users' battery on mobile
Closed, ResolvedPublic8 Estimated Story Points
Actions