Review banner history log data and confirm that it satisfies use cases
Closed, ResolvedPublic
Actions

Assigned To

Authored By

	awight
	Sep 17 2015, 10:07 PM

Description

Look over the data collected by EventLogging and decide whether it covers Fundraising's needs for the banner history MVP.

We can iterate on this until the data has everything we need. The acceptance criterion for finishing this task is that the data is as complete as we can get it. At that point, we're ready to deploy the feature to real readers.

Related Objects
Search...

Status	Assigned	Task
Resolved	awight	T45250 Redo /beacon/impression system (formerly Special:RecordImpression) to remove extra round trips on all FR impressions (title was: S:RI should pyroperish)
Declined	None	T90917 Get banner count via Special:BannerLoader
Duplicate	None	T105109 Create UI for limiting banner impressions (banner diet?)
Resolved	• atgo	T78089 [epic] Banner History MVP
Resolved	None	T112020 [Mini epic] Activate Banner History!
Resolved	• ellery	T112986 Review banner history log data and confirm that it satisfies use cases

Event Timeline

awight created this task.Sep 17 2015, 10:07 PM

awight assigned this task to • ellery.

awight raised the priority of this task from to High.

awight updated the task description. (Show Details)

awight added projects: Fundraising Sprint Tom Waits, Fundraising-Backlog, Research.

awight added subscribers: • ellery, MeganHernandez_WMF, • atgo and 3 others.

Here's a tidbit of the log from the beta cluster. Entries with the "r" property were randomly sampled, where "r" indicates the sample rate. Entries with the "i" property were generated when a user clicked on Donate--in that case, the log will always be sent. The value of the "i" property is the temporary banner history log ID that you'll use to correlate with donations. (More doc in the schema itself.)

banner_history_events.log31 KBDownload

awight renamed this task from Spike: Review banner history log data and confirm that it satisfies use cases to Review banner history log data and confirm that it satisfies use cases.Sep 17 2015, 10:12 PM

awight updated the task description. (Show Details)

awight set Security to None.

Do you want the history log to include banner impressions from other campaigns?

Do you want a record of pageviews with no banner impression?

• DStrine added a project: Unplanned-Sprint-Work.Sep 21 2015, 6:30 PM

• DStrine moved this task from Triage to Closed Sprint Work Q1 1516 on the Fundraising-Backlog board.Sep 21 2015, 7:00 PM

@awight Having page views would make the data quite a bit more interesting. Unfortunately, if we can only send back a log with 10 or so items, then these would "wash out" the more valuable banner data log items. An efficient way to get the most interesting page view data would be to just have a count for the number page views between impressions. This count could even be an element in the log item (e.g. views_until_next_impression). This does not need to be in the MVP but would be a valuable addition.

@ellery @awight If the additional log entries would be to count pageviews or record banner displays in the same segment of users as is already targeted by a campaign with banner history enabled, then I think I have a solution: run a bit of code for users that were targeted by the campaign but that weren't included in it due to allocation blocks and random selection thereof.

This would work for the following scenarios:

Users are targeted by a throttled low-level campaign. As currently happens, users targeted by such a campaign randomly get the campaign for n% of pageviews. This would let us run code for the remaining percent to count the intervening pageviews.
Users are targeted by two or more campaigns at once, one of which is a fundraising campaign with banner history enabled. In this case, we'd also run code when the user is randomly selected to get any of the non-fundraising campaigns. So we'd also be able to put stuff in the banner history log at that time, too.

What this approach would not do is record pageviews or banner displays for any users not targeted by the fundraising campaign (on country, language, project, device or logged-in status criteria).

Thoughts? :)

AndyRussG moved this task from Backlog to Doing on the Fundraising Sprint Tom Waits board.Sep 22 2015, 9:50 PM

• DarTar moved this task from Backlog to Time Sensitive on the Research board.Sep 24 2015, 10:18 PM

• DarTar moved this task from Time Sensitive to Done (current quarter) on the Research board.

@ellery here're my main take-aways from our e-mail discussion from last week, as they relate to this task.

The main unit of analysis is pageview-in-the-campaign (erstwhile "impression"). The data is fine for analysis on that basis.
By sending a log ID for every time a banner history log is sent via EventLogging, and never sending a log more than once per pageview, we cleanly separate pageviews that lead to donations and those that don't. (This is fixed and now deployed on production.)
We'll leave data about banners and pageviews outside the campaign for a future iteration.

Given the above, it seems maybe we can mark this task resolved? What do you think? Or should we wait until you've seen data on the civi side, or from an actual campaign?

Thanks!!

Thanks for looking at this!

awight moved this task from Doing to Done on the Fundraising Sprint Tom Waits board.Sep 30 2015, 10:40 PM

@AndyRussG is the data in HDFS on the analytics cluster now as well?

@ellery: I'm not sure... I don't know how to query it on HDFS. But I can see the events successfully zooming through Kafka like so:

$ kafkacat -o beginning -t eventlogging_CentralNoticeBannerHistory -b kafka1012.eqiad.wmnet:9092

• DStrine moved this task from Closed Sprint Work Q1 1516 to Closed Sprint Work Completed in Q2 1516 on the Fundraising-Backlog board.Oct 5 2015, 5:26 PM

• DStrine moved this task from Closed Sprint Work Completed in Q2 1516 to Closed Sprint Work Q1 1516 on the Fundraising-Backlog board.

• mmodell removed a subscriber: awight.Jun 22 2017, 9:37 PM

• DStrine edited projects, added Fr-tech-archived-from-FY-2015/16; removed Fundraising-Backlog.Jan 11 2018, 9:39 PM

Review banner history log data and confirm that it satisfies use casesClosed, ResolvedPublicActions

Description

Related ObjectsSearch...

Event Timeline

Review banner history log data and confirm that it satisfies use cases
Closed, ResolvedPublic
Actions

Related Objects
Search...