To import impression data for our fundraising banners into our own database, we need that data to be publicly available. This ticket is meant to resolve T243092 (and T255446).
Banner impression data is already aggregated regularly in banner_activity_minutely, but with a delay of ~2 hours, which does not meet the requirements of WMDE's Kampagnen und Fundraising team.
Acceptance Criteria
- Banner impression data is collected and aggregated in 15-minute time spans for all banners that
  - match the regular expression B\d{2}_WMDE
  - sent a beacon to a uri_host matching (en|de)\.(m\.)?wikipedia\.org
- There is one file for each time span.
- The file is named banner_impressions_YYYYMMDD_hhmm.csv, using the beginning of the time span.
- The file contains comma-separated values:
  - Banner name
  - Impression count (extrapolated)
- The files are published on analytics.wikitech.org
- Files older than 30 days are deleted regularly.
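
The criteria above can be sketched in Python as follows. This is only an illustration under stated assumptions, not the actual aggregation job: the event tuple layout `(timestamp, banner, uri_host, sample_rate)`, all function names, and the extrapolation as the sum of `1 / sample_rate` are hypothetical. It is also an assumption that the banner pattern may occur anywhere in the banner name while the host pattern must match the whole uri_host, and that timestamps are used as given, with no time zone conversion.

```python
import csv
import io
import re
from collections import Counter
from datetime import datetime, timedelta

# Patterns from the acceptance criteria; how they are anchored is an assumption.
BANNER_RE = re.compile(r"B\d{2}_WMDE")
HOST_RE = re.compile(r"(en|de)\.(m\.)?wikipedia\.org")
NAME_FORMAT = "banner_impressions_%Y%m%d_%H%M.csv"
RETENTION = timedelta(days=30)


def span_start(ts: datetime) -> datetime:
    """Round a timestamp down to the beginning of its 15-minute span."""
    return ts.replace(minute=ts.minute - ts.minute % 15, second=0, microsecond=0)


def file_name(start: datetime) -> str:
    """Name the file after the beginning of the time span."""
    return start.strftime(NAME_FORMAT)


def matches(banner: str, uri_host: str) -> bool:
    """Keep only WMDE banners that sent a beacon to an en/de Wikipedia host."""
    return bool(BANNER_RE.search(banner)) and bool(HOST_RE.fullmatch(uri_host))


def aggregate(events):
    """Return {span start: Counter({banner name: extrapolated count})}.

    Each event is a hypothetical (timestamp, banner, uri_host, sample_rate)
    tuple; one sampled beacon is assumed to stand for 1 / sample_rate impressions.
    """
    spans = {}
    for ts, banner, uri_host, sample_rate in events:
        if matches(banner, uri_host):
            spans.setdefault(span_start(ts), Counter())[banner] += 1 / sample_rate
    return spans


def to_csv(counts) -> str:
    """Render one 'banner name,impression count' row per banner."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    for banner, count in sorted(counts.items()):
        writer.writerow([banner, round(count)])
    return buf.getvalue()


def is_expired(name: str, now: datetime) -> bool:
    """True if the span encoded in the file name began more than 30 days ago."""
    return now - datetime.strptime(name, NAME_FORMAT) > RETENTION
```

A scheduled job could call `aggregate` on the events of the most recent span, write `to_csv(counts)` to `file_name(start)` for each span, and delete every published file for which `is_expired` returns true.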