We had a meeting this morning with the people CC'ed on this task to discuss the handoff of pageview reports from Research (sampled logs + R implementation) to Analytics (unsampled logs + UDF).
Starting with April, daily and monthly data will be generated via the UDFs and include the following dimensions:
1. project, e.g. enwiki
2. language, e.g. en
3. period, e.g. 2015-04-01
4. access method (desktop site/ mobile web)
5. country (country_iso, country_name)
6. is_spider
We're handing off the generation of this data to Analytics Eng, and the team will set up systems to allow customers to access this data and compute arbitrary aggregations. As part of this transition, and to ensure we have a complete data series based on the new pageview definition to cover calendar Q1-2015, we would like to request backfilling the data on the staging DB for the entire month of March 2015.
@Eloquence, can you approve this for Oliver to help with this task?
Other minutes from the meeting are here: http://etherpad.wikimedia.org/p/PVTransition