Page MenuHomePhabricator

Story: Community has periodic browser stats report generated from Hadoop data
Closed, ResolvedPublic0 Estimated Story Points

Description

Report browser stats from hadoop data.

Our current browser stats come from squid reports: http://stats.wikimedia.org/wikimedia/squids/SquidReportClients.html

We should work towards replacing these reports with pageview data from hadoop + ua parser.

Erik Z. suggested that we can "pipe data from new input stream into old reports via csv files"


Now available at https://browser-reports.wmflabs.org

Details

Reference
bz67053

Related Objects

Event Timeline

bzimport raised the priority of this task from to Medium.Nov 22 2014, 3:37 AM
bzimport set Reference to bz67053.
bzimport added a subscriber: Unknown Object (MLST).

Erik Z. suggested that we can "pipe data from new input stream into old reports via csv files"

There is no need to do this and new reports can be generated from hadoop directly.

This work is contingent on the work that we are currently doing in hadoop/kafka to productionize the setup and migrate to latest cloudera release.

Re-opening since it is apparently not a duplicate as both have been closed without clear reason.

Milimetric set the point value for this task to 0.
Milimetric moved this task from Incoming to Analytics Query Service on the Analytics board.

Preliminary browser reports are deployed to:

https://browser-reports-test.wmflabs.org/

See task https://phabricator.wikimedia.org/T130405 for UI improvements currently worked on

Krinkle set Security to None.
Addshore added a subscriber: Addshore.

@Nuria please see T130102.
Would it be possible to split off wikidata data so that it can be looked at separately?

Updated related ticket, closing. this one.