Page MenuHomePhabricator

Make mediacounts available in Wikimedia Labs
Closed, ResolvedPublic

Description

After T93317, https://wikitech.wikimedia.org/wiki/Analytics/Data/Mediacounts i.e. http://dumps.wikimedia.org/other/mediacounts/ should also be made available on Labs.

mediacounts-stats.py makes querying such datasets quite easy, but downloading the files is tedious. Having data on labs may also make tools for T116363 easier.

Event Timeline

Nemo_bis raised the priority of this task from to Needs Triage.
Nemo_bis updated the task description. (Show Details)
Nemo_bis added subscribers: Nemo_bis, ArielGlenn.

We should just get all Analytics data over to Labs if space allows, which includes the currently unannounced pageviews and projectviews data. Perhaps, data published by the Analytics Team should be automatically added to Labs, reducing the need for separate tasks requesting for this.

This will happen once the labstore1006-7 hosts take over web service from the dataset1001 host, within a month (?) All datasets will automagically be available in labs then. Adding @madhuvishy for a better estimate of the time frame and/or corrections.

Hydriz claimed this task.

Apparently done when T188726 was resolved. Files are in /public/dumps/public/other/mediacounts.