
Get "edits hourly" on a daily basis
Open, High
Public

Assigned To
None
Authored By
JKatzWMF
Sep 3 2019, 8:18 PM
Tokens
"Mountain of Wealth" token, awarded by Pcoombe.
"Mountain of Wealth" token, awarded by phuedx.
"Mountain of Wealth" token, awarded by cmadeo.
"Like" token, awarded by Volker_E.
"100" token, awarded by RHo.

Description

Currently, after we release a change to our site, it can take up to 35 days to see its impact in the edits hourly table. It would be amazingly useful if we could see daily updates without writing a custom script.

I recognize that the mediawiki history job is crunching 15 years of data, but perhaps we could reprocess a shorter period (e.g., only the latest month) or a less granular dataset (daily counts instead of hourly) every 24 hours and stomach the impact on accuracy?
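To make the "daily counts instead of hourly" option concrete, here is a minimal sketch of rolling hourly edit counts up into daily totals. It is an illustration only and not how the mediawiki history job works: the function name `daily_counts`, the `(timestamp, count)` row shape, and the sample values are all hypothetical.

```python
from collections import Counter
from datetime import datetime


def daily_counts(hourly_rows):
    """Sum (ISO-8601 hour timestamp, edit count) pairs into per-day totals.

    The row shape is an assumption made for illustration; the real
    edits hourly table has more dimensions than this.
    """
    totals = Counter()
    for timestamp, count in hourly_rows:
        # Truncate the hourly timestamp to its calendar day.
        day = datetime.fromisoformat(timestamp).date().isoformat()
        totals[day] += count
    # Return a plain dict ordered by day.
    return dict(sorted(totals.items()))


# Made-up sample rows, not real edit data.
hourly = [
    ("2019-09-01T22:00:00", 120),
    ("2019-09-01T23:00:00", 95),
    ("2019-09-02T00:00:00", 80),
]
result = daily_counts(hourly)
```

A rollup like this only reduces granularity. It does not address how quickly the underlying data is refreshed, which is the other half of the request.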

Event Timeline

fdans added a subscriber: fdans.

In the future we would like to do this but right now the edits data is generated on a monthly basis.

kzimmerman triaged this task as Medium priority. Oct 21 2019, 5:22 PM
kzimmerman moved this task from Triage to Icebox on the Product-Analytics board.
kzimmerman added a subscriber: kzimmerman.

@JKatzWMF @Tnegrin this isn't feasible at the moment; moving to our icebox to reconsider in the future.

Hi! I just want to post here and say that since I've been using Turnilo, I definitely want this data to be updated more frequently -- even weekly would be a great improvement. My use case is that I check Turnilo to see how many suggested edits (based on edit tags) are being done in our target wikis.

RHo added a subscriber: RHo.
Milimetric raised the priority of this task from Medium to High. Mar 9 2020, 4:46 PM
Milimetric moved this task from Backlog (Later) to Smart Tools for Better Data on the Analytics board.

@JKatzWMF @Tnegrin I wanted to give you a heads up that this task has moved out of Analytics Engineering's backlog and now falls under their work to make incremental updates available in the data lake (https://phabricator.wikimedia.org/T258511).