pagecounts-raw missing since 5th August
Closed, Declined · Public

Description

Currently, we consume the pagecounts-raw files that you generate every day as an input to our Wikipedia infrastructure.

https://dumps.wikimedia.org/other/pagecounts-raw/2016/2016-08/

We have noticed that there have been no new files since the 5th, and because of that we are having issues on our side.

I would like to know whether this issue has already been noticed on your side and whether any action is under way to fix the generation of these files.

Thanks

Event Timeline

DianaArq created this task. Aug 10 2016, 1:33 PM
Restricted Application added a subscriber: Aklapper. Aug 10 2016, 1:33 PM

I see this commit in puppet on Aug 4th, which nicely matches the date that new files stopped showing up on dumps.wm.o:

Remove pagecounts-[raw|all-sites] related code

See https://gerrit.wikimedia.org/r/#/c/302932/ for the details.

What files should people now be using/downloading?

Adding @JAllemandou who authored that changeset.

Hi,
pagecounts-raw and pagecounts-all-sites have been deprecated in favor of pageviews (see the announcement on the analytics mailing list: http://analytics.wikimedia.narkive.com/P1Rrr2oz/pagecount-datasets-to-be-deprecated-at-the-end-of-may).
For new data, you can find pageviews (more accurate, same format as pagecounts-raw) here: https://dumps.wikimedia.org/other/pageviews/
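
Since the format is stated to be the same, an existing pagecounts-raw consumer should only need to point at the new location. For anyone checking their parser, here is a minimal sketch in Python. It assumes each line holds four space-separated fields (project code, page title, view count, response size in bytes), which is my understanding of the pagecounts-raw layout and is not spelled out in this task. The names `PageviewRecord` and `parse_line` are hypothetical and used only for illustration.

```python
from typing import NamedTuple


class PageviewRecord(NamedTuple):
    """One line of an hourly file (field layout is an assumption)."""
    project: str         # project code, e.g. "en"
    title: str           # page title as it appears in the file
    views: int           # number of views in that hour
    response_bytes: int  # total response size in bytes


def parse_line(line: str) -> PageviewRecord:
    """Split one space-separated line into a typed record.

    Raises ValueError if the line does not have exactly four fields
    or if the two numeric fields are not integers.
    """
    parts = line.rstrip("\n").split(" ")
    if len(parts) != 4:
        raise ValueError(f"expected 4 fields, got {len(parts)}: {line!r}")
    project, title, views, size = parts
    return PageviewRecord(project, title, int(views), int(size))
```

Malformed lines raise `ValueError` instead of being skipped silently, so a format change upstream shows up as an error and not as missing data.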

@Shizhao can you please open a task for fixing up wikitrends and notify the owner of the tool if there are still issues? It looks like his email contact is at the bottom of the wikitrends web page.

Sent mail to the gmail address without much hope of a response. We'll see.

No answer to my email.

ArielGlenn closed this task as Declined. Dec 19 2016, 2:26 PM

Summary:

These files were deprecated in favour of https://dumps.wikimedia.org/other/pageviews/ and an announcement was sent to the analytics list. Subsequently production of these files was stopped.

I've tried and failed to reach the owner of the tool reported to be broken by its reliance on these files. I can find no other contact information for the maintainer.

At this point there's no further action we can take on our side. I'm therefore closing this ticket as 'declined'.

ArielGlenn moved this task from Backlog to Done on the Dumps-Generation board. Feb 20 2017, 9:22 PM