Page MenuHomePhabricator

Pageviews-daily broken after move from Pivot to Turnilo
Closed, ResolvedPublic3 Estimated Story Points

Description

Selecting the pageviews-daily dataset from https://turnilo.wikimedia.org results in the following error message:
Query error
could not resolve $view_count

(Besides View Count, there is also a second measure available named Count, but that's likely the wrong one when one is interested in pageview numbers.)

Event Timeline

Thanks a lot for the notification, I think that this is due to the reimage of druid1002 that happened this morning. pageviews-daily is not replicated, so the segments cache is present on one Druid historical only, that I am almost sure was on druid1002. The segments are stored in HDFS, but needs to be reloaded and this is happening as we speak:

https://grafana.wikimedia.org/dashboard/db/druid?refresh=1m&panelId=50&fullscreen&orgId=1

pageviews-hourly is for example replicated two times, and it is available via Turnilo (same thing as webrequest).

The follow up for this task is surely figure out what datasets needs to be replicated at least two times to ease the maintenance and/or a random failure of one Druid node.

elukey triaged this task as Medium priority.

This is clearly not right, segments have been loaded and nothing changed. Moreover, from the title (that I didn't pay attention to before) says that it was used to work with pivot but not turnilo, so definitely something that has been happening for a while (not started today).

This is clearly not right, segments have been loaded and nothing changed. Moreover, from the title (that I didn't pay attention to before) says that it was used to work with pivot but not turnilo, so definitely something that has been happening for a while (not started today).

I'm actually not 100% certain when I last saw it working (I do recall checking some pageview date in recent days since the switchover, but probably only used the hourly version).

Thanks for continuing to look into it!

Change 435997 had a related patch set uploaded (by Elukey; owner: Elukey):
[operations/puppet@production] turnilo: disable instrospection autofill-all for pageviews-*

https://gerrit.wikimedia.org/r/435997

Change 435997 merged by Elukey:
[operations/puppet@production] turnilo: disable instrospection autofill-all for pageviews-*

https://gerrit.wikimedia.org/r/435997

The issue should be gone now!

@elukey it is, thank you for the speedy fix!!

elukey set the point value for this task to 3.

Sorry, but the pageviews-daily dataset as it was linked in the task description is still broken: While the quoted error message is gone, it still only offers the "Count" measure (which is rather meaningless and likely to mislead users into believing we get only between 3 and 4 million pageviews per day overall) and not the "View Count" measure that we need.

It seems that the separately listed pageviews_daily cube (underscore instead of dash) is the right one where one can access "View Count" (I have been using it recently, and perhaps also @JKatzWMF when he made the comment above).

In the light of this, it may be best to simply delete pageviews-daily from the list:

Turnilo data cubes 2018-06-02 marked.png (854×1 px, 64 KB)

Thanks a lot for the ping, we indeed have the plan to delete the old dataset but we have not announced the new naming convention for the Druid's datasources yet (all underscores basically). We'd have done it on Monday sending an email to the analytics@ mailing list, apologies for the confusion.

Email to analytics@ sent, closing task!

Vvjjkkii renamed this task from Pageviews-daily broken after move from Pivot to Turnilo to w3baaaaaaa.Jul 1 2018, 1:07 AM
Vvjjkkii reopened this task as Open.
Vvjjkkii removed elukey as the assignee of this task.
Vvjjkkii raised the priority of this task from Medium to High.
Vvjjkkii updated the task description. (Show Details)
Vvjjkkii removed the point value 3 for this task.
Vvjjkkii removed subscribers: gerritbot, Aklapper.
CommunityTechBot renamed this task from w3baaaaaaa to Pageviews-daily broken after move from Pivot to Turnilo.Jul 2 2018, 3:08 AM
CommunityTechBot closed this task as Resolved.
CommunityTechBot assigned this task to elukey.
CommunityTechBot lowered the priority of this task from High to Medium.
CommunityTechBot set the point value for this task to 3.
CommunityTechBot updated the task description. (Show Details)
CommunityTechBot added subscribers: gerritbot, Aklapper.