
Unique devices data for January not in Cassandra
Closed, Resolved | Public | 1 Estimated Story Points

Description

See:
https://wikimedia.org/api/rest_v1/metrics/unique-devices/es.wikipedia.org/all-sites/monthly/2017121400/2018031400

Only data for February is returned

The data is present in Hive, so the load into Cassandra did not happen.
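A minimal sketch of how the gap reported above could be checked programmatically. It rebuilds the request URL from the description and lists the first-of-month timestamps in the requested range that are absent from the response. The helper names (`build_url`, `expected_months`, `missing_months`, `fetch_items`) are hypothetical, and the response shape (an `items` list whose entries carry a `timestamp` starting with `YYYYMMDD`) is an assumption about the AQS payload, not something shown in this task. Months that have not been computed yet (for example the current month) will also be reported as missing.

```python
import json
import urllib.request
from datetime import date

# Base path taken from the URL in the task description.
AQS_BASE = "https://wikimedia.org/api/rest_v1/metrics/unique-devices"


def build_url(project, access_site, granularity, start, end):
    """Build the unique-devices request URL (start/end as YYYYMMDDHH strings)."""
    return f"{AQS_BASE}/{project}/{access_site}/{granularity}/{start}/{end}"


def expected_months(start, end):
    """Return YYYYMMDD stamps for every first-of-month inside [start, end]."""
    year, month = start.year, start.month
    stamps = []
    while date(year, month, 1) <= end:
        first = date(year, month, 1)
        if first >= start:
            stamps.append(first.strftime("%Y%m%d"))
        # Advance to the next calendar month.
        year, month = (year + 1, 1) if month == 12 else (year, month + 1)
    return stamps


def missing_months(items, start, end):
    """List expected monthly stamps that do not appear in the response items.

    Assumption: each item has a "timestamp" string beginning with YYYYMMDD.
    """
    returned = {item["timestamp"][:8] for item in items}
    return [s for s in expected_months(start, end) if s not in returned]


def fetch_items(url):
    """Fetch the response and return its "items" list (assumed payload shape)."""
    req = urllib.request.Request(url, headers={"User-Agent": "aqs-gap-check (example)"})
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)["items"]
```

With only February returned, `missing_months(items, date(2017, 12, 14), date(2018, 3, 14))` would flag `20180101` (and `20180301` if that month has not been loaded yet).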

Event Timeline

cassandra@cqlsh> select * from "local_group_default_T_unique_devices".data where timestamp = '20180201' and project='es.wikipedia' and granularity='monthly' and "_domain"='analytics.wikimedia.org' and "access-site"='all-sites' limit 1 ;

 _domain                 | project      | access-site | granularity | timestamp | _tid                                 | _del | devices
-------------------------+--------------+-------------+-------------+-----------+--------------------------------------+------+-----------
 analytics.wikimedia.org | es.wikipedia | all-sites   | monthly     | 20180201  | 13814000-1dd2-11b2-8080-808080808080 | null | 151476294

(1 rows)
cassandra@cqlsh> select * from "local_group_default_T_unique_devices".data where timestamp = '20180101' and project='es.wikipedia' and granularity='monthly' and "_domain"='analytics.wikimedia.org' and "access-site"='all-sites' limit 1 ;

_domain | project | access-site | granularity | timestamp | _tid | _del | devices
---------+---------+-------------+-------------+-----------+------+------+---------

(0 rows)
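The two manual queries above can be generalized into a per-month presence check. The sketch below assumes a session object with an `execute(query, parameters)` method that accepts `%s` placeholders and returns an iterable of rows, as the DataStax Python `cassandra-driver` session does. The keyspace, table, and column names are copied from the queries in this task; the helper name `months_present` is hypothetical.

```python
# Same lookup as the cqlsh queries above, with the values parameterized.
CHECK_CQL = (
    'SELECT devices FROM "local_group_default_T_unique_devices".data '
    'WHERE "_domain" = %s AND project = %s AND "access-site" = %s '
    'AND granularity = %s AND timestamp = %s LIMIT 1'
)


def months_present(session, project, months,
                   access_site="all-sites",
                   domain="analytics.wikimedia.org"):
    """Map each monthly timestamp (YYYYMMDD string) to True if a row exists.

    `session` is assumed to behave like a cassandra-driver Session:
    execute(query, parameters) returning an iterable of rows.
    """
    present = {}
    for ts in months:
        params = (domain, project, access_site, "monthly", ts)
        rows = list(session.execute(CHECK_CQL, params))
        present[ts] = bool(rows)
    return present
```

Against the state shown above, `months_present(session, "es.wikipedia", ["20180101", "20180201"])` would report January as absent and February as present.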

I double-checked the data:

  • Monthly uniques are missing for 2018-01
  • Daily uniques are not missing

Here is what happened:

I am starting a dedicated coordinator for that specific month now.

Solved: the data is now in Cassandra (we need to wait for cache updates before it becomes visible in AQS).

JAllemandou edited projects, added Analytics-Kanban; removed Analytics.
JAllemandou moved this task from Next Up to Done on the Analytics-Kanban board.
JAllemandou set the point value for this task to 1.

Change 419692 had a related patch set uploaded (by Joal; owner: Joal):
[analytics/refinery@master] Correct cassandra loading monthly jobs

https://gerrit.wikimedia.org/r/419692

Change 419692 merged by Joal:
[analytics/refinery@master] Correct cassandra loading monthly jobs

https://gerrit.wikimedia.org/r/419692