
adding dcausse to analytics-privatedata-users (hive and webrequests)
Closed, Resolved, Public


I'd like access to the webrequests data in Hive in order to analyze search traffic, redirect usage, and other search-related data.
My shell account is dcausse.

I have access to stat1002, but when I run a Hive query it fails with this error:

hive (wmf)> SELECT
          >     project,
          >     page_title,
          >     SUM(view_count)
          > FROM wmf.pageview_hourly
          >     WHERE year = 2015
          >     AND month = 9
          >     AND day = 22
          >     AND page_title IN ('Girafe', 'Giraffe')
          > GROUP BY
          >     project,
          >     page_title;
Query ID = dcausse_20151005120505_d9a685d3-f1af-481f-8dc7-4b1ca3fd0956
Total jobs = 1
Launching Job 1 out of 1
Number of reduce tasks not specified. Estimated from input data size: 42
In order to change the average load for a reducer (in bytes):
  set hive.exec.reducers.bytes.per.reducer=<number>
In order to limit the maximum number of reducers:
  set hive.exec.reducers.max=<number>
In order to set a constant number of reducers:
  set mapreduce.job.reduces=<number>
Permission denied: user=dcausse, access=WRITE, inode="/user":hdfs:hadoop:drwxrwxr-x
[full stack omitted]
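
For readers unfamiliar with this kind of message, the following is a minimal Python sketch of how the denial can be read, assuming HDFS applies the usual POSIX-style owner/group/other check. The function names `parse_denial` and `permitted` are hypothetical and exist only for illustration, and the group memberships passed in are assumptions: the task does not say which HDFS groups dcausse belongs to, or how analytics-privatedata-users maps onto them.

```python
import re

# Error text copied verbatim from the Hive output in this task.
ERROR = ('Permission denied: user=dcausse, access=WRITE, '
         'inode="/user":hdfs:hadoop:drwxrwxr-x')

# Fields: requesting user, requested access, path, owner, group, mode string.
PATTERN = re.compile(
    r'user=(?P<user>[^,]+), access=(?P<access>\w+), '
    r'inode="(?P<path>[^"]*)":(?P<owner>[^:]+):(?P<group>[^:]+):'
    r'(?P<mode>[-a-z]{10})'
)


def parse_denial(message):
    """Split the error message into its named fields (hypothetical helper)."""
    match = PATTERN.search(message)
    if match is None:
        raise ValueError("unrecognized permission error format")
    return match.groupdict()


def permitted(info, user_groups):
    """POSIX-style check: choose owner, group, or other bits, then test them.

    Only the single actions READ, WRITE, and EXECUTE are handled here.
    """
    bits = info["mode"][1:]  # drop the leading file-type character ("d")
    if info["user"] == info["owner"]:
        triplet = bits[0:3]
    elif info["group"] in user_groups:
        triplet = bits[3:6]
    else:
        triplet = bits[6:9]
    wanted = {"READ": "r", "WRITE": "w", "EXECUTE": "x"}[info["access"]]
    return wanted in triplet


info = parse_denial(ERROR)
# Assumption: the user is not in the "hadoop" group, so the "other" bits apply.
print(permitted(info, user_groups=set()))
# Assumption: the user is in the "hadoop" group, so the group bits apply.
print(permitted(info, user_groups={"hadoop"}))
```

Under the first assumption the "other" bits `r-x` apply and the write is refused, which matches the error shown above. Under the second the group bits `rwx` would allow it.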


Event Timeline

dcausse raised the priority of this task to Needs Triage.
dcausse updated the task description.
dcausse added subscribers: dcausse, Tfinc.
revi added a subscriber: revi.

Change 243686 had a related patch set uploaded (by John F. Lewis):
admin: add dcausse to analytics-privatedata-users


Please have your manager approve your addition to the analytics-privatedata-users group. (This is what allows access to the hive data.)

Once manager approval has been added, the request is still subject to the 3-day waiting period. If no objections are raised by 2015-10-08 (Thursday), John's patch can be merged.

RobH renamed this task from "Access request to hive and webrequests" to "adding dcausse to analytics-privatedata-users (hive and webrequests)". Oct 5 2015, 9:15 PM

Summary: All manager approvals are in for this patchset. The 3-day wait expires on 2015-10-08 (Thursday). If there are no objections by then, an opsen can merge. (An opsen has to check for objections, as they can be on hidden sub-tasks.)

RobH triaged this task as Medium priority. Oct 6 2015, 9:30 PM

Change 243686 abandoned by Cmjohnson:
admin: add dcausse to analytics-privatedata-users

Cmjohnson claimed this task.
Cmjohnson added a subscriber: Cmjohnson.

It has been 3 days and I did not see any objections. Merged the change.