I'd like to have access to webrequests data in hive in order to analyze search traffic, redirects usage and various data related to search.
My shell account is dcausse.
I have access to stat1002 but when I want to run a hive query it fails with this error:
hive (wmf)> SELECT > project, > page_title, > SUM(view_count) > FROM wmf.pageview_hourly > WHERE year = 2015 > AND month = 9 > AND day = 22 > AND page_title IN ('Girafe', 'Giraffe') > GROUP BY > project, > page_title; Query ID = dcausse_20151005120505_d9a685d3-f1af-481f-8dc7-4b1ca3fd0956 Total jobs = 1 Launching Job 1 out of 1 Number of reduce tasks not specified. Estimated from input data size: 42 In order to change the average load for a reducer (in bytes): set hive.exec.reducers.bytes.per.reducer=<number> In order to limit the maximum number of reducers: set hive.exec.reducers.max=<number> In order to set a constant number of reducers: set mapreduce.job.reduces=<number> org.apache.hadoop.security.AccessControlException: Permission denied: user=dcausse, access=WRITE, inode="/user":hdfs:hadoop:drwxrwxr-x [full stack omitted]
Thanks.