I'd like access to webrequests data in Hive in order to analyze search traffic, pageviews, and other search-related data.
My shell account is tjones.
I have access to stat1002, but when I run a Hive query it fails with this error:
hive (wmf)> SELECT
          > concat(month,'/',day,'/',year), access_method, sum(view_count)
          > FROM
          > wmf.pageview_hourly
          > WHERE
          > year = 2015
          > AND month = 8
          > AND agent_type = "user"
          > AND country = "Canada"
          > AND project = "en.wikipedia"
          > GROUP BY
          > year, month, day, access_method;
Query ID = tjones_20151019161212_618eb127-1c8a-43e4-9291-8863067aea0d
Total jobs = 1
Launching Job 1 out of 1
Number of reduce tasks not specified. Estimated from input data size: 1009
In order to change the average load for a reducer (in bytes):
  set hive.exec.reducers.bytes.per.reducer=<number>
In order to limit the maximum number of reducers:
  set hive.exec.reducers.max=<number>
In order to set a constant number of reducers:
  set mapreduce.job.reduces=<number>
org.apache.hadoop.security.AccessControlException: Permission denied: user=tjones, access=WRITE, inode="/user":hdfs:hadoop:drwxrwxr-x
[...etc.]