Page MenuHomePhabricator

analytics-privatedata-users access for Jeff Green
Closed, ResolvedPublic

Description

I'm requesting analytics access for ongoing maintenance purposes. It's helpful to be able to cross-check fundraising analytics against what's in hive.

Event Timeline

This guy seems trustable, approved :)

@Jgreen Do you just need Hive/Hadoop or do you additionally need sampled webrequest logs and stat boxes with private data? Asking this way because the description for the requested group says "This group should not be used just to grant someone Hadoop access." and you mention just Hive.

Yea, aware Jeff has root on the mentioned stat boxes anyways, heh.

@Dzahn I am pretty sure that Jeff needs access to webrequest data due to the fact that they maintain a kafka consumer in the fundraising cluster and they need to verify/double-check the consistency of their data. Assigning directly to Jeff since it is only a matter of adding himself to the right group and merge the puppet change :)

Change 383604 had a related patch set uploaded (by Jgreen; owner: Jgreen):
[operations/puppet@production] add jgreen to analytics-privatedata-users

https://gerrit.wikimedia.org/r/383604

Change 383604 merged by Jgreen:
[operations/puppet@production] add jgreen to analytics-privatedata-users

https://gerrit.wikimedia.org/r/383604

We concluded analytics-privatedata-user makes sense, so I can use hive to come up with hourly hit counts for specific domains, to compare against what we're collecting via kafkatee.

@Dzahn I am pretty sure that Jeff needs access to webrequest data due to the fact that they maintain a kafka consumer in the fundraising cluster and they need to verify/double-check the consistency of their data. Assigning directly to Jeff since it is only a matter of adding himself to the right group and merge the puppet change :)