Page MenuHomePhabricator

Requesting access to EventLogging data for knissen
Closed, ResolvedPublicRequest

Description

  • Wikitech username: Kai Nissen (WMDE)
  • Preferred shell username: knissen
  • Email address: kai.nissen@wikimedia.de
  • Ssh public key (must be dedicated key for wmf production): attached
  • Requested group membership: analytics-privatedata-users
  • Reason for access: As an employee of WMDE, I would like to request access to view EventLogging data for analyzing WMDE's fundraising banner campaigns. I already signed an NDA with WMF legal.
  • Name of approving party (hiring manager, for WMF staff): @Franziska_Heine

SSH Public Key

SRE Clinic Duty Checklist for Access Requests

Most requirements are outlined on https://wikitech.wikimedia.org/wiki/Requesting_shell_access

This checklist should be used on all access requests to ensure that all steps are covered. This includes expansion to access. Please do not check off items on the list below unless you are in Ops and have confirmed the step.

  • - User has signed the L3 Acknowledgement of Wikimedia Server Access Responsibilities Document.
  • - User has a valid NDA on file with WMF legal. (This can be checked by Operations via the NDA tracking sheet & is included in all WMF Staff/Contractor hiring.)
  • - User has provided the following: wikitech username, preferred shell username, email address, and full reasoning for access (including what commands and/or tasks they expect to perform)
  • - User has provided a public SSH key. This ssh key pair should only be used for WMF cluster access, and not share with any other service (this includes not sharing with WMCS access, no shared keys.)
  • - access request (or expansion) has sign off of WMF sponsor/manager (sponser for volunteers, manager for wmf staff)
  • - non-sudo requests: 3 business day wait must pass with no objections being noted on the task
  • - Patchset for access request

Event Timeline

Restricted Application added a subscriber: Aklapper. · View Herald Transcript
herron triaged this task as Medium priority.Jan 3 2020, 7:43 PM

@RStallman-legalteam Kai has signed an NDA with WMF Legal back in September 2017 (see T168046#3591921). Can i assume that is still valid?

Hi, @kai.nissen i'll handle this, i'm on clinic duty this week.

+@Nuria @elukey for analytics data requests

Does "view EventLogging data for analyzing WMDE's fundraising banner campaigns" require shell access today as it did in 2017?

Does "view EventLogging data for analyzing WMDE's fundraising banner campaigns" require shell access today as it did in 2017?

I'd assume so!

Since we've we no longer use MySQL for EventLogging, they'll need Hive access. 'analytics-privatedata-users' should be the right group.

Thanks for the clarification @Ottomata , alright!

Adjusted requested groups. So:

NOT researchers, analytics

but instead ONLY analytics-privatedata.

@kai.nissen See above, you'll need to use Hive nowadays instead of MySQL.

See https://wikitech.wikimedia.org/wiki/Analytics/Systems/Cluster/Hive

(and ACK, it says "either the analytics-privatedata-users or the analytics-users user group" on that page)

Yes, the NDA on file will suffice. Thanks!

@kai.nissen Have you ever used hadoop/hive before? The data is no longer in a MySQL data store.

Change 562940 had a related patch set uploaded (by Dzahn; owner: Dzahn):
[operations/puppet@production] admins: add Kai Nissen to analytics-privatedata-users

https://gerrit.wikimedia.org/r/562940

@Dzahn Alright, thanks!
@Nuria No, I've never used it before, but I'll make myself familiar with it. There are also quite some people who can help me out at WMDE.

Dzahn removed Dzahn as the assignee of this task.Jan 14 2020, 6:49 PM
Dzahn assigned this task to Muehlenhoff.
Dzahn subscribed.

@kai.nissen Please read data access gudelines, https://wikitech.wikimedia.org/wiki/Analytics/Data_Access_Guidelines and familirize yourself with hadoop, it is quite different from hive, @Addshore in WMDE can help.

Access approved.

Change 562940 merged by Muehlenhoff:
[operations/puppet@production] admins: add Kai Nissen to analytics-privatedata-users

https://gerrit.wikimedia.org/r/562940

MoritzMuehlenhoff subscribed.

@kai.nissen Your access is now enabled (but it can take up to 30 minutes until it has propagated to all servers), let me know (reopen this task or ping me on IRC (moritzm)) if you run into any issues logging in. If you have any questions wrt Hadoop access, best to ask the #wikimedia-analytics channel on IRC.

@kai.nissen You should have also received a mail for your Kerberos account (required to access Hadoop) with further instructions.