
Requesting Creation of a new POSIX group and system user for the Analytics WMDE team.
Closed, Resolved | Public | Request

Description

Requestor provided information and prerequisites

As part of T342331 [EPIC] Set up a sustainable tech stack for Wikidata Analytics, the WMDE team requires an Airflow instance to run various analytics jobs; that work is tracked in T340648 [Airflow] Setup Airflow instance for WMDE.
We need a system user to help manage the WMDE-related analytics/data jobs and the admin tasks around them, such as starting, stopping, and restarting Airflow services.
Thus we need to create a system user analytics-wmde with the same uid/gid across nodes (airflow, stat100x, hadoop worker nodes, etc.). We then need to add this system user to the analytics-privatedata-users group so that it can carry out functions such as submitting jobs to YARN and running regular Airflow services maintenance.
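
The "same uid/gid across nodes" requirement can be spot-checked once the user is rolled out. The following is only a minimal sketch, not part of the puppet change: it assumes you have collected the output of `getent passwd analytics-wmde` from each node by some other means, and the host names, numeric IDs, home directory, and shell in the sample are hypothetical placeholders.

```python
from typing import Dict, NamedTuple, Tuple


class Account(NamedTuple):
    uid: int
    gid: int


def parse_passwd_line(line: str) -> Tuple[str, Account]:
    """Parse one passwd(5) line: name:password:UID:GID:GECOS:home:shell."""
    fields = line.strip().split(":")
    if len(fields) != 7:
        raise ValueError(f"expected 7 colon-separated fields, got {len(fields)}")
    return fields[0], Account(uid=int(fields[2]), gid=int(fields[3]))


def ids_by_host(user: str, passwd_line_by_host: Dict[str, str]) -> Dict[str, Account]:
    """Map each host to the (uid, gid) it reports for `user`."""
    result: Dict[str, Account] = {}
    for host, line in passwd_line_by_host.items():
        name, account = parse_passwd_line(line)
        if name != user:
            raise ValueError(f"{host}: entry is for {name!r}, not {user!r}")
        result[host] = account
    return result


def is_consistent(accounts: Dict[str, Account]) -> bool:
    """True when every host reports the same (uid, gid) pair."""
    return len(set(accounts.values())) <= 1


# Hypothetical sample: host names and numeric IDs are placeholders,
# not the values assigned in the actual puppet change.
SAMPLE = {
    "host-a": "analytics-wmde:x:990:990::/nonexistent:/bin/false",
    "host-b": "analytics-wmde:x:990:990::/nonexistent:/bin/false",
    "host-c": "analytics-wmde:x:991:990::/nonexistent:/bin/false",
}

if __name__ == "__main__":
    accounts = ids_by_host("analytics-wmde", SAMPLE)
    for host, account in sorted(accounts.items()):
        print(host, account.uid, account.gid)
    print("consistent:", is_consistent(accounts))
```

In the sample, host-c reports a different uid, so the check flags the set as inconsistent. This is the kind of drift that declaring the user once in the central admin data is meant to prevent.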

Next we shall add a group of users, approved by the WMDE Engineering Manager, for:

  1. Managing WMDE-related analytics/data jobs as Analytics WMDE Airflow admins, using the system user analytics-wmde.
  2. Deploying Airflow DAGs (the same group of users shall be used for this).

Event Timeline

Stevemunene renamed this task from "Requesting access to RESOURCE for USER[S]" to "Requesting Creation of a new POSIX group and system user for the Analytics WMDE team." (Sep 6 2023, 12:01 PM)
Stevemunene updated the task description.
Stevemunene added a project: Data-Platform-SRE.

Change 949001 had a related patch set uploaded (by Stevemunene; author: Stevemunene):

[operations/puppet@production] [WIP] admin: Create analytics-wmde system user and airflow admin group

https://gerrit.wikimedia.org/r/949001

RLazarus added subscribers: joanna_borun, RLazarus.

Hi @joanna_borun -- does this need Infrastructure Foundations approval?

I think you're right. We added a similar system group for another Airflow instance back in May (https://gerrit.wikimedia.org/r/c/operations/puppet/+/914788) and that required discussion in the I/F meeting, as I understand it.

There's a slight difference with this request: analytics-wmde already exists as a normal user account and group.
This analytics-wmde user and group exist only on stat1007 at the moment. See https://gerrit.wikimedia.org/r/plugins/gitiles/operations/puppet/+/refs/heads/production/modules/statistics/manifests/wmde.pp
However, this account no longer has any active jobs running on stat1007, so we are going to reuse the name of the user and group for running Airflow.

Hi @odimitrijevic,
Requesting approval for adding the analytics-wmde user to the analytics-privatedata-users group for T340648.

We discussed this in our Monday I/F meeting and approved it.

Change 949001 merged by CDanis:

[operations/puppet@production] admin: Create analytics-wmde system user and airflow admin group

https://gerrit.wikimedia.org/r/949001

Will be live in half an hour.