Page MenuHomePhabricator

Check home/HDFS leftovers of eyener
Closed, ResolvedPublic

Description

The access for Erin Yener was removed. It needs to be checked if data was left in home dirs on stat*/HDFS since they were part of the "analytics-privatedata-users" group.

The Kerberos principal has already been removed.

Related Objects

StatusSubtypeAssignedTask
OpenNone
ResolvedBTullis

Event Timeline

Ottomata added subscribers: Unknown Object (User), Ottomata.Nov 29 2022, 6:04 PM

Hi @jrobell1, the following files are leftover in eyener's home directories on the stat boxes. Do you approve their removal? We can archive things that need to be kept, but we'd prefer to remove.

====== stat1005 ======
total 40
-rw-r--r-- 1 22127 wikidev  4642 Aug 17  2020 banner_history_test.ipynb
-rw-r--r-- 1 22127 wikidev 10334 Aug 17  2020 banner_history_utils.py
-rw-r--r-- 1 22127 wikidev  7117 Aug 17  2020 civi_utils.py
-rw-r--r-- 1 22127 wikidev  2474 Aug 17  2020 get_banner_history_data.py
drwxr-xr-x 2 22127 wikidev  4096 Aug 17  2020 __pycache__
drwxr-xr-x 7 22127 wikidev  4096 Aug 17  2020 venv


====== stat1007 ======
total 144
-rw-r--r-- 1 22127 wikidev  2024 Aug 19  2020 amount_analysis.py
-rw-r--r-- 1 22127 wikidev  6809 Aug 19  2020 banner_history_test.ipynb
-rw-r--r-- 1 22127 wikidev 10409 Aug 19  2020 banner_history_utils.py
-rw-r--r-- 1 22127 wikidev  8941 Aug 19  2020 campaign_analysis.py
-rw-rw-r-- 1 22127 wikidev 23105 Feb  9  2020 central_notice_banner_history_2019.hql
-rw-r--r-- 1 22127 wikidev  7205 Aug 19  2020 civi_utils.py
-rw-r--r-- 1 22127 wikidev  2541 Aug 19  2020 conversion_analysis.py
-rw-r--r-- 1 22127 wikidev  6294 Aug 19  2020 db_utils.py
-rw-r--r-- 1 22127 wikidev  2474 Aug 19  2020 get_banner_history_data.py
-rw-r--r-- 1 22127 wikidev  2321 Aug 19  2020 get_multilingual_prod_db.py
-rw-r--r-- 1 22127 wikidev     0 Aug 19  2020 __init__.py
-rw-r--r-- 1 22127 wikidev   702 Aug 19  2020 plot_utils.py
drwxr-xr-x 2 22127 wikidev  4096 Aug 19  2020 __pycache__
-rw-r--r-- 1 22127 wikidev  1207 Aug 19  2020 refresh_campaign.py
-rw-r--r-- 1 22127 wikidev  6756 Aug 19  2020 sklearn_utils.py
-rw-r--r-- 1 22127 wikidev  1120 Aug 19  2020 spark_util.py
-rw-r--r-- 1 22127 wikidev 13447 Aug 19  2020 sqoop_utils.py
-rw-r--r-- 1 22127 wikidev  3709 Aug 19  2020 stats_utils.py
-rw-r--r-- 1 22127 wikidev   821 Aug 19  2020 tensorflow_utils.py
drwxr-xr-x 7 22127 wikidev  4096 May 13  2020 venv
-rw-r--r-- 1 22127 wikidev  3957 Aug 19  2020 wikidata_utils.py

We have waited six months for approval to delete these files, without response.
@jrobell1 who was the director of Fundaraising has since left the foundation, so I will try to find out if there is anyone else who might be able to authorise removal of the files.

I have asked a question in the #talk-to-fundraising Slack channel: https://wikimedia.slack.com/archives/C01DNV8NRUG/p1684412485486199

I have received confirmation from @JMando that we can delete these files.

BTullis claimed this task.
btullis@cumin1001:~$ sudo cumin 'C:profile::analytics::cluster::client or C:profile::hadoop::master or C:profile::hadoop::master::standby' 'rm -rf /home/eyener'
16 hosts will be targeted:
an-coord[1001-1002].eqiad.wmnet,an-launcher1002.eqiad.wmnet,an-master[1001-1002].eqiad.wmnet,an-test-client[1001-1002].eqiad.wmnet,an-test-coord1001.eqiad.wmnet,an-test-master[1001-1002].eqiad.wmnet,stat[1004-1009].eqiad.wmnet
OK to proceed on 16 hosts? Enter the number of affected hosts to confirm or "q" to quit: 16
===== NO OUTPUT =====                                                                                                                                                                                              
PASS |███████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 100% (16/16) [00:04<00:00,  3.49hosts/s]
FAIL |                                                                                                                                                                            |   0% (0/16) [00:04<?, ?hosts/s]
100.0% (16/16) success ratio (>= 100.0% threshold) for command: 'rm -rf /home/eyener'.
100.0% (16/16) success ratio (>= 100.0% threshold) of nodes successfully executed all commands.