The access for Erin Yener was removed. It needs to be checked if data was left in home dirs on stat*/HDFS since they were part of the "analytics-privatedata-users" group.
The Kerberos principal has already been removed.
The access for Erin Yener was removed. It needs to be checked if data was left in home dirs on stat*/HDFS since they were part of the "analytics-privatedata-users" group.
The Kerberos principal has already been removed.
Status | Subtype | Assigned | Task | ||
---|---|---|---|---|---|
Open | None | T323884 Cleanup User Hive Databases | |||
Resolved | BTullis | T316072 Check home/HDFS leftovers of eyener |
Hi @jrobell1, the following files are leftover in eyener's home directories on the stat boxes. Do you approve their removal? We can archive things that need to be kept, but we'd prefer to remove.
====== stat1005 ====== total 40 -rw-r--r-- 1 22127 wikidev 4642 Aug 17 2020 banner_history_test.ipynb -rw-r--r-- 1 22127 wikidev 10334 Aug 17 2020 banner_history_utils.py -rw-r--r-- 1 22127 wikidev 7117 Aug 17 2020 civi_utils.py -rw-r--r-- 1 22127 wikidev 2474 Aug 17 2020 get_banner_history_data.py drwxr-xr-x 2 22127 wikidev 4096 Aug 17 2020 __pycache__ drwxr-xr-x 7 22127 wikidev 4096 Aug 17 2020 venv ====== stat1007 ====== total 144 -rw-r--r-- 1 22127 wikidev 2024 Aug 19 2020 amount_analysis.py -rw-r--r-- 1 22127 wikidev 6809 Aug 19 2020 banner_history_test.ipynb -rw-r--r-- 1 22127 wikidev 10409 Aug 19 2020 banner_history_utils.py -rw-r--r-- 1 22127 wikidev 8941 Aug 19 2020 campaign_analysis.py -rw-rw-r-- 1 22127 wikidev 23105 Feb 9 2020 central_notice_banner_history_2019.hql -rw-r--r-- 1 22127 wikidev 7205 Aug 19 2020 civi_utils.py -rw-r--r-- 1 22127 wikidev 2541 Aug 19 2020 conversion_analysis.py -rw-r--r-- 1 22127 wikidev 6294 Aug 19 2020 db_utils.py -rw-r--r-- 1 22127 wikidev 2474 Aug 19 2020 get_banner_history_data.py -rw-r--r-- 1 22127 wikidev 2321 Aug 19 2020 get_multilingual_prod_db.py -rw-r--r-- 1 22127 wikidev 0 Aug 19 2020 __init__.py -rw-r--r-- 1 22127 wikidev 702 Aug 19 2020 plot_utils.py drwxr-xr-x 2 22127 wikidev 4096 Aug 19 2020 __pycache__ -rw-r--r-- 1 22127 wikidev 1207 Aug 19 2020 refresh_campaign.py -rw-r--r-- 1 22127 wikidev 6756 Aug 19 2020 sklearn_utils.py -rw-r--r-- 1 22127 wikidev 1120 Aug 19 2020 spark_util.py -rw-r--r-- 1 22127 wikidev 13447 Aug 19 2020 sqoop_utils.py -rw-r--r-- 1 22127 wikidev 3709 Aug 19 2020 stats_utils.py -rw-r--r-- 1 22127 wikidev 821 Aug 19 2020 tensorflow_utils.py drwxr-xr-x 7 22127 wikidev 4096 May 13 2020 venv -rw-r--r-- 1 22127 wikidev 3957 Aug 19 2020 wikidata_utils.py
We have waited six months for approval to delete these files, without response.
@jrobell1 who was the director of Fundaraising has since left the foundation, so I will try to find out if there is anyone else who might be able to authorise removal of the files.
I have asked a question in the #talk-to-fundraising Slack channel: https://wikimedia.slack.com/archives/C01DNV8NRUG/p1684412485486199
btullis@cumin1001:~$ sudo cumin 'C:profile::analytics::cluster::client or C:profile::hadoop::master or C:profile::hadoop::master::standby' 'rm -rf /home/eyener' 16 hosts will be targeted: an-coord[1001-1002].eqiad.wmnet,an-launcher1002.eqiad.wmnet,an-master[1001-1002].eqiad.wmnet,an-test-client[1001-1002].eqiad.wmnet,an-test-coord1001.eqiad.wmnet,an-test-master[1001-1002].eqiad.wmnet,stat[1004-1009].eqiad.wmnet OK to proceed on 16 hosts? Enter the number of affected hosts to confirm or "q" to quit: 16 ===== NO OUTPUT ===== PASS |███████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 100% (16/16) [00:04<00:00, 3.49hosts/s] FAIL | | 0% (0/16) [00:04<?, ?hosts/s] 100.0% (16/16) success ratio (>= 100.0% threshold) for command: 'rm -rf /home/eyener'. 100.0% (16/16) success ratio (>= 100.0% threshold) of nodes successfully executed all commands.