I checked that those tables:
- Are not active: currently not ingesting data (contenttranslation stopped at 1st Oct, flowreplies has no data for last 90 days, changeslistfiltergrouping has 1 hour of data on 24th Nov)
- Are not in the event sanitization allow-list (the sanitization algorithm will not look for them when executing).
- Are not listed as Event Platform streams in InitialiseSettings.php (in mediawiki-config).
I think we can proceed with deletion.
The procedure for deletion should be:
# ssh to a machine with hdfs superuser kerberos credentials ssh an-launcher1002.eqiad.wmnet # drop hive tables sudo -u hdfs hive drop table event.flowreplies; drop table event.changeslistfiltergrouping; drop table event.contenttranslation; exit; # drop data sudo -u hdfs hdfs dfs -rm -r /wmf/data/event/flowreplies sudo -u hdfs hdfs dfs -rm -r /wmf/data/event/changeslistfiltergrouping sudo -u hdfs hdfs dfs -rm -r /wmf/data/event/contenttranslation