Add logic to purging scripts that requires admin action if it's about to delete a lot of data
Closed, Resolved | Public | 3 Estimated Story Points

Description

Any script that deletes data as part of our data retention policies should include a safety check that evaluates whether the data about to be deleted looks like it shouldn't be deleted. The most obvious variable to check is the size in bytes of whatever will be removed. If the check's criteria are met (for example, the size exceeds a limit), purging should stop and the team should be emailed.
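A minimal sketch of such a check, in Python, might look like the following. It is only an illustration of the idea described above, not the code that was merged for this task: the names `check_purge_size`, `directory_size_bytes`, `PurgeLimitExceeded`, and the `notify` callback are all hypothetical, and the size is computed on a local filesystem rather than on HDFS.

```python
import os
from typing import Callable, Iterable


class PurgeLimitExceeded(Exception):
    """Hypothetical error: the purge would delete more data than allowed."""


def directory_size_bytes(path: str) -> int:
    """Return the total size in bytes of the regular files under `path`.

    Assumption: a local directory tree. A real purge job would ask the
    storage system (for example HDFS) for this number instead.
    """
    total = 0
    for root, _dirs, files in os.walk(path):
        for name in files:
            file_path = os.path.join(root, name)
            if not os.path.islink(file_path):
                total += os.path.getsize(file_path)
    return total


def check_purge_size(
    paths: Iterable[str],
    max_bytes: int,
    notify: Callable[[str], None],
    size_of: Callable[[str], int] = directory_size_bytes,
) -> int:
    """Sum the size of everything about to be deleted and enforce a limit.

    If the total exceeds `max_bytes`, call `notify` (a stand-in for
    emailing the team) and raise, so the caller never reaches the delete
    step. Otherwise return the total size in bytes.
    """
    total = sum(size_of(path) for path in paths)
    if total > max_bytes:
        message = (
            f"Purge aborted: {total} bytes selected for deletion, "
            f"limit is {max_bytes} bytes."
        )
        notify(message)
        raise PurgeLimitExceeded(message)
    return total
```

A purge script would call `check_purge_size` after selecting the paths to delete and before deleting anything, so that an unexpectedly large selection stops the run and requires someone to look at it. Passing `size_of` and `notify` as parameters is a design choice of this sketch, made so the check can be exercised without touching real data or sending real mail.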

Event Timeline

fdans moved this task from Incoming to Security Maturity and Data Privacy on the Analytics board.
mforns added a project: Analytics-Kanban.
mforns moved this task from Next Up to In Progress on the Analytics-Kanban board.

Change 694547 had a related patch set uploaded (by Mforns; author: Mforns):

[analytics/refinery@master] Add dropped partitions and deleted directory size limits

https://gerrit.wikimedia.org/r/694547

EChetty set the point value for this task to 3. (Jun 30 2022, 5:17 PM)

Change 813921 had a related patch set uploaded (by Mforns; author: Mforns):

[operations/puppet@production] analytics:refinery:job:data_purge: Add --allowed-interval to deletion jobs

https://gerrit.wikimedia.org/r/813921

Mentioned in SAL (#wikimedia-analytics) [2022-08-10T17:03:00Z] <ottomata> stopping puppet and drop data timers on an-launcher1002 and an-test-coord1001 to deploy drop script changes - T270433

Change 694547 merged by Ottomata:

[analytics/refinery@master] Add safety limits to refinery-drop-older-than

https://gerrit.wikimedia.org/r/694547

Mentioned in SAL (#wikimedia-operations) [2022-08-10T17:08:08Z] <otto@deploy1002> Started deploy [analytics/refinery@d4dd7e4] (hadoop-test): Add safety limits to refinery-drop-older-than - T270433 - TEST [analytics/refinery@d4dd7e4]

Change 813921 merged by Ottomata:

[operations/puppet@production] analytics:refinery:job:data_purge: Add --allowed-interval to deletion jobs

https://gerrit.wikimedia.org/r/813921

Change 822122 had a related patch set uploaded (by Mforns; author: Mforns):

[analytics/refinery@master] Add missing changes to the deletion script

https://gerrit.wikimedia.org/r/822122

Change 822122 merged by Mforns:

[analytics/refinery@master] Add missing changes to the deletion script

https://gerrit.wikimedia.org/r/822122

Mentioned in SAL (#wikimedia-operations) [2022-08-10T17:30:37Z] <otto@deploy1002> Started deploy [analytics/refinery@6e47e0e] (hadoop-test): Add missing changes to the deletion script - T270433 - TEST [analytics/refinery@6e47e0e]

Mentioned in SAL (#wikimedia-operations) [2022-08-10T17:34:56Z] <otto@deploy1002> Finished deploy [analytics/refinery@6e47e0e] (hadoop-test): Add missing changes to the deletion script - T270433 - TEST [analytics/refinery@6e47e0e] (duration: 04m 19s)

Mentioned in SAL (#wikimedia-operations) [2022-08-10T17:36:40Z] <otto@deploy1002> Started deploy [analytics/refinery@6e47e0e]: Add missing changes to the deletion script - T270433 - [analytics/refinery@6e47e0e]

Mentioned in SAL (#wikimedia-operations) [2022-08-10T17:42:09Z] <otto@deploy1002> Finished deploy [analytics/refinery@6e47e0e]: Add missing changes to the deletion script - T270433 - [analytics/refinery@6e47e0e] (duration: 05m 28s)

Change 822408 had a related patch set uploaded (by Mforns; author: Mforns):

[operations/puppet@production] analytics:refinery:job:data_purge: Improve drop-webrequest-sequence-stats

https://gerrit.wikimedia.org/r/822408

Change 822408 merged by Ottomata:

[operations/puppet@production] analytics:refinery:job:data_purge: Improve drop-webrequest-sequence-stats

https://gerrit.wikimedia.org/r/822408