Description
Any script that deletes data as part of our data retention policies should include logic that checks whether the data about to be deleted looks like it shouldn't be deleted. The most obvious variable to check is the size in bytes of whatever will be removed. If the data matches the suspicious criteria (for example, it is larger than an expected limit), purging should stop and the team should be emailed.
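A minimal sketch of the kind of guard described above, not the actual refinery implementation. All names here (`PurgeCandidate`, `guarded_purge`, `max_total_bytes`, and the `delete` and `notify` callbacks) are hypothetical. The sketch assumes the caller already knows the size in bytes of each path and supplies its own functions for deleting and for emailing the team.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List


class PurgeAborted(Exception):
    """Raised when a planned purge exceeds the configured safety limit."""


@dataclass
class PurgeCandidate:
    # Hypothetical record: a path selected for deletion and its size.
    path: str
    size_bytes: int


def guarded_purge(
    candidates: Iterable[PurgeCandidate],
    max_total_bytes: int,
    delete: Callable[[str], None],
    notify: Callable[[str], None],
) -> List[str]:
    """Delete the candidates only if their total size is within the limit.

    `delete` and `notify` are supplied by the caller (assumption): `delete`
    removes one path, `notify` sends a message to the team, e.g. by email.
    """
    planned = list(candidates)
    total_bytes = sum(c.size_bytes for c in planned)

    # Check the whole plan before deleting anything, so that a suspicious
    # purge is stopped without removing any data.
    if total_bytes > max_total_bytes:
        message = (
            f"Purge aborted: about to delete {total_bytes} bytes in "
            f"{len(planned)} paths, limit is {max_total_bytes} bytes."
        )
        notify(message)
        raise PurgeAborted(message)

    deleted = []
    for candidate in planned:
        delete(candidate.path)
        deleted.append(candidate.path)
    return deleted
```

Computing the total up front and raising before the first deletion means an oversized purge removes nothing, and a person has to act (for instance by raising the limit) before it can proceed. Other signals, such as the number of partitions to drop, could be checked in the same way.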
Details
Status | Subtype | Assigned | Task
---|---|---|---
Open | | None | T267355 Traffic anomaly alarms
Resolved | | mforns | T270433 Add logic to purging scripts that requires admin action if it's about to delete a lot of data
Event Timeline
Change 694547 had a related patch set uploaded (by Mforns; author: Mforns):
[analytics/refinery@master] Add dropped partitions and deleted directory size limits
Change 813921 had a related patch set uploaded (by Mforns; author: Mforns):
[operations/puppet@production] analytics:refinery:job:data_purge: Add --allowed-interval to deletion jobs
Mentioned in SAL (#wikimedia-analytics) [2022-08-10T17:03:00Z] <ottomata> stopping puppet and drop data timers on an-launcher1002 and an-test-coord1001 to deploy drop script changes - T270433
Change 694547 merged by Ottomata:
[analytics/refinery@master] Add safety limits to refinery-drop-older-than
Mentioned in SAL (#wikimedia-operations) [2022-08-10T17:08:08Z] <otto@deploy1002> Started deploy [analytics/refinery@d4dd7e4] (hadoop-test): Add safety limits to refinery-drop-older-than - T270433 - TEST [analytics/refinery@d4dd7e4]
Change 813921 merged by Ottomata:
[operations/puppet@production] analytics:refinery:job:data_purge: Add --allowed-interval to deletion jobs
Change 822122 had a related patch set uploaded (by Mforns; author: Mforns):
[analytics/refinery@master] Add missing changes to the deletion script
Change 822122 merged by Mforns:
[analytics/refinery@master] Add missing changes to the deletion script
Mentioned in SAL (#wikimedia-operations) [2022-08-10T17:30:37Z] <otto@deploy1002> Started deploy [analytics/refinery@6e47e0e] (hadoop-test): Add missing changes to the deletion script - T270433 - TEST [analytics/refinery@6e47e0e]
Mentioned in SAL (#wikimedia-operations) [2022-08-10T17:34:56Z] <otto@deploy1002> Finished deploy [analytics/refinery@6e47e0e] (hadoop-test): Add missing changes to the deletion script - T270433 - TEST [analytics/refinery@6e47e0e] (duration: 04m 19s)
Mentioned in SAL (#wikimedia-operations) [2022-08-10T17:36:40Z] <otto@deploy1002> Started deploy [analytics/refinery@6e47e0e]: Add missing changes to the deletion script - T270433 - [analytics/refinery@6e47e0e]
Mentioned in SAL (#wikimedia-operations) [2022-08-10T17:42:09Z] <otto@deploy1002> Finished deploy [analytics/refinery@6e47e0e]: Add missing changes to the deletion script - T270433 - [analytics/refinery@6e47e0e] (duration: 05m 28s)
Change 822408 had a related patch set uploaded (by Mforns; author: Mforns):
[operations/puppet@production] analytics:refinery:job:data_purge: Improve drop-webrequest-sequence-stats
Change 822408 merged by Ottomata:
[operations/puppet@production] analytics:refinery:job:data_purge: Improve drop-webrequest-sequence-stats