The click data is unaggregated and includes PII, so it needs to be deleted after 90 days to comply with our privacy policy. The hourly table is maintained by Hive and typically holds only a day of data. The daily table is currently pruned by manually running a script, which happens only intermittently.
The task is to make the patches necessary for the script to be called automatically on a daily basis. We might as well purge both the hourly and daily tables from the script, just in case our Oozie pipeline fails to delete some hours from Hive (this has happened before).
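
As a rough sketch of what the purge step could look like, the snippet below computes the 90-day cutoff and builds one drop statement per table. The table names (`clicks_hourly`, `clicks_daily`) and the partition column (`dt`) are placeholders, not the real names, and it assumes both tables are date-partitioned so that old data can be removed with Hive's `ALTER TABLE ... DROP PARTITION`. The existing script may work differently.

```python
from datetime import date, timedelta
from typing import List

RETENTION_DAYS = 90  # retention window from the privacy policy

# Hypothetical names for illustration only; substitute the real ones.
TABLES = ["clicks_hourly", "clicks_daily"]
PARTITION_COLUMN = "dt"  # assumed date partition column, formatted YYYY-MM-DD


def cutoff_date(today: date, retention_days: int = RETENTION_DAYS) -> date:
    """Return the oldest date whose data may still be kept."""
    return today - timedelta(days=retention_days)


def purge_statements(today: date) -> List[str]:
    """Build one DROP PARTITION statement per table for data older than the cutoff."""
    cutoff = cutoff_date(today).isoformat()
    return [
        f"ALTER TABLE {table} DROP IF EXISTS "
        f"PARTITION ({PARTITION_COLUMN} < '{cutoff}')"
        for table in TABLES
    ]


if __name__ == "__main__":
    # Print the statements; a real run would hand them to Hive instead.
    for statement in purge_statements(date.today()):
        print(statement + ";")
```

Purging both tables with the same cutoff keeps the script idempotent: if the Oozie pipeline already removed the hourly data, `DROP IF EXISTS` is simply a no-op. Running it once a day from whatever scheduler we already use would cover the "automatic" part of the task.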