We plan on deprecating 4 tables:
- mobile_apps_uniques_daily not updated since 2021-02-18
- mobile_apps_uniques_monthly still active (on 2023-05-01)
- mobile_apps_session_metrics not updated since 2018-03-20
- mobile_apps_session_metrics_by_os still active (on 2023-05-07)
Our plan is to:
- Pause the computation jobs (there's only 2 active, one in Oozie, one in Airflow)
- Create MR to remove the Airflow DAG https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/379
- Create a patch to remove the refinery-source Spark code https://gerrit.wikimedia.org/r/c/analytics/refinery/source/+/919357
- Drop the hive table but keep the data (we can recreate the tables easily)
- Wait for 30 days, and in case nobody has asked about the data...
- Remove the code from refine and airflow-dags
- Delete the old data altogether