Page MenuHomePhabricator

Migrate the movement_metrics ETL jobs to Airflow
Closed, ResolvedPublic

Description

Migrate the current ETLs in analytics/wmf-product/jobs/movement_metrics (scheduled via systemd timer in puppet) to Product Analytics Airflow.

As part of this, we should:

  • rebuild each of the tables from scratch. In most cases, it shouldn't make much difference, but it will address T358684 and possibly some other small deviations. (This does not mean we will recalculate all the metric values that have been derived from these tables over the years.)
  • do the rebuilding in Iceberg format
  • move each table to the appropriate one of the new "wmf_*" databases.

Related Objects

Event Timeline

nshahquinn-wmf renamed this task from Product Analytics ETL Migration: Movement metrics to Migrate the movement_metrics ETL jobs to Airflow.Mar 5 2024, 7:40 PM
nshahquinn-wmf claimed this task.
nshahquinn-wmf raised the priority of this task from Medium to High.Jul 4 2024, 6:28 PM
nshahquinn-wmf lowered the priority of this task from High to Medium.Jul 16 2024, 6:44 PM

Now that we have deleted the Superset movement metrics dashboards (T359700), we do not need to migrate the remaining tables.