Migrate the movement_metrics ETL jobs to Airflow
Open, High, Public

Description

Migrate the current ETLs in analytics/wmf-product/jobs/movement_metrics (scheduled via systemd timers in Puppet) to the Product Analytics Airflow instance.

As part of this, we should:

  • rebuild each of the tables from scratch. In most cases, it shouldn't make much difference, but it will address T358684 and possibly some other small deviations. (This does not mean we will recalculate all the metric values that have been derived from these tables over the years.)
  • do the rebuilding in Iceberg format
  • move each table to the appropriate one of the new "wmf_*" databases.
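
A rough sketch of what a per-table rebuild statement might look like, assuming the rebuild runs as a Spark SQL `CREATE TABLE ... USING iceberg AS SELECT` statement. The helper name `build_rebuild_statement`, the database `wmf_example`, the table name, and the query are all hypothetical placeholders, not the actual job code or table names:

```python
def build_rebuild_statement(target_db: str, target_table: str, select_sql: str) -> str:
    """Return a Spark SQL statement that rebuilds a table from scratch in Iceberg format.

    Hypothetical helper: the real jobs may structure this differently.
    """
    # The task asks for tables to land in one of the new "wmf_*" databases.
    if not target_db.startswith("wmf_"):
        raise ValueError(f"expected a wmf_* database, got {target_db!r}")
    return (
        f"CREATE TABLE {target_db}.{target_table}\n"
        "USING iceberg\n"
        f"AS {select_sql.strip()}"
    )


# Placeholder names, for illustration only.
statement = build_rebuild_statement(
    target_db="wmf_example",
    target_table="example_metric",
    select_sql="SELECT month, metric_value FROM example_source",
)
print(statement)
```

An Airflow task could then submit the generated statement to Spark; how that is wired up depends on the operators available in the Product Analytics Airflow instance.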

Event Timeline

nshahquinn-wmf renamed this task from "Product Analytics ETL Migration: Movement metrics" to "Migrate the movement_metrics ETL jobs to Airflow". Mar 5 2024, 7:40 PM
nshahquinn-wmf claimed this task.
nshahquinn-wmf raised the priority of this task from Medium to High. Thu, Jul 4, 6:28 PM