Page MenuHomePhabricator

Commons Impact Metrics has no data for September snapshot
Closed, ResolvedPublic

Description

Last month (August snapshot) we had to tweak the Commons Impact Metrics job to be able to read from the newly modified *links tables (T404735).
This month, the job finished successfully, but the datasets are empty.
This task is to troubleshoot, look for root cause, and fix this issue.

Details

Related Changes in GitLab:
TitleReferenceAuthorSource BranchDest Branch
Add sensor for linktarget table in Commons Impact Metrics DAGrepos/data-engineering/airflow-dags!1748mfornsadd-linktarget-to-cim-sensorsmain
Customize query in GitLab

Event Timeline

After some troubleshooting I saw that, when we added the linktarget table as a datasource for Commons Impact Metrics, we forgot to add the corresponding sensor.
This made it so that the September DAG run started before the linktarget data was properly loaded to the data lake, and so the CIM job produced empty results.
The MR above adds the proper sensor to the DAG.