
Data pipeline to aggregate CX monthly machine translation service usage
Closed, Resolved | Public

Description

Complementing T394525, this task is to set up the necessary artifacts for the following metrics:

  • Published translation by MT service (time series)
  • Average number of daily translations by service (filter scoped)
  • Usage by language pair
  • Percentage of content modified by MT service (published translations)
  • Percentage of articles created with each MT service that were later deleted
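
As a rough illustration of the first three metrics above, here is a minimal Python sketch. It is not the code from the linked merge requests: the record fields (`published`, `service`, `source`, `target`), the sample rows, and the function names are all hypothetical, and the real jobs read from CX tables rather than an in-memory list.

```python
from calendar import monthrange
from collections import Counter
from datetime import date

# Hypothetical sample records; field names are assumptions, not the real CX schema.
translations = [
    {"published": date(2025, 6, 1), "service": "Google", "source": "en", "target": "es"},
    {"published": date(2025, 6, 1), "service": "Google", "source": "en", "target": "fr"},
    {"published": date(2025, 6, 15), "service": "MinT", "source": "en", "target": "es"},
    {"published": date(2025, 7, 2), "service": "MinT", "source": "es", "target": "ca"},
]


def monthly_counts_by_service(rows):
    """Count published translations per (year, month, service)."""
    return Counter(
        (r["published"].year, r["published"].month, r["service"]) for r in rows
    )


def average_daily_by_service(rows):
    """Divide each monthly count by the number of calendar days in that month."""
    return {
        (year, month, service): count / monthrange(year, month)[1]
        for (year, month, service), count in monthly_counts_by_service(rows).items()
    }


def usage_by_language_pair(rows):
    """Count published translations per (source, target, service)."""
    return Counter((r["source"], r["target"], r["service"]) for r in rows)
```

Averaging over calendar days (rather than only days with at least one translation) is a design choice made for this sketch; the task does not say which denominator the actual pipeline uses.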

Details

Related Changes in GitLab:
Title | Reference | Author | Source Branch | Dest Branch
Add monthly cadence suffix to CX default service comparison table | repos/product-analytics/data-pipelines!44 | kcvelaga | add-cx-monthly-cadence | main
Airflow DAGs to calculate metrics to related to machine translation service usage metrics | repos/data-engineering/airflow-dags!1487 | kcvelaga | main | main
Scripts for CX MT service metrics | repos/product-analytics/content-translation-airflow-jobs!8 | kcvelaga | cx-mt-service-metrics | main

Event Timeline

kcvelaga opened https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/1487

Airflow DAGs to calculate metrics to related to machine translation service usage metrics

KCVelaga_WMF changed the task status from Open to In Progress. Jul 2 2025, 6:06 AM
KCVelaga_WMF moved this task from Priority to In progress on the LPL Analytics board.
KCVelaga_WMF moved this task from In progress to Review/sign-off on the LPL Analytics board.

kcvelaga merged https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/1487

Airflow DAGs to calculate metrics to related to machine translation service usage metrics

Mentioned in SAL (#wikimedia-operations) [2025-07-08T18:26:40Z] <kcvelaga@deploy1003> Started deploy [airflow-dags/analytics_product@52ec646]: T394526

Mentioned in SAL (#wikimedia-operations) [2025-07-08T18:28:11Z] <kcvelaga@deploy1003> Finished deploy [airflow-dags/analytics_product@52ec646]: T394526 (duration: 01m 35s)