Page MenuHomePhabricator

Develop a unified Content Translation (CX) metrics dashboard
Closed, ResolvedPublic

Description

The goal of this task is to create a single unified dashboard to monitor metrics of interest of Content translation workflows. The final dashboard should meet the following requirements:

  • A single unified dashboard going forward for all metrics of interested related to CX.
  • Public: accessible to both the Language team and the community
    • anything that shouldn't be public can be monitored privately, but most of the data is public.
  • The data should be updated real-time (ideally every 1 hr)

Background

Currently, multiple dashboards and reports are used by the Language team to monitor CX usage and also share with the community. They are:

This creates a very fragmented view of CX usage, increased maintenance burden and also issues like T325790: Special:ContentTranslationStats is slow and getting crowded.
The first version of dashboard should at least unify metrics from CX key metrics dashboard, Special:CXStats and deletion stats.

Suggested solution

Steps involved
(sub-tasks to be created as required)

StepTeam(s) involvedTask(s)Notes
Identify all the metrics (+dependent data sources) to be tracked with v1 of the dashboardProduct Analytics, LPLT366044
^ list down all available metrics for future considerationProduct AnalyticsT366044
Create a basic sketch of dashboard design (placement of numbers, charts, tabs etc.)Product Analytics, LPLCX key metrics dashboard can be used as reference.
Sqoop necessary CX tables to Data LakeProduct Analytics, Data EngineeringT366867, T366868, T366869
Jobs scripts & Airflow DAGs to load required CX extension tables to Data LakeProduct Analyticscx_translations, cx_translators, cx_corpora
Queries required to calculate metrics required for v1 of the dashboardProduct Analytics
Airflow DAGs to aggregate metrics required for v1 of the dashboardProduct Analytics, Data Engineeringincludes publication to analytics/published/datasets
Toolforge jobs to load to data to ToolsDBProduct Analytics
Create an Airflow ETL pipeline to calculate the identified metricsProduct AnalyticsT287306
Privacy review of the data to be published (per data publication guidelines)Privacy (L3SC request)
Write queries for charts, from the aggregated dataProduct Analytics
Create required chartsProduct Analytics
Development and publication of the dashboardProduct Analytics
Communication to communitiesLPL

Related Objects

Event Timeline

KCVelaga_WMF triaged this task as Medium priority.
KCVelaga_WMF updated the task description. (Show Details)
KCVelaga_WMF moved this task from Triage to Epics on the Product-Analytics board.
KCVelaga_WMF added a subscriber: Pginer-WMF.
KCVelaga_WMF reopened this task as In Progress.
KCVelaga_WMF changed the status of subtask T366044: Identify metrics to be presented on the unified CX dashboard from Open to In Progress.
KCVelaga_WMF updated the task description. (Show Details)
KCVelaga_WMF moved this task from Backlog to Priority Backlog on the Language-analytics board.
KCVelaga_WMF changed the task status from In Progress to Open.Oct 3 2024, 7:15 AM
KCVelaga_WMF updated the task description. (Show Details)
KCVelaga_WMF renamed this task from Develop a unified Content Translation (CX) monitoring dashboard to Develop a unified Content Translation (CX) metrics dashboard.Oct 3 2024, 7:19 AM
KCVelaga_WMF changed the task status from Open to In Progress.May 19 2025, 9:02 PM
KCVelaga_WMF raised the priority of this task from Medium to High.
KCVelaga_WMF moved this task from Priority to In progress on the LPL Analytics board.
KCVelaga_WMF changed the status of subtask T394431: Data pipeline to aggregate CX daily monitoring metrics from Open to In Progress.

Mentioned in SAL (#wikimedia-operations) [2025-06-17T13:56:57Z] <kcvelaga@deploy1003> Started deploy [airflow-dags/analytics_product@90a716a]: T365813

Mentioned in SAL (#wikimedia-operations) [2025-06-17T13:58:08Z] <kcvelaga@deploy1003> Finished deploy [airflow-dags/analytics_product@90a716a]: T365813 (duration: 01m 21s)

Marking this as completed as the first version is functional. Additional tasks can be filed for additions/improvements.