Page MenuHomePhabricator

Develop a unified Content Translation (CX) metrics dashboard
Open, MediumPublic

Description

The goal of this task is to create a single unified dashboard to monitor metrics of interest of Content translation workflows. The final dashboard should meet the following requirements:

  • A single unified dashboard going forward for all metrics of interested related to CX.
  • Public: accessible to both the Language team and the community
    • anything that shouldn't be public can be monitored privately, but most of the data is public.
  • The data should be updated real-time (ideally every 1 hr)

Background

Currently, multiple dashboards and reports are used by the Language team to monitor CX usage and also share with the community. They are:

This creates a very fragmented view of CX usage, increased maintenance burden and also issues like T325790: Special:ContentTranslationStats is slow and getting crowded.
The first version of dashboard should at least unify metrics from CX key metrics dashboard, Special:CXStats and deletion stats.

Suggested solution

Steps involved
(sub-tasks to be created as required)

StepTeam(s) involvedTask(s)Notes
Identify all the metrics (+dependent data sources) to be tracked with v1 of the dashboardProduct Analytics, LPLT366044
^ list down all available metrics for future considerationProduct AnalyticsT366044
Create a basic sketch of dashboard design (placement of numbers, charts, tabs etc.)Product Analytics, LPLCX key metrics dashboard can be used as reference.
Sqoop necessary CX tables to Data LakeProduct Analytics, Data EngineeringT366867, T366868, T366869
Jobs scripts & Airflow DAGs to load required CX extension tables to Data LakeProduct Analyticscx_translations, cx_translators, cx_corpora
Queries required to calculate metrics required for v1 of the dashboardProduct Analytics
Airflow DAGs to aggregate metrics required for v1 of the dashboardProduct Analytics, Data Engineeringincludes publication to analytics/published/datasets
Toolforge jobs to load to data to ToolsDBProduct Analytics
Create an Airflow ETL pipeline to calculate the identified metricsProduct AnalyticsT287306
Privacy review of the data to be published (per data publication guidelines)Privacy (L3SC request)
Write queries for charts, from the aggregated dataProduct Analytics
Create required chartsProduct Analytics
Development and publication of the dashboardProduct Analytics
Communication to communitiesLPL

Event Timeline

KCVelaga_WMF triaged this task as Medium priority.
KCVelaga_WMF updated the task description. (Show Details)
KCVelaga_WMF moved this task from Triage to Epics on the Product-Analytics board.
KCVelaga_WMF added a subscriber: Pginer-WMF.
KCVelaga_WMF reopened this task as In Progress.
KCVelaga_WMF changed the status of subtask T366044: Identify metrics to be presented on the unified CX dashboard from Open to In Progress.
KCVelaga_WMF updated the task description. (Show Details)
KCVelaga_WMF moved this task from Backlog to Priority Backlog on the Language-analytics board.
KCVelaga_WMF changed the task status from In Progress to Open.Oct 3 2024, 7:15 AM
KCVelaga_WMF updated the task description. (Show Details)
KCVelaga_WMF renamed this task from Develop a unified Content Translation (CX) monitoring dashboard to Develop a unified Content Translation (CX) metrics dashboard.Oct 3 2024, 7:19 AM