
[Analytics] [Request] Wikipedia Reference Creation over time
Open, Needs Triage, Public

Description

Wikipedia Analytics Request

Purpose

Please provide as much context as possible as well as what the produced insights or services will be used for.

We want to put guardrails in place for when the sub-referencing feature is deployed. These guardrails help us understand the impact of a feature deployment on core metrics. At a minimum, we want to see that reference creation stays stable across the wikis we've rolled out to, i.e. no dips or unexpected spikes. This task is about creating a chart that displays reference creation over time for a defined user segment. The chart will also be useful for future analysis of user behavior with references.
We don't need to account for subsequently deleted references. The purpose is to get a snapshot of activity rather than the exact number of references.

Questions to be answered

  • What is the daily average of reference creation in VisualEditor, by user segment as described in Essential Metrics, over the last 12 months?
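One way to read "daily average" is total references created per segment divided by the number of days in the window, so that days without any events count as zero. The sketch below assumes that reading. The input shape (one `(day, segment)` tuple per created reference) and the segment names `newcomer` and `experienced` are hypothetical placeholders, not the segments defined in Essential Metrics.

```python
from collections import defaultdict
from datetime import date

# Hypothetical input: one (day, user_segment) tuple per reference created in VisualEditor.
def daily_average_by_segment(events, start, end):
    """Average references created per day for each segment over [start, end], inclusive."""
    n_days = (end - start).days + 1
    totals = defaultdict(int)
    for day, segment in events:
        if start <= day <= end:
            totals[segment] += 1
    # Dividing by the full window length treats days without events as zero.
    return {segment: count / n_days for segment, count in totals.items()}

# Placeholder data; segment names are illustrative only.
events = [
    (date(2025, 9, 1), "newcomer"),
    (date(2025, 9, 1), "experienced"),
    (date(2025, 9, 2), "experienced"),
    (date(2025, 9, 4), "experienced"),
]
averages = daily_average_by_segment(events, date(2025, 9, 1), date(2025, 9, 4))
```

If the average should only cover days on which a segment was active, the denominator would need to be the count of distinct active days instead; which definition applies is an open question for the requester.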

Desired Outputs

The desired outputs of this task are listed and confirmed as being finished below.

  • A line chart in Superset (we will integrate it into a dashboard at a later stage)
  • All de.wiki-specific references ('Einfach', 'Literatur', 'Webseite') are aggregated into the chart
  • Chart filters by wiki - in our case de.wiki (we will expand to more wikis upon further rollouts)
  • Chart displays total reference creation week over week (WoW)
  • Nice to have: zoom in on daily creation
  • Chart filters by user segments as described in Essential Metrics
  • Filter by VisualEditor vs Wikitext
  • Timeframe of data: last 12 months at a minimum
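The aggregation behind such a chart could be sketched roughly as follows: apply the wiki, editor, and segment filters, sum the three de.wiki reference kinds into one series, and bucket the daily counts by ISO week. This is a minimal sketch under assumptions, not a description of any existing table. The row fields (`day`, `wiki`, `editor`, `segment`, `ref_type`, `count`) and the values `dewiki`, `visualeditor`, `wikitext`, and `all` are hypothetical.

```python
from collections import Counter
from datetime import date

# Reference kinds named in the task for de.wiki; they are summed into one series.
DE_WIKI_REF_TYPES = {"Einfach", "Literatur", "Webseite"}

def weekly_totals(rows, wiki, editor=None, segment=None):
    """Sum reference creations per ISO (year, week) after applying the chart filters."""
    totals = Counter()
    for row in rows:
        if row["wiki"] != wiki or row["ref_type"] not in DE_WIKI_REF_TYPES:
            continue
        if editor is not None and row["editor"] != editor:
            continue
        if segment is not None and row["segment"] != segment:
            continue
        iso = row["day"].isocalendar()
        totals[(iso[0], iso[1])] += row["count"]
    return dict(sorted(totals.items()))

# Placeholder daily counts; all field names and values are assumptions.
rows = [
    {"day": date(2025, 9, 1), "wiki": "dewiki", "editor": "visualeditor",
     "segment": "all", "ref_type": "Einfach", "count": 3},
    {"day": date(2025, 9, 3), "wiki": "dewiki", "editor": "wikitext",
     "segment": "all", "ref_type": "Literatur", "count": 2},
    {"day": date(2025, 9, 8), "wiki": "dewiki", "editor": "visualeditor",
     "segment": "all", "ref_type": "Webseite", "count": 4},
    {"day": date(2025, 9, 8), "wiki": "enwiki", "editor": "visualeditor",
     "segment": "all", "ref_type": "Einfach", "count": 10},
]
all_editors = weekly_totals(rows, "dewiki")
ve_only = weekly_totals(rows, "dewiki", editor="visualeditor")
```

Keeping the data at daily grain and bucketing by week only at chart time would also leave room for the nice-to-have daily zoom.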

Deadline

Please make the time sensitivity of this request clear with a date that it should be completed by. If there is no specific date, then the task will be triaged based on its priority.

26.10.2025


Information below this point is filled out by the task assignee.

Assignee Planning

Sub Tasks

A full breakdown of the steps to complete this task.

  • Subtask

Estimation

Estimate:
Actual:

Data

The tables that will be referenced in this task.

  • link_to_table

Notes

Things that came up during the completion of this task, questions to be answered and follow up tasks.

  • Note

Event Timeline

Lina_Farid_WMDE renamed this task from "[Analytics] [Request] Please add a descriptive title" to "[Analytics] [Request] Wikipedia Reference Creation over time". Sep 19 2025, 12:39 PM
Lina_Farid_WMDE updated the task description.

@AndrewTavis_WMDE I've intentionally left the data source information open so that you and the Technical Wishes team can determine the best approach.

Data sources and limitations to consider:

  • The scraper can provide absolute numbers of references per month, but gives no insight into which editing behavior created those references, and has no per-week resolution (dumps are on the 1st and 20th of the month).
  • VE instrumentation can give us "added a ref" events, but cannot measure anything about wikitext editing.

@Lina_Farid_WMDE do you need Andrew's help here?

@Ifrahkhanyaree_WMDE, no, not at the moment. Thanks for checking!