Once we are able to compute ongoing daily editor metrics for WE3.3.7 in T405039: Global Editor Metrics - Data Pipeline, we will need to backfill data in Cassandra from 2025-01-01.
We have had mediawiki.page_change.v1 events in the event_sanitized database since late 2024. However, the user_central_id field, added in T403664: EventBus - Add central user id to MediaWiki events, is not available before 2025-09-19. Because this field is required to compute global editor metrics, these events cannot be used to backfill earlier dates.
Once T365648: Add user_central_id to mediawiki_history and mediawiki_history_reduced Hive tables is done, we can backfill from mediawiki_history.
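As a rough illustration of the backfill window described above, the sketch below lists the days from 2025-01-01 up to (but not including) 2025-09-19, i.e. the days for which page_change events lack user_central_id. The helper name `days_without_central_id` is hypothetical, and treating 2025-09-19 itself as fully covered by events is an assumption; the actual backfill may simply cover the whole range from mediawiki_history.

```python
from datetime import date, timedelta

# Dates taken from the task description.
BACKFILL_START = date(2025, 1, 1)
# Assumption: user_central_id is usable in events from this day onward.
CENTRAL_ID_AVAILABLE_FROM = date(2025, 9, 19)


def days_without_central_id(start: date, cutoff: date) -> list[date]:
    """Hypothetical helper: every day in [start, cutoff), oldest first."""
    if cutoff < start:
        raise ValueError("cutoff must not be earlier than start")
    return [start + timedelta(days=i) for i in range((cutoff - start).days)]


days = days_without_central_id(BACKFILL_START, CENTRAL_ID_AVAILABLE_FROM)
print(len(days), days[0].isoformat(), days[-1].isoformat())
```

Each of these days would need its metrics computed from mediawiki_history rather than from events.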
Done is:
- Editor metrics tables in Hive are backfilled starting 2025-01-01. Queries for this are being developed in "Add HQL for edit_per_editor_per_page_daily and pageview_per_editor_per_page_daily" (1196892).
- Editor metrics tables in Cassandra are backfilled starting 2025-01-01.