Page MenuHomePhabricator

[Data Quality] [NEEDS GROOMING][SPIKE] Define how we can validate that mw.page_content_change is complete
Open, Needs TriagePublic

Description

User Story
As a platform engineer, I need to determine if the mw.page_content_change is a complete dataset
Why?

So that we can inform consumers of potential issues and try to see if we can make changes upstream to improve completeness.

Discussion points?
  • How do we do it? Can we diff a history table of revisions vs the stream periodically?

Event Timeline

Ahoelzl renamed this task from [NEEDS GROOMING][SPIKE] Define how we can validate that mw.page_content_change is complete to [Data Quality] [NEEDS GROOMING][SPIKE] Define how we can validate that mw.page_content_change is complete.Oct 20 2023, 5:06 PM