Page MenuHomePhabricator

ETL pipeline for upatrolled recentchanges daily activity
Open, In Progress, MediumPublic

Description

Baseline: https://nbviewer.org/urls/gitlab.wikimedia.org/kcvelaga/automoderator-measurement/-/raw/main/baselines/T348863_content_moderation_backlogs_rchanges.ipynb

Draft table schema

`wiki_db` string COMMENT 'wiki db name',
`date` date COMMENT 'The paritition date for which the metric is computed over.',
`rc_date` date COMMENT 'The date for which the recentchanges status is aggegated for, usually 15 days prior to the paritition date.', 
`is_ns0` boolean COMMENT 'Indicates whether the log is on namespace zero or not.',
`is_page_creation` boolean COMMENT 'Indicates whether the revision resulted in page creation.',
`patrol_status` string COMMENT 'Patrol status of the change, possible values: autopatrolled, patrolled, unpatrolled.',
`n_revisions` int COMMENT 'Number of revisions for the given dimensions.'

Details

TitleReferenceAuthorSource BranchDest Branch
Queries and notebooks for moderation metrics (related to Automoderator)repos/product-analytics/data-pipelines!13kcvelagaautomoderator_puppetmain
Customize query in GitLab

Event Timeline

KCVelaga_WMF created this task.
KCVelaga_WMF renamed this task from ETL pipeline for recentchanges daily activity to ETL pipeline for upatrolled recentchanges daily activity.Tue, Jun 25, 5:08 AM
KCVelaga_WMF updated the task description. (Show Details)