With T221338 complete, the edits_hourly data (T211173) should now be ready to use. The purpose of this task is to dobule-check the data and make sure there are no major outstanding issues.
Outstanding Issues:
* content edit counts in wmf.mediawiki_history are not fully reliable. [[ https://phabricator.wikimedia.org/T221338 | T221338 ]]
* all anon users display with 10,000 edit count on edit_hourly dataset. [[https://phabricator.wikimedia.org/T224941 | T224941]]
Proposed checks:
| **check** | **status** |
| View in Turnilo and Superset. Confirm data appears as expected by applying various filters and splits.|
| Confirm data matches query results on `wmf.mediawiki_history` data and monthly contributors metrics numbers.| In progress. Confirmed edit counts for user edit count buckets, namespace_is_talk, and user_is_adminstrator match with mediawiki_history queries
| Confirm that the content page edit counts issue was corrected by comparing to query results on MariaDB replicas.| In progress
| Confirm anon users display with correct edit_count| ✅ Confirmed. Note: Anonymous editors do not have an edit count at this time because that info is not available in Data lake.