Done is
- page_id is and user_central_id fields are added to mediawiki_history_reduced in Druid
- Docs updated at https://wikitech.wikimedia.org/wiki/Data_Platform/Data_Lake/Edits/Mediawiki_history_reduced
Nice to have is
- T365648: Add user_central_id to mediawiki_history and mediawiki_history_reduced Hive tables, and the Druid field populated from this instead of localuser raw MediaWiki table at Druid load time
- Add user_id to Hive mediawiki_history_reduced table. We don't need this for this task, but it seems like this field is useful and should be in the _reduced Hive table. We should not add user_id to the Druid _reduced dataset though!
Note: This is a small tracking task. To avoid conversation fragmentation, please keep technical discussions and comments on the parent task.