Motivation
We want to make sure user ids really refer to the same users. We also want to see if users encounter edit conflicts in the very first edit they ever made.
Todo
- join on User_Text (globally unique user name or IP address, so make sure to filter out not logged in users)
- take all user ids that encountered an edit conflict in the past 3 months, and join them with mediawiki history-number of revisions ever made (cumulative edits). This will also include the 0 edit cases for edit conflicts.
- take out the bot edits on the edit conflict side (using is_bot flag from the logging schema)
Notes
Should be done by the end of May 25th