It seems like we recently crossed a milestone with the bot for catching copyright violations. (see T185163; I'm not sure if the milestone is 1M credits, or exactly 2 years since last load)
This may be a good time to consider a Press Release with aim to:
- Maintain the good relationship with turnitin team who kindly provided us with great service for copyright violation detection used in the core of the system
- Get more users involved in patrolling the suspected edits using CopyPatrol (especially in wikis for which we run the bot but not all cases are getting reviewed, e.g eswiki)
- Thank for the patrollers, and in particular the most active patrollers in the leaderboard ( https://tools.wmflabs.org/copypatrol/en/leaderboard )
- To make it interesting (and for my own curiosity) it would be awesome if we can include some patterns/tips&tricks we learned from the project:
- Who is more likely to have suspected edits? (new users? people who get paid from companies?)
- What subjects are the most common for copyright violations?
- Where does the copyrighted content come from? (small websites/large websites)?
- Are there specific terms/words/patterns that are more likely to appear/not appear in copyrighted material?