Prompted by the Wikipedia Edit Review Experiment @calbon conducted, @santhosh generated an LLM prompt to:
- Systematically evaluate proposed edits to Wikipedia articles
- Identify potential violations of Wikipedia content policies
- Provide objective, concise assessments of edit changes
The outcome of this investigation seems promising insofar as Santhosh has demonstrated that an LLM can identify the specific policy violations (with en.wiki links) that content edits introduce.
Among other things, the above leads the Editing Team to wonder: might the approach Santhosh piloted be a reliable and scalable way to detect the presence or absence of policy violations in new content edits?
The Editing Team is asking this question because we are most immediately curious to learn whether this approach could enable us to evaluate the impacts of Tone (T365301) and Paste Check (T359107).
Requirements
- Review the approach Santhosh piloted and document the extent to which we think it could reliably detect the presence of copyright violations and non-neutral tone within new content edits.
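
One possible way to put a number on "reliably" would be to compare the LLM's verdicts against human reviewer judgments on the same set of edits and report precision and recall. The sketch below is only an illustration under that assumption: the function name `score`, the `Metrics` type, and the sample labels are hypothetical and do not come from Santhosh's pilot or from any real review data.

```python
from dataclasses import dataclass


@dataclass
class Metrics:
    precision: float  # share of LLM-flagged edits that humans also flagged
    recall: float     # share of human-flagged edits that the LLM caught


def score(human: list[bool], llm: list[bool]) -> Metrics:
    """Compare per-edit LLM flags with human labels (True = violation present)."""
    if len(human) != len(llm):
        raise ValueError("human and llm label lists must be the same length")
    tp = sum(h and m for h, m in zip(human, llm))          # both flagged
    fp = sum((not h) and m for h, m in zip(human, llm))    # only the LLM flagged
    fn = sum(h and (not m) for h, m in zip(human, llm))    # only humans flagged
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return Metrics(precision=precision, recall=recall)


# Hypothetical labels for six edits, e.g. "does this edit introduce non-neutral tone?"
human_labels = [True, True, False, False, True, False]
llm_flags = [True, False, False, True, True, False]

result = score(human_labels, llm_flags)
print(f"precision={result.precision:.2f} recall={result.recall:.2f}")
```

Running the same comparison separately for each policy (copyright, tone) would show whether the approach is dependable enough for one check but not the other.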
