Create a medawiki-utilities bot to find articles in given categories, category trees, and lists. For each such article, find passages with (1) facts and statistics which are likely to have become out of date and have not been updated in a given number of years, and optionally (2) phrases which are likely unclear. Add an indication of the location and the text of those passages either to the page in question using templates, to a bookkeeping page with other page names as headings, and/or to a database local to the bot. Use a customizable array of keywords and regular expressions and measures of text comprehensibility to find such passages for review. Use Wikiwho to pre-compute the age of each word in an article (to avoid the move and blanking issues described in e.g., http://wikitrust.soe.ucsc.edu/talks-and-papers ) before processing each article of interest.
(Done:) Present flagged passages to one or more reviewers. Update html files with the reviewer(s)' response, but keep the original text in files with different suffixes. When reviewers disagree, update the files to reflect that fact, and present the question to a third reviewer to break the tie.
(TODO:) Note that the reputation scores won't be generally displayed to the reviewers but the review items that reduced the score will be displayed to them.
The project repositories contain all the information regarding the current state of the project and will be updated regularly. Check them out for the most up-to-date information.
Project repositories: https://github.com/priyankamandikal/minireview
and: https://github.com/jsalsman/arowf (merge in progress as of July 17, 2016)
Wiki page: https://www.mediawiki.org/wiki/Accuracy_review
Project Blog (update in progress as of July 17, 2016):
https://priyankamandikal.wordpress.com/wiki-accuracy-review/
Etherpad:
https://etherpad.wikimedia.org/p/accuracyreview
Primary mentor: @Jsalsman
Co-mentors: Fabian Flöck (@FaFlo) and @Maribelacosta
Skills: Python 2 and 3, natural language processing
Estimated project time for a senior contributor: 2 weeks
Tasks completed till date:
Deploying and running Wikiwho code in PythonAnywhere
Reviewer reputation database design
Login system in Python Flask for registering and logging in reviewers
SMOG Readability Testing
Implement the Flesch–Kincaid readability test
Review item queue database design
Manual list-based input
Reviewer workflow