By dividing the spectrum of ORES scores into discrete levels and labeling them in a way that makes their predictive value clear, we enable users to use the system effectively to meet their reviewing needs. In the immediate future, this system will be used on the ERI revisions to Recent Changes and in the ReviewStream feed.
As of Nov. 1, 2016, here's how we're defining and describing these levels:
CONTRIBUTION QUALITY [Damaging]
Very likely good
Highly accurate at finding almost all problem-free edits.
May have problems
Finds most flawed or damaging edits but with lower accuracy.
Likely have problems
Finds half of flawed or damaging edits with medium accuracy.
Very likely have problems
Highly accurate at finding the most obvious 30% of flawed or damaging edits.
USER INTENT [Good Faith]
Very likely good faith
Highly accurate at finding almost all good-faith edits.
May be bad faith
Finds most bad-faith edits but with lower accuracy.
Likely bad faith
Highly accurate at finding the most obvious 20% of bad-faith edits.
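
To illustrate what dividing the score spectrum into discrete levels could look like, here is a minimal Python sketch for the Contribution Quality levels. It assumes the damaging score is a probability between 0 and 1 and that each level covers one contiguous, non-overlapping range. The cut points in `DAMAGING_CUTS` and the helper `label_for` are hypothetical placeholders for illustration only. They are not the thresholds actually used, which this page does not specify.

```python
from bisect import bisect_right

# Hypothetical cut points on the "damaging" probability (0.0 to 1.0).
# Placeholder values for illustration; not the real thresholds.
DAMAGING_CUTS = [0.15, 0.50, 0.85]

# Level labels as defined on this page, ordered from lowest to highest score.
DAMAGING_LABELS = [
    "Very likely good",
    "May have problems",
    "Likely have problems",
    "Very likely have problems",
]


def label_for(score, cuts, labels):
    """Return the label of the level that the score falls into.

    A score equal to a cut point is assigned to the higher level.
    """
    if not 0.0 <= score <= 1.0:
        raise ValueError("score must be a probability between 0 and 1")
    return labels[bisect_right(cuts, score)]


if __name__ == "__main__":
    for score in (0.05, 0.30, 0.60, 0.95):
        print(score, label_for(score, DAMAGING_CUTS, DAMAGING_LABELS))
```

The same helper could be reused for the User Intent levels by passing a different list of cut points and labels. In practice the ranges may need to differ per wiki and might overlap, which this simple binning does not model.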