This pretty short article is scored 4.39 (FA) in ORES. This result seems wrong.
Description
Description
Related Objects
Related Objects
- Duplicates Merged Here
- T212055: ORES predicts a surprisingly high score for a stub article on fawiki
Event Timeline
Comment Actions
I bet this has something to do with our training set including Redirects or other types of short wikitext articles rated highly. We should look through our training set to see if we have many example of highly-rated short articles.
Comment Actions
Here are our FA quality observations. It looks like the first two are excessively short. Could these be mis-labeled?
Quality | text length | rev_id |
FA | 2698 | 22209198 |
FA | 2783 | 22931379 |
FA | 9268 | 21595797 |
FA | 9268 | 21595797 |
FA | 17927 | 22790480 |
FA | 18572 | 19544614 |
FA | 19255 | 21422786 |
FA | 20313 | 22332851 |
FA | 20593 | 22236347 |
FA | 21709 | 22271457 |
FA | 22754 | 21265916 |
FA | 22755 | 22925983 |
FA | 23587 | 22343509 |
FA | 23589 | 22762032 |
FA | 23750 | 22960348 |
FA | 23763 | 22357296 |
FA | 24267 | 19416403 |
FA | 25661 | 21352991 |
FA | 25976 | 22267990 |
FA | 26512 | 21276368 |
FA | 27918 | 21298974 |
... snip ... | ||
FA | 142803 | 22345176 |
FA | 142938 | 22951851 |
FA | 145626 | 22914865 |
FA | 146537 | 22846950 |
FA | 146540 | 22329970 |
FA | 148656 | 22287634 |
FA | 153672 | 22122809 |
FA | 153672 | 22122809 |
FA | 154083 | 22848012 |
FA | 154136 | 22284243 |
FA | 160097 | 21935665 |
FA | 160104 | 22304823 |
FA | 160823 | 22908335 |
FA | 160834 | 22156021 |
FA | 161865 | 22286873 |
FA | 168988 | 22083811 |
FA | 168988 | 22083811 |
FA | 197997 | 22289502 |
FA | 207018 | 22278332 |
Comment Actions
It's quite likely but because of two mislabels, we should not miss-classify future cases that off.