What are the general trends?
See https://www.mediawiki.org/wiki/ORES/Issues/Article_quality
Status | Subtype | Assigned | Task
---|---|---|---
Resolved | | Chtnnh | T247847 Proposal (GSoC 2020): Implement articlequality and draftquality model for ptwiki and apply insights to models for bs, uk, hi wikis
Resolved | | Chtnnh | T250809 Review model performance for ptwiki 'articlequality' and 'draftquality'
Resolved | | Chtnnh | T251905 Write report about misclassification reports
https://gist.github.com/chtnnh/15a77653279d50a0b90179aa83db4fca
This is the difference in model performance before and after adding words_to_watch to feature_lists/ptwiki.py, measured on selected articles that the old model misclassified.
I just reviewed @Chtnnh's post at https://www.mediawiki.org/wiki/ORES/Issues/Article_quality#Summary. I made some modifications to the table to make it easier to read.
I think we should change the column values to include both the prediction and the weighted_sum. That will help us track improvements.
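# Weight each quality class by its rank, so higher classes count for more.
CLASS_WEIGHTS = {"1": 1, "2": 2, "3": 3, "4": 4, "5": 5, "6": 6}

def weighted_sum(score):
    # Expected class rank: sum of weight * probability over all classes.
    total = 0  # avoids shadowing the built-in sum()
    for cls, proba in score['probability'].items():
        total += CLASS_WEIGHTS[cls] * proba
    return total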
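To illustrate how a table cell could carry both values, here is a minimal sketch. It restates the function above so it runs on its own. The `format_cell` helper and the example probabilities are hypothetical, invented for illustration and not taken from the misclassification reports. It assumes the score dict has a `prediction` key alongside `probability`.

```python
CLASS_WEIGHTS = {"1": 1, "2": 2, "3": 3, "4": 4, "5": 5, "6": 6}


def weighted_sum(score):
    # Same computation as above: expected class rank under the model's probabilities.
    return sum(CLASS_WEIGHTS[cls] * proba
               for cls, proba in score["probability"].items())


def format_cell(score):
    # Hypothetical helper: renders "prediction (weighted_sum)" for one table cell.
    return f"{score['prediction']} ({weighted_sum(score):.2f})"


# Made-up score for illustration only; real values come from the model output.
example = {
    "prediction": "3",
    "probability": {"1": 0.05, "2": 0.10, "3": 0.50,
                    "4": 0.20, "5": 0.10, "6": 0.05},
}

print(format_cell(example))  # 3 (3.35)
```

Because the weighted sum moves even when the predicted class stays the same, a change such as 3 (3.35) to 3 (3.60) would show the model shifting toward the right class before the prediction itself flips.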
https://www.mediawiki.org/wiki/ORES/Issues/Article_quality
The table has been updated under the summary section of the misclassification reports. This task can be closed as resolved.