We want to get estimates of how many total unillustrated articles on each of the relevant wikis will have an image recommended by the new pipeline, for different levels of likelihood-that-an-image-is-good in the recommendation. This is necessary for us to make a decision about which confidence score cutoff to use in making the recommendations. In general, we want the highest confidence score possible, but if there aren't enough recommendations at a high score, we will consider using a lower score.
The wikis are:
pt
ru
id
The likelihood-that-an-image-is-good levels we want to measure are 0.9, 0.8, 0.7
Acceptance criteria:
- Document the number of suggestions for unillustrated articles in the above wikis at the 0.9 confidence level
- Work with product management to evaluate whether that number is sufficient
- If not, measure again at the 0.8 level, etc.