In Persian Wikipedia we imitate English Wikipedia's wp10 quality model. The problem is that there are not enough people to assess quality of all articles so we only have stub articles, good articles, featured articles completely determined but there are also lots of articles that should be categorized as "B", "C", or "Start" but they haven't categorized.
So here's my suggestion: Do what we did with edit quality campaign, get a 20K sample, autolabel stub, featured and good articles and ask users for what's left. I think we should start with 20K because we have lots of stub articles that can be filtered out easily.
