We should have a good rule of thumb for any input dataset that an ORES model uses.
This task was originally created to discuss how to randomize the sampling of 2k articles per mid-level category: sampling with PAWS vs. sampling manually after fetching a certain number of articles per WikiProject with queries like:
https://en.wikipedia.org/w/api.php?action=query&generator=embeddedin&geititle=Template:WikiProject%20Accessibility&geinamespace=1&prop=info
Essentially, we decided to store the output of @Sumit's script in the repo to preserve its history.
Should we do that for all samples? What is a good set of guidelines?
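
To make the discussion concrete, here is a rough sketch of what a reproducible sample could look like. It is not @Sumit's script; the function names, the seed value, and the idea of recording the seed next to the stored output are assumptions for illustration only. The query parameters are the ones from the URL above.

```python
import random
from urllib.parse import urlencode

API_URL = "https://en.wikipedia.org/w/api.php"


def build_query_url(template_title, namespace=1):
    """Build the embeddedin query URL for one WikiProject template.

    Hypothetical helper: it only mirrors the parameters of the example
    URL in this task and does not perform the request.
    """
    params = {
        "action": "query",
        "generator": "embeddedin",
        "geititle": template_title,
        "geinamespace": namespace,
        "prop": "info",
    }
    return API_URL + "?" + urlencode(params)


def sample_titles(titles, k, seed):
    """Draw up to k titles in a way that can be repeated later.

    Assumption: sorting and de-duplicating first makes the result depend
    only on the set of titles and the seed, not on the order in which
    the API happened to return them.
    """
    rng = random.Random(seed)  # local generator, so global state is untouched
    ordered = sorted(set(titles))
    return rng.sample(ordered, min(k, len(ordered)))
```

If something like this were adopted, the seed and the fetched title list would have to be stored alongside the sample (for example in the repo) for the sample to be regenerated later.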