We would like to put together a list of articles where automatically generated simplified versions would be most useful (say 100-1000 articles).
An article is most useful for this purpose if it meets the following criteria:
- difficult to read, e.g. articles that the multilingual readability model scores as hard to read (a high readability score)
- a simple version does not exist in Simple Wikipedia
- long text (specifically the lead section)
- many pageviews (high demand)
- few/no edits (not controversial, not about recent events, etc.)
Notes:
- It makes sense to start with English Wikipedia. The same analysis can easily be extended to other languages, although Simple Wikipedia exists only for English, so the "no simple version exists" criterion can only be checked there.
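
As a rough illustration of how the criteria above could be combined, here is a minimal sketch in Python. It assumes the per-article signals have already been collected into records. The `ArticleStats` fields, the `select_candidates` function, and all threshold values are hypothetical placeholders, not values agreed on in this task. It also assumes that a higher readability score means harder to read, and that ranking by pageviews is an acceptable way to pick the final list.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass(frozen=True)
class ArticleStats:
    """Per-article signals (hypothetical schema, for illustration only)."""
    title: str
    readability_score: float   # assumed: higher means harder to read
    has_simple_version: bool   # True if a Simple Wikipedia article exists
    lead_chars: int            # length of the lead section in characters
    pageviews: int             # pageviews over some fixed period
    edits: int                 # edits over the same period


def select_candidates(
    articles: Iterable[ArticleStats],
    *,
    min_readability: float = 1.0,   # placeholder threshold
    min_lead_chars: int = 1000,     # placeholder threshold
    max_edits: int = 5,             # placeholder threshold
    top_n: int = 1000,              # upper end of the 100-1000 target
) -> List[ArticleStats]:
    """Keep articles that pass every filter, then rank by demand."""
    eligible = [
        a for a in articles
        if a.readability_score >= min_readability   # difficult to read
        and not a.has_simple_version                # no simple version yet
        and a.lead_chars >= min_lead_chars          # long lead section
        and a.edits <= max_edits                    # few/no edits
    ]
    # Highest-demand articles first.
    eligible.sort(key=lambda a: a.pageviews, reverse=True)
    return eligible[:top_n]
```

Hard filters followed by a single ranking key keep the selection easy to explain. A weighted score across all five criteria would be an alternative if the hard thresholds turn out to exclude too many articles.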