Principal user story:
- As a re-user of Lexeme content in all languages, I want comprehensive coverage in as many languages as possible.
Secondary user stories:
- As a Lexeme editor, I want to help work on the most needed Lexemes in my language(s).
- As a community organiser, I want to know in what languages I should advocate for more contributions.
We should provide prominent, first-party encouragement of productive editing by measuring how complete Lexeme coverage is in each language, and by publishing lists of missing or incomplete Lexemes that editors (or the tools they use) can draw on to guide their efforts.
Starting corpora could include:
- existing articles on Wikipedia and similar projects in that language
- official "basic word lists", such as the five cited lists compiled at simple:Wikipedia:Basic English combined wordlist
- similar word frequency lists (perhaps found via d:Q8893349 (Category:Linguistics lists))
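As a rough illustration of how completeness could be measured against such a corpus, here is a minimal Python sketch. It is not an existing tool: the function names `word_frequencies` and `coverage_report` are hypothetical, and the existing lemmas are assumed to have already been fetched for one language by some other means. It matches corpus words against lemmas naively, so inflected forms count as missing unless Lexeme forms are also supplied in `existing_lemmas`.

```python
import re
from collections import Counter


def word_frequencies(text):
    """Count lower-cased word tokens in a corpus string."""
    return Counter(re.findall(r"\w+", text.lower()))


def coverage_report(frequencies, existing_lemmas, top_n=1000):
    """Compare the top_n most frequent corpus words with known lemmas.

    frequencies: Counter mapping word -> occurrences in the corpus.
    existing_lemmas: iterable of lemma strings already present as
        Lexemes (assumed to be fetched elsewhere; hypothetical input).
    """
    ranked = [word for word, _ in frequencies.most_common(top_n)]
    known = {lemma.lower() for lemma in existing_lemmas}

    # Missing words stay in frequency order, so the most needed come first.
    missing = [word for word in ranked if word not in known]

    total_tokens = sum(frequencies[word] for word in ranked)
    covered_tokens = sum(frequencies[word] for word in ranked if word in known)

    return {
        # Share of distinct words that already have a Lexeme.
        "type_coverage": (len(ranked) - len(missing)) / len(ranked) if ranked else 0.0,
        # Share of corpus occurrences accounted for by existing Lexemes.
        "token_coverage": covered_tokens / total_tokens if total_tokens else 0.0,
        "missing": missing,
    }


if __name__ == "__main__":
    corpus = "the cat sat on the mat the cat"
    report = coverage_report(word_frequencies(corpus), {"the", "sat"})
    print(report)
```

Reporting both figures is a deliberate choice: type coverage shows how many distinct words are still missing, while token coverage shows how much of actual usage the existing Lexemes already account for. The `missing` list is the kind of frequency-ordered worklist an editor or tool could pick from.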