We should update the /page/definition endpoint to use lexemes from Wikidata ASAP, because the current results are:
- Very poor. They are blobs of arbitrary HTML from which clients are required to strip markup. In some cases, they even include messages between editors in the page text itself.
- Very expensive to create.
- Only available for the content in the English Wiktionary.
- Overly broad in scope. A little-known fact about the Wiktionary project is that each Wiktionary aims to provide a definition in its language for every term in every language. Consequently, the English Wiktionary contains definitions for many thousands of terms like https://en.wiktionary.org/w/index.php?curid=6100340 for which there's a vanishingly small likelihood of an English-language user of the Wikipedia app (the sole consumer of this endpoint, as far as we know) ever running a search.
Analyze coverage (for English) for lexemes vs. the current implementation.