* Download enwiktionary [[ https://dumps.wikimedia.org/other/enterprise_html/runs/20220401/enwiktionary-NS0-20220401-ENTERPRISE-HTML.json.tar.gz | HTML dump ]] (April 1st, 2022)
* Untar the archive
* Extract data for the page "[[ https://en.wiktionary.org/wiki/apreciable | apreciable ]]":
```
$ jq -r 'select(.name == "apreciable")' enwiktionary_*ndjson | head
{
  "name": "apreciable",
  "identifier": 2713698,
  "date_modified": "2021-03-19T05:53:16Z",
  "version": {
    "identifier": 62182446,
    "comment": "convert {{es-adj-old}} to new {{es-adj}} format",
```
**What happens?**:
The data returned is from March 2021 (`"date_modified": "2021-03-19T05:53:16Z"`).
**What should have happened instead?**:
The data returned should be from March 2022: the page was last edited on 2022-03-09, before the dump was generated ([[ https://en.wiktionary.org/w/index.php?title=apreciable&diff=66107705&oldid=62182446&diffmode=source | diff ]]).
Pages seem to be missing or outdated in all the recent enwiktionary HTML dumps I've tried. If it's useful, I can try to compile a list by diffing against the XML dump.
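
A rough sketch of how that comparison could work, in Python. It assumes the XML dump is a current-revisions export with `<page>`, `<title>`, `<ns>` and `<revision><id>` elements, and that each NDJSON line carries the `name` and `version.identifier` fields shown above. The function names `latest_revisions` and `find_discrepancies` are made up for this example. Pages edited between the two dump runs would show up as false positives, so the XML dump should be from the same date or earlier.

```python
import io
import json
import xml.etree.ElementTree as ET


def local(tag):
    """Strip the XML namespace prefix from an element tag."""
    return tag.rsplit("}", 1)[-1]


def latest_revisions(xml_stream):
    """Map main-namespace page titles to their newest revision id.

    Assumes a MediaWiki XML export layout (an assumption, not checked
    against every dump flavour).
    """
    revisions = {}
    for _event, page in ET.iterparse(xml_stream, events=("end",)):
        if local(page.tag) != "page":
            continue
        title, ns, rev_id = None, None, None
        for child in page:
            name = local(child.tag)
            if name == "title":
                title = child.text
            elif name == "ns":
                ns = child.text
            elif name == "revision":
                for sub in child:
                    if local(sub.tag) == "id":
                        rev_id = max(rev_id or 0, int(sub.text))
        # The HTML dump in question is NS0 only, so skip other namespaces.
        if ns == "0" and title is not None and rev_id is not None:
            revisions[title] = rev_id
        page.clear()  # release the parsed subtree to limit memory use
    return revisions


def find_discrepancies(ndjson_lines, xml_revisions):
    """Return (stale, missing) relative to the XML dump.

    stale:   (title, html_revision, xml_revision) where the HTML dump is older
    missing: titles present in the XML dump but absent from the HTML dump
    """
    seen = set()
    stale = []
    for line in ndjson_lines:
        line = line.strip()
        if not line:
            continue
        page = json.loads(line)
        name = page["name"]
        seen.add(name)
        html_rev = page["version"]["identifier"]
        xml_rev = xml_revisions.get(name)
        if xml_rev is not None and html_rev < xml_rev:
            stale.append((name, html_rev, xml_rev))
    missing = sorted(set(xml_revisions) - seen)
    return stale, missing


# Usage with real files (paths are placeholders):
# with open("enwiktionary-stub-meta-current.xml", "rb") as xml_file:
#     revisions = latest_revisions(xml_file)
# with open("enwiktionary_0.ndjson", encoding="utf-8") as html_file:
#     stale, missing = find_discrepancies(html_file, revisions)
```

Comparing revision ids rather than `date_modified` avoids timestamp parsing, since revision ids only increase over time on a given wiki. The `missing` list may also contain redirects if the HTML dump leaves them out, so it would need a second look before being treated as a list of dropped pages.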