xmlreader.XmlDump has a boolean parameter allrevisions, documented as "If True, parse all revisions instead of only the latest one." When it is False, .parse() calls ._parse_only_latest(). However, two issues arise:
- As written, ._parse_only_latest() yields the first matching revision it finds without comparing its revisionid against the other candidates. For both the test suite's dump and my own separate wiki dump, this returns the earliest revision of the page, not the latest.
- The accompanying test, tests.xml_reader_tests.test_XmlDumpFirstRev(), explicitly tests loading the first revision. The test currently passes, but it would be more helpful for it to match the function's documented behavior and check that the latest revision is returned.
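
To illustrate the behavior I would expect, here is a minimal standalone sketch. It is not the xmlreader code and not the planned patch. It uses only the standard library, and the sample XML and the helper name latest_revisions are made up for illustration. It assumes that a higher revision id means a later revision, and it omits the XML namespace that real MediaWiki dumps carry on every tag.

```python
import io
import xml.etree.ElementTree as ET

# Hypothetical, simplified dump: revisions are deliberately not in id order,
# and the namespace used by real MediaWiki exports is left out.
SAMPLE = b"""<mediawiki>
  <page>
    <title>Example</title>
    <revision><id>10</id><text>first</text></revision>
    <revision><id>25</id><text>latest</text></revision>
    <revision><id>17</id><text>middle</text></revision>
  </page>
</mediawiki>"""


def latest_revisions(source):
    """Yield (title, revision_id, text) for the highest-id revision of each page."""
    for _event, elem in ET.iterparse(source, events=("end",)):
        if elem.tag != "page":
            continue
        revisions = elem.findall("revision")
        if revisions:
            # Compare every candidate instead of taking the first one found.
            newest = max(revisions, key=lambda rev: int(rev.findtext("id")))
            yield (
                elem.findtext("title"),
                int(newest.findtext("id")),
                newest.findtext("text"),
            )
        # Release the finished page so large dumps do not accumulate in memory.
        elem.clear()


result = list(latest_revisions(io.BytesIO(SAMPLE)))
print(result)
```

With this sample, taking the first revision encountered would give id 10, while comparing ids gives id 25.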
PR to follow shortly!