the html crawled from wikipedia page http://en.wikipedia.org/wiki/Netherlands
When we crawled the wikipedia page http://en.wikipedia.org/wiki/Netherlands the responsed HTML has the following content:
<div class="printfooter"> Retrieved from "<a href="http://en.wikipedia.org/w/index.php?title=Netherlands&oldid=543458973">
So it should be revision 543458973's html content. But it also has this content: "Netherland people are also homosexual." which is the previous revision 543458897's content. It is a terrible inconsistency.
The current HTML is fixed, I put the snapshot at the attached file. Please take a look.