In T376926#10238695 we found lots of errors coming from XML content Dumps in the last couple of months. Some wikis run to completion while others slow down a lot and struggle. Most importantly, data corruption is suspected as good revisions seem to fail to export after multiple retries.
This task will look into why and have three deliverables.
- fix for the problem
- guide for Data Platform Engineers looking into similar issues. How would one debug this issue? MW is complex, so sticking to this very specific issue and hopefully the "similar" part emerges.
- better collaboration with MW folks in general. Huge kudos to @pmiazga for offering support and looking over our shoulder here.