As part of the 2015-16 Q2 experimental goal, we did some analysis of HTML content for a sample of articles. The HTML output was mostly driven by Parsoid, however, so we should redo this analysis but with the MediaWiki parser as the backend.
AC
- Change joakin/loot-content-analysis to use the MediaWiki parser
- Publish the results to mobile-l
- Use the results to prioritise any future engineering work
- Explore whether we can do this sitewide using a database dump