The idea here is to ingest the old pagecounts-ez files into Hive and generate new files that:
- Use standard wiki names (es.wikipedia, en.wikisource, hy.wiktionary) instead of the abbreviated codes (es.z, en.s, en.y).
- Fix the one-hour skew problem (https://wikitech.wikimedia.org/wiki/Analytics/Data_Lake/Traffic/Pagecounts-ez#One_hour_skewing_issue).
- Add the columns introduced in the new incarnation of the dump (access site, page id).
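
As a rough illustration of the wiki-name point, here is a minimal Python sketch of the renaming step. The function name `standard_wiki_name` and the lookup table are hypothetical. The table is deliberately partial: it holds only the two pairs suggested by the examples above (`.z` for wikipedia, `.s` for wikisource), and a real job would need the complete list of pagecounts-ez codes.

```python
# Hypothetical, partial lookup table: only the two pairs suggested by the
# examples in the text. A real job needs the full pagecounts-ez code list.
SUFFIX_TO_PROJECT = {
    "z": "wikipedia",
    "s": "wikisource",
}


def standard_wiki_name(code: str) -> str:
    """Turn an abbreviated code such as 'es.z' into 'es.wikipedia'.

    Codes with no dot or with an unknown suffix are returned unchanged,
    so unmapped rows stay visible instead of being silently mislabeled.
    """
    lang, sep, suffix = code.rpartition(".")
    project = SUFFIX_TO_PROJECT.get(suffix)
    if not sep or project is None:
        return code
    return f"{lang}.{project}"
```

Returning unknown codes untouched is one possible design choice; it makes it easy to spot which codes still lack a mapping after a run.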