Automate XML-to-parquet transformation for XML dumps (oozie job)
Closed, Resolved | Public | 8 Estimated Story Points

Description

This task involves:

Event Timeline

Milimetric moved this task from Incoming to Smart Tools for Better Data on the Analytics board.

Change 463370 had a related patch set uploaded (by Joal; owner: Joal):
[analytics/refinery/source@master] Add MediawikiXMLDumpsConverter spark job

https://gerrit.wikimedia.org/r/463370

Change 463548 had a related patch set uploaded (by Joal; owner: Joal):
[analytics/refinery@master] Add mediawiki-history-wikitext oozie job

https://gerrit.wikimedia.org/r/463548

Change 463370 merged by jenkins-bot:
[analytics/refinery/source@master] Add MediawikiXMLDumpsConverter spark job

https://gerrit.wikimedia.org/r/463370

mforns set the point value for this task to 8. (Oct 8 2018, 4:07 PM)

Change 463548 merged by Mforns:
[analytics/refinery@master] Add mediawiki-history-wikitext oozie job

https://gerrit.wikimedia.org/r/463548

OMG ! Done ... Sorry for having skipped that one :S