Create clean simplewiki output from edit history reconstruction
  • Wait until page history tests are finished (T143322)
  • Import the latest simplewiki with the new Oozie Sqoop job (or use an existing Sqoop import)
  • Run the new algorithm on simplewiki
  • Solidify the SQL scripts that join revision to the history reconstruction tables and output the denormalized tables
  • Vet the resulting denormalized table ourselves
  • Once this is done, pass the details on how to use the denormalized data to Erik
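
For reference, the join and vetting steps above could look roughly like the sketch below. It is only an illustration and not the actual scripts: it uses an in-memory SQLite database instead of the cluster, and the page_history table, its start_timestamp/end_timestamp columns, and the denormalized_revision output name are all assumptions made up for the example. The idea is that each revision picks up the page state that was valid at the revision's timestamp, and the vetting compares row counts and counts revisions that found no matching history row.

```python
import sqlite3

# Hypothetical schema: page_history stands in for one of the history
# reconstruction tables; each row is a page state valid over
# [start_timestamp, end_timestamp), with NULL end meaning "still current".
SCHEMA = """
CREATE TABLE revision (rev_id INTEGER, rev_page INTEGER, rev_timestamp TEXT);
CREATE TABLE page_history (
    page_id INTEGER, page_title TEXT,
    start_timestamp TEXT, end_timestamp TEXT
);
"""

# Join each revision to the page state valid at the revision's timestamp.
# LEFT JOIN keeps revisions with no matching history so they can be vetted.
DENORMALIZE = """
DROP TABLE IF EXISTS denormalized_revision;
CREATE TABLE denormalized_revision AS
SELECT r.rev_id, r.rev_page, r.rev_timestamp,
       p.page_title AS page_title_at_revision
FROM revision r
LEFT JOIN page_history p
  ON p.page_id = r.rev_page
 AND p.start_timestamp <= r.rev_timestamp
 AND (p.end_timestamp IS NULL OR r.rev_timestamp < p.end_timestamp);
"""


def build_denormalized(conn):
    """Create the (hypothetical) denormalized_revision table."""
    conn.executescript(DENORMALIZE)


def vet(conn):
    """Basic sanity checks: row counts and revisions without history."""
    def count(sql):
        return conn.execute(sql).fetchone()[0]
    return {
        "revisions": count("SELECT COUNT(*) FROM revision"),
        "denormalized": count("SELECT COUNT(*) FROM denormalized_revision"),
        "unmatched": count(
            "SELECT COUNT(*) FROM denormalized_revision "
            "WHERE page_title_at_revision IS NULL"
        ),
    }


def demo():
    """Load made-up sample rows and build the denormalized table."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    # Page 1 was titled Foo, then renamed to Bar; page 2 has no history.
    conn.executemany(
        "INSERT INTO page_history VALUES (?, ?, ?, ?)",
        [
            (1, "Foo", "20160101000000", "20160201000000"),
            (1, "Bar", "20160201000000", None),
        ],
    )
    conn.executemany(
        "INSERT INTO revision VALUES (?, ?, ?)",
        [
            (10, 1, "20160115000000"),
            (11, 1, "20160215000000"),
            (12, 2, "20160110000000"),
        ],
    )
    build_denormalized(conn)
    return conn


if __name__ == "__main__":
    print(vet(demo()))
```

If the revision and denormalized counts differ, the join is duplicating rows (overlapping history intervals); a non-zero unmatched count points at revisions the reconstruction did not cover.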