- Wait until page history tests are finished (T143322)
- import latest simplewiki with new oozie sqoop (or use existing sqoop import)
- Run the new algorithm on simplewiki
- Solidify the SQL scripts that join revision to the history reconstruction tables and output the denormalized tables
- Vet the resulting denormalized table ourselves
- Once this is done, pass the details on how to use the denormalized data to Erik
Description
Description
Status | Subtype | Assigned | Task | ||
---|---|---|---|---|---|
Resolved | None | T120037 Vital Signs: Please provide an "all languages" de-duplicated stream for the Community/Content groups of metrics | |||
Resolved | None | T120036 Vital Signs: Please make the data for enwiki and other big wikis less sad, and not just be missing for most days | |||
Resolved | odimitrijevic | T130256 Wikistats 2.0. | |||
Duplicate | Milimetric | T141536 Compare early results of Wikistats 2.0 with Wikistats 1.0 | |||
Resolved | mforns | T143321 Create clean simplewiki output from edit history reconstruction | |||
Resolved | mforns | T143322 Edit History: Review scala code functionality and make page and user output uniform |