
pageviews dumps contain invalid lines
Closed, Duplicate, Public


Some pageviews-*.gz files contain entries that wrap across multiple lines. For example:


en.m Current_rating_of_the_circuit_=_5_A
34._When_is 1 0

starting on line 2933031. It looks like this should all be on a single line.
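
One way to locate such entries is to scan a dump and flag every line that does not have the expected shape. The following is a minimal sketch, assuming each valid entry consists of four space-separated fields (domain code, page title, view count, response size) with the last two being non-negative integers. The helper names `is_valid_entry`, `find_invalid_lines`, and `scan_dump` are hypothetical and are not part of any Wikimedia tooling.

```python
import gzip
from typing import Iterable, Iterator, Tuple


def is_valid_entry(line: str) -> bool:
    """Return True if the line has the assumed four-field layout.

    Assumption: "<domain> <title> <views> <bytes>", separated by single spaces.
    """
    fields = line.rstrip("\n").split(" ")
    return (
        len(fields) == 4
        and all(fields[:2])          # domain and title must be non-empty
        and fields[2].isdigit()      # view count
        and fields[3].isdigit()      # response size
    )


def find_invalid_lines(lines: Iterable[str]) -> Iterator[Tuple[int, str]]:
    """Yield (1-based line number, text) for every line that fails the check."""
    for number, line in enumerate(lines, start=1):
        if not is_valid_entry(line):
            yield number, line.rstrip("\n")


def scan_dump(path: str) -> Iterator[Tuple[int, str]]:
    """Scan a gzip-compressed dump file and yield its invalid lines."""
    with gzip.open(path, "rt", encoding="utf-8", errors="replace") as handle:
        yield from find_invalid_lines(handle)
```

With this check, both halves of the wrapped entry above are reported: the first line has only two fields and the second has only three. Calling `scan_dump` on a local copy of a dump file would list the affected line numbers, which can then be compared against the one reported here.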

Event Timeline

Aquameta created this task. Feb 25 2019, 6:30 PM
ArielGlenn triaged this task as Normal priority. Mar 4 2019, 12:43 PM
ArielGlenn added a project: Analytics.
Milimetric raised the priority of this task from Normal to High. Mar 4 2019, 4:35 PM
Milimetric moved this task from Incoming to Data Quality on the Analytics board.
Milimetric moved this task from Data Quality to Ops Week on the Analytics board.