Page MenuHomePhabricator

Augment refinery-dump-status-webrequest-partitions script to show more useful status of webrequest raw partitions
Closed, DeclinedPublic

Description

Currently, when this script outputs status about raw webrequest partitions, it prints and X for partitions that are not 100% good. Even if there is only 1 duplicate, an X will be shown. This makes it difficult to reason about the severity of data loss of duplicates in this report.

2015-03-12T00/1H ||    .   |    .   |    X   |    .   |    .   |

I'd like to have something more useful here. Maybe max(abs(percent_different)) if any percent_different field is non zero. Qchris says that to do this, we'd need to edit the dump_dataset_raw_webrequest_partition method in the script.

Event Timeline

Ottomata raised the priority of this task from to Low.
Ottomata updated the task description. (Show Details)
Ottomata added a project: Analytics-Clusters.
Ottomata added subscribers: Ottomata, JAllemandou, QChris.
Ottomata set Security to None.
Milimetric subscribed.

We usually need to look further if the email flags any problems, and it doesn't happen very often