Tracking task to handle the following event:
Data Loss ERROR - Workflow webrequest-load-check_sequence_statistics-wf-upload-2018-12-1-14 This is an ERROR. This job has failed and refine jobs have been cancelled. Please, have a look at the error file attached to this email and the statistics table wmf_raw.webrequest_sequence_stats_hourly for a detailed explanation, and take necessary action! Thanks :) -- Oozie
Attached data:
30 requests (0.0% of total) have incomplete records. 8041489 requests (6.038% of valid ones) were lost.
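As a rough sanity check on the scale of the alert, the figures above imply on the order of 133 million valid requests for the hour. The minimal Python sketch below assumes the percentage is computed as lost requests divided by valid requests; the exact denominator used by the check is not stated in the alert, so treat the result as an estimate.

```python
# Figures copied from the alert above.
lost_requests = 8041489
lost_percent = 6.038        # "% of valid ones"
incomplete_records = 30

# Assumption: lost_percent = 100 * lost_requests / valid_requests.
valid_requests = lost_requests / (lost_percent / 100)

# Share of incomplete records relative to that estimate; it rounds to 0.0%,
# consistent with the alert text.
incomplete_percent = 100 * incomplete_records / valid_requests

print(f"estimated valid requests: {valid_requests:,.0f}")
print(f"incomplete records: {incomplete_percent:.1f}%")
```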
This is probably an occurrence of the infamous dt:"-" issue (varnishkafka not being able to grab the Varnish request's Timestamp:Resp), which generates a false alarm, but it is better to investigate properly. In the meantime, dependent datasets (pageviews, etc.) will be held back for a bit before being generated.
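To illustrate how a dt:"-" record could show up as an apparent loss, here is a minimal Python sketch. It assumes the check compares, per host, the number of records seen in the hour against the range of sequence numbers, and that records without a usable timestamp do not land in the hour being checked. The record layout, the host name "cp1" and the function name `estimate_lost` are hypothetical and do not reproduce the actual Oozie/Hive job.

```python
from collections import defaultdict

def estimate_lost(records):
    """Return {hostname: apparently lost count} from (hostname, sequence, dt) tuples.

    Records whose dt is "-" are skipped, mimicking rows that cannot be
    assigned to the hour under inspection (an assumption of this sketch).
    """
    sequences = defaultdict(list)
    for hostname, sequence, dt in records:
        if dt == "-":
            continue
        sequences[hostname].append(sequence)

    lost = {}
    for hostname, seqs in sequences.items():
        # Sequence numbers are assumed to be contiguous per host.
        expected = max(seqs) - min(seqs) + 1
        lost[hostname] = expected - len(set(seqs))
    return lost

# Hypothetical sample: host "cp1" sent sequences 1..5, but number 3 has dt "-".
sample = [
    ("cp1", 1, "2018-12-01T14:00:01"),
    ("cp1", 2, "2018-12-01T14:00:02"),
    ("cp1", 3, "-"),
    ("cp1", 4, "2018-12-01T14:00:04"),
    ("cp1", 5, "2018-12-01T14:00:05"),
]
print(estimate_lost(sample))
```

In this sample, sequence 3 was delivered but is still counted as lost for "cp1", which is the false-alarm pattern suspected here.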