This Hive query and variants of it repeatedly fail after several hours with an error message like this in stage 2:
Stage-Stage-2: Map: 627 Reduce: 739 Cumulative CPU: 77.38 sec HDFS Read: 148778813 HDFS Write: 0 FAIL
The full log including the entire query is also pasted here: P5604
It happens both from stat1004 and from stat1002.
The strange this is that the problem goes away if one simply removes this clause in the WHERE condition of the outer query:
AND event_comment NOT REGEXP '\\{\\{R from '
(or if one replaces both clauses in that line with the one clause event_comment not regexp '[Rr]edir' - that's the version of @Milimetric's query I posted at T149021#3287887, and the failing version above results from applying the correction at T149021#3331707 ).