
Raw text webrequest partitions for 2014-12-30T20/1H not marked successful
Closed, Resolved (Public)

Description

The webrequest text partition *) for 2014-12-30T20/1H has not been
marked successful.

What happened?

*)

_________________________________________________________________
qchris@stat1002 // jobs: 0 // time: 10:32:28 // exit code: 0
cwd: ~
~/cluster-scripts/dump_webrequest_status.sh 150
  +------------------+--------+--------+--------+--------+
  | Date             |  bits  | mobile |  text  | upload |
  +------------------+--------+--------+--------+--------+
[...]
  | 2014-12-30T20/1H |    .   |    .   |    X   |    .   |
[...]
  +------------------+--------+--------+--------+--------+


Statuses:

  . --> Partition is ok
  M --> Partition manually marked ok
  X --> Partition is not ok (duplicates, missing, or nulls)

Event Timeline

QChris raised the priority of this task to Medium.
QChris updated the task description.
QChris added a project: Analytics-Clusters.
QChris added subscribers: Unknown Object (MLST), QChris, Milimetric and 2 others.

The problem only affects amssq42.esams.wmnet, which has 101 duplicates between
2014-12-30T20:50:33 and 2014-12-30T20:50:38.
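A per-host duplicate count and time window like the one above could be derived roughly as follows. This is a minimal sketch, not the actual check: it assumes each webrequest record carries a hostname, a per-host sequence number, and an ISO 8601 timestamp, and the field names, the function name, and the sample records are all hypothetical.

```python
from collections import Counter


def summarize_duplicates(records):
    """Return {hostname: (duplicate_count, first_dt, last_dt)}.

    `records` is an iterable of (hostname, sequence, dt) tuples. These
    field names are assumptions for illustration. `dt` must be an ISO 8601
    string in a single fixed format so that string order matches time order.
    """
    seen = Counter()
    summary = {}
    for hostname, sequence, dt in records:
        seen[(hostname, sequence)] += 1
        # Every occurrence after the first counts as one duplicate.
        if seen[(hostname, sequence)] > 1:
            count, first, last = summary.get(hostname, (0, dt, dt))
            summary[hostname] = (count + 1, min(first, dt), max(last, dt))
    return summary


# Hypothetical sample data, not taken from the real partition.
sample = [
    ("amssq42.esams.wmnet", 1, "2014-12-30T20:50:33"),
    ("amssq42.esams.wmnet", 2, "2014-12-30T20:50:34"),
    ("amssq42.esams.wmnet", 2, "2014-12-30T20:50:34"),
    ("amssq42.esams.wmnet", 3, "2014-12-30T20:50:38"),
    ("amssq42.esams.wmnet", 3, "2014-12-30T20:50:38"),
    ("other-host.example", 1, "2014-12-30T20:50:35"),
]
result = summarize_duplicates(sample)
```

Hosts without repeated sequence numbers do not appear in the result, so a single affected host stands out immediately.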

analytics1012:

[2014-12-30 20:52:02,145] 12790193314 [kafka-processor-9092-0] ERROR kafka.network.Processor  - Closing socket for /10.20.0.142 because of error
java.io.IOException: Connection reset by peer

analytics1022:

[2014-12-30 20:52:25,638] 12785829850 [kafka-processor-9092-1] ERROR kafka.network.Processor  - Closing socket for /2620:0:862:102:a6ba:dbff:fe2b:770e because of error
java.io.IOException: Connection reset by peer

Both 10.20.0.142 and 2620:0:862:102:a6ba:dbff:fe2b:770e belong to private1-esams.

10.20.0.142 is amssq42.

(No similar messages on the remaining brokers, analytics1018 and analytics1021.)
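The client addresses can be pulled out of broker log lines like the ones quoted above with a short script. This is a sketch under the assumption that the lines follow exactly the "Closing socket for /ADDRESS because of error" shape seen here; the function name is made up for illustration.

```python
import re

# Captures the client address from Kafka broker lines of the form
# "... Closing socket for /<address> because of error".
SOCKET_ERROR = re.compile(r"Closing socket for /(\S+) because of error")


def peers_with_socket_errors(log_lines):
    """Return the set of client addresses named in 'Closing socket' errors."""
    peers = set()
    for line in log_lines:
        match = SOCKET_ERROR.search(line)
        if match:
            peers.add(match.group(1))
    return peers


# The two error lines from the brokers, plus the exception line that
# follows them (which carries no address and is skipped).
lines = [
    "[2014-12-30 20:52:02,145] 12790193314 [kafka-processor-9092-0] ERROR "
    "kafka.network.Processor  - Closing socket for /10.20.0.142 because of error",
    "java.io.IOException: Connection reset by peer",
    "[2014-12-30 20:52:25,638] 12785829850 [kafka-processor-9092-1] ERROR "
    "kafka.network.Processor  - Closing socket for "
    "/2620:0:862:102:a6ba:dbff:fe2b:770e because of error",
]
peers = peers_with_socket_errors(lines)
```

Running the same function over each broker's log makes it easy to confirm which brokers saw resets and from which clients.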

QChris claimed this task.

Deduped the partition.
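The task does not say how the deduplication was done. Purely as an illustration, removing duplicates could look like the sketch below, assuming a record is uniquely identified by its hostname and sequence number. The field names and the helper are hypothetical.

```python
def dedupe(records):
    """Yield records, keeping only the first one per (hostname, sequence).

    `records` is an iterable of dicts with "hostname" and "sequence" keys;
    both key names are assumptions for illustration.
    """
    seen = set()
    for record in records:
        key = (record["hostname"], record["sequence"])
        if key not in seen:
            seen.add(key)
            yield record


# Hypothetical sample data with one repeated sequence number.
rows = [
    {"hostname": "amssq42.esams.wmnet", "sequence": 7},
    {"hostname": "amssq42.esams.wmnet", "sequence": 7},
    {"hostname": "amssq42.esams.wmnet", "sequence": 8},
]
unique_rows = list(dedupe(rows))
```

Keeping the first occurrence preserves the original record order, which matters if downstream jobs rely on it.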