Page MenuHomePhabricator

Add monitoring metric for connection tracking table usage
Closed, ResolvedPublic

Description

Over the weekend, the kafka* servers saw a sharp increase in tracked connections (T131028)., which has now settled down to normal levels. We should have a monitoring metric (Diamond collector?) which reads the current table usage from /proc/sys/net/netfilter/nf_conntrack_count, so that we can track down and correlate changes.

Event Timeline

Change 280265 had a related patch set uploaded (by Elukey):
Add diamond nf_conntrack counter.

https://gerrit.wikimedia.org/r/280265

Change 280265 merged by Elukey:
Add diamond nf_conntrack counter.

https://gerrit.wikimedia.org/r/280265

So this now requires enabling, right ?

diamond::collector { 'NfConntrackCount': }
akosiaris triaged this task as Medium priority.Apr 20 2016, 11:30 AM
elukey claimed this task.

Already added, the only remaining step was to decide if the collector needs to be deployed everywhere or not, but it is not in the scope of the ticket. Closing!