MaxConnTrack Netfilter: Maximum number of allowed connection tracking entries alert on cloudvirt1060:9100
Closed, Resolved (Public)

Description

Common information

  • alertname: MaxConnTrack
  • cluster: wmcs
  • instance: cloudvirt1060:9100
  • job: node
  • prometheus: ops
  • severity: critical
  • site: eqiad
  • source: prometheus
  • team: wmcs
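
For context, the condition behind a MaxConnTrack-style alert can be checked by hand on the affected host. The sketch below reads the kernel's standard `nf_conntrack_count` and `nf_conntrack_max` sysctl files and reports how full the connection tracking table is. The helper names and the 0.8 threshold are assumptions for illustration; the actual alert rule and its threshold are not shown in this task.

```python
from pathlib import Path

# Standard Linux procfs location for the netfilter conntrack counters.
PROC_NETFILTER = Path("/proc/sys/net/netfilter")


def conntrack_usage(base: Path = PROC_NETFILTER) -> float:
    """Return the fraction of the conntrack table currently in use."""
    count = int((base / "nf_conntrack_count").read_text().strip())
    limit = int((base / "nf_conntrack_max").read_text().strip())
    if limit <= 0:
        raise ValueError("nf_conntrack_max must be positive")
    return count / limit


def is_near_limit(usage: float, threshold: float = 0.8) -> bool:
    # The 0.8 default is a hypothetical threshold, not the real alert's value.
    return usage >= threshold


if __name__ == "__main__":
    try:
        usage = conntrack_usage()
    except OSError:
        # The files are absent when the nf_conntrack module is not loaded.
        print("conntrack counters not readable (is nf_conntrack loaded?)")
    else:
        print(f"conntrack table usage: {usage:.1%}")
        if is_near_limit(usage):
            print("usage is at or above the illustrative threshold")
```

Run on the host named in the alert, this prints the current usage as a percentage, which gives a quick way to confirm whether the table is still close to its limit before and after remediation.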

Firing alerts


Event Timeline

taavi subscribed.

This is a new alert, but it seems like a genuine issue regardless.

Mentioned in SAL (#wikimedia-cloud-feed) [2024-01-15T14:27:59Z] <taavi@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.safe_reboot (T355061)

Mentioned in SAL (#wikimedia-cloud-feed) [2024-01-15T14:32:27Z] <taavi@cloudcumin1001> END (FAIL) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=99) (T355061)

Mentioned in SAL (#wikimedia-cloud-feed) [2024-01-15T14:32:55Z] <taavi@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.safe_reboot (T355061)

Mentioned in SAL (#wikimedia-cloud-feed) [2024-01-15T14:37:07Z] <taavi@cloudcumin1001> END (FAIL) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=99) (T355061)

Mentioned in SAL (#wikimedia-cloud-feed) [2024-01-16T11:02:06Z] <taavi@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.safe_reboot (T355061)

Mentioned in SAL (#wikimedia-cloud-feed) [2024-01-16T11:22:47Z] <taavi@cloudcumin1001> END (FAIL) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=99) (T355061)

Cautiously resolving after a reboot.