Page MenuHomePhabricator

[infra,k8s,o11y] introduce additional observability for calico and general networking
Open, MediumPublic

Description

Sometimes it is unclear if calico is working as expected.

Also, there is a warning that is unclear if it is concerning or not:

calico-node 2024-11-26 10:45:54.480 [WARNING][87] felix/table.go 680: Chain had unexpected inserts, marking for resync actualRuleIDs=[]string{"", "0i8pjzKKPyA34aQD"} chainName="POSTROUTING" ipVersion=0x4 table="nat"

Also, we would like to get information to know if network connections are working as expected, for flows that cross out of the pod network, for example:

  • ldap-ro.eqiad.wikimedia.org
  • gitlab.wikimedia.org
  • gerrit.wikimedia.org
  • phabricator.wikimedia.org
  • any randomly chosen wiki
  • Checking connectivity to frequently used external services such as irc.libera.chat, github.com, packagist.org, pypi.org, and npmjs.com would be nice to haves as well.

Event Timeline

aborrero triaged this task as Medium priority.
aborrero moved this task from Backlog to Radar/observer on the User-aborrero board.
aborrero renamed this task from toolforge: introduce additional observability for calico to toolforge: introduce additional observability for calico and general networking.Nov 27 2024, 1:44 PM
aborrero updated the task description. (Show Details)
dcaro renamed this task from toolforge: introduce additional observability for calico and general networking to [infra,k8s,o11y] introduce additional observability for calico and general networking.Nov 27 2024, 2:22 PM
dcaro edited projects, added Toolforge; removed Cloud-VPS.