Istio comes with two webhooks by default:
- A mutating webhook, istio-sidecar-injector, which we can potentially ignore as we don't use sidecar injection
- A validating webhook, istiod-istio-system, which is used to validate Istio CRD objects
We cannot ignore the latter: installing Istio already triggers a validation request, and the kube-apiserver will fail to call the service backing the webhook because the Service ClusterIP is not reachable from the masters. This is the scenario described in T285927.
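For reference, the validating webhook registration looks roughly like this (a simplified, illustrative sketch; the exact names, rules and CA bundle come from the Istio install):

```yaml
apiVersion: admissionregistration.k8s.io/v1
kind: ValidatingWebhookConfiguration
metadata:
  name: istiod-istio-system
webhooks:
  - name: validation.istio.io
    clientConfig:
      service:
        name: istiod            # kube-apiserver calls this Service's ClusterIP --
        namespace: istio-system # which is exactly what is unreachable from the masters
        path: /validate
    rules:
      - apiGroups: ["networking.istio.io", "security.istio.io"]
        apiVersions: ["*"]
        operations: ["CREATE", "UPDATE"]
        resources: ["*"]
    failurePolicy: Fail   # requests that can't reach istiod are rejected
    sideEffects: None
    admissionReviewVersions: ["v1"]
```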
After chatting with @akosiaris we came up with four possible solutions:
1. Announce Kubernetes Service IPs via BGP (calico) so they are reachable from outside the cluster
This is what we currently have in staging-codfw as part of the work done in T238909.
Pro:
- No additional components needed on kubernetes masters
- Shares the traffic flow with how service traffic would reach the cluster (we're not sure about this yet)
Con:
- Depends on calico properly announcing Kubernetes Service IPs (which we have not fully implemented yet)
- Needs --masquerade-all on nodes (which effectively hides the real client IP from Pods, https://github.com/kubernetes/kubernetes/issues/24224)
- Needs the calico-announced service IPs to be highly available; this is being worked on upstream.
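On the calico side this is a small configuration change; a sketch, assuming we use the BGPConfiguration serviceClusterIPs field (the CIDR below is illustrative and has to match the cluster's actual service CIDR):

```yaml
apiVersion: projectcalico.org/v3
kind: BGPConfiguration
metadata:
  name: default
spec:
  # Advertise the whole service CIDR (and thus all ClusterIPs)
  # to the configured BGP peers, e.g. the core routers.
  serviceClusterIPs:
    - cidr: 10.96.0.0/12   # illustrative; must match --service-cluster-ip-range
```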
2. Make Kubernetes Masters (tainted) worker nodes
This is what @elukey has implemented for ml clusters in T285927.
Pro:
- Shares the traffic flow with other intra-cluster traffic to ClusterIPs
- Kubernetes masters are known to the Kubernetes API (e.g. we can control access via NetworkPolicies and run dedicated workloads on them - istio control plane for example)
- No dependency on calico announcing Kubernetes Service IPs (like with 1.)
Con:
- Lots of new dependencies on the masters (kube-proxy, kubelet and docker), making them considerably more complex and resource-hungry.
- Makes it easier for workload on the nodes to reach the master on various ports (e.g. as a result of a bug in iptables rules manipulation). This is not theoretical, a CVE already exists: https://discuss.kubernetes.io/t/security-advisory-cve-2020-8558-kubernetes-node-setting-allows-for-neighboring-hosts-to-bypass-localhost-boundary/11788
- Will make Kubernetes Masters BGP peer with core routers. Can/should we prevent that?
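If we go this route, the masters would carry the usual NoSchedule taint, and only workloads we explicitly want there (e.g. the Istio control plane) would opt in via a toleration; a sketch, assuming the standard node-role taint key:

```yaml
# Pod spec fragment for a workload pinned to the (tainted) masters
spec:
  nodeSelector:
    node-role.kubernetes.io/master: ""
  tolerations:
    - key: node-role.kubernetes.io/master
      operator: Exists
      effect: NoSchedule
```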
3. Run kube-proxy on Kubernetes Masters
Just run the kube-proxy process on Kubernetes Masters, essentially providing them with the needed iptables rules to reach ClusterIP services.
Pro:
- Shares the traffic flow with other intra-cluster traffic to ClusterIPs
- Less additional components than 2.
- No dependency on calico announcing Kubernetes Service IPs (like with 1.)
Con:
- Additional process (kube-proxy) is needed on the masters (making them more complex plus requiring some puppet work)
- Makes it easier for workload on the nodes to reach the master on various ports (e.g. as a result of a bug in iptables rules manipulation). This is not theoretical, a CVE already exists: https://discuss.kubernetes.io/t/security-advisory-cve-2020-8558-kubernetes-node-setting-allows-for-neighboring-hosts-to-bypass-localhost-boundary/11788
- Makes it a bit more complex to reason about masters being part of the cluster.
- A scenario that has been tested (in WMCS) but is not really supported upstream.
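A minimal sketch of what the standalone kube-proxy would need (values are illustrative; the kubeconfig path and cluster CIDR would come from our puppetization):

```yaml
apiVersion: kubeproxy.config.k8s.io/v1alpha1
kind: KubeProxyConfiguration
clientConnection:
  kubeconfig: /etc/kubernetes/kube-proxy.conf   # illustrative path
mode: iptables        # program ClusterIP DNAT rules via iptables
clusterCIDR: 10.64.0.0/12   # illustrative; the cluster's pod CIDR
```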
4. Work around this issue by disabling webhooks
With the outcome of T287007#7431081 this is no longer a viable option.
As we potentially won't use Istio CRDs to configure Ingress in the first place (see the Configuration part of T287007), we could try to work around this requirement by disabling/not deploying the webhooks at all. I'm not sure if that is possible, though.
Pro:
- No additional components on the masters
- No dependency on calico announcing Kubernetes Service IPs (like with 1.)
- No dependency on istiod (serving the webhooks) from the kube-apiserver
Con:
- A hard deviation from the standard Istio setup
- We might have to revisit this problem/decision later (for things like OPA or other alternatives to PSPs: T273507)
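For the record, if this were possible at all, it would presumably look something like the following in an IstioOperator manifest (assuming the global.configValidation and sidecarInjectorWebhook.enabled values still exist in the Istio version we install; not verified):

```yaml
apiVersion: install.istio.io/v1alpha1
kind: IstioOperator
spec:
  values:
    global:
      configValidation: false   # assumed knob: skip the validating webhook
    sidecarInjectorWebhook:
      enabled: false            # assumed knob: skip the mutating webhook
```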
We're going with option 2. TODOs:
- Migrate staging-eqiad
- Migrate codfw
- Migrate eqiad
- Remove unused master.pp parameter profile::kubernetes::master::expose_puppet_certs