| | Status | Subtype | Assigned | Task |
|---|---|---|---|---|
| | In Progress | | Raymond_Ndibe | T362867 [infra,k8s] Upgrade Toolforge Kubernetes to version 1.28 |
| | Resolved | | Raymond_Ndibe | T359641 [infra,k8s] Upgrade Toolforge Kubernetes to version 1.27 |
| | | | | Restricted Task |
| | Resolved | | Slst2020 | T327025 [infra,k8s] Upgrade Toolforge Kubernetes to version 1.26 |
| | Resolved | | aborrero | T316107 [infra,k8s] Upgrade Toolforge Kubernetes to version 1.25 |
| | Resolved | | aborrero | T279110 [infra] Replace PodSecurityPolicy in Toolforge Kubernetes |
| | Resolved | | aborrero | T364297 [k8s,infra] track PSP migration plan |
| | Resolved | | aborrero | T364312 [maintain-kubeusers,infra,k8s]: introduce some logic to backfill maintain-kubeuser resources (like per-tool kyverno policies) |
| | Resolved | | aborrero | T366564 toolforge: new maintain-kubeusers takes long time to loop over all the accounts to reconcile them |
| | Resolved | | aborrero | T366598 maintain-kubeusers: metrics, monitoring and alerting |
| | Resolved | | aborrero | T367332 toolforge maintain-kubeusers backtrace |
| | Resolved | | Andrew | T367348 Incident: 2024-06-12 toolforge k8s control plane |
| | Duplicate | | dcaro | T367349 Fix HA proxy load-balancer health check monitor to not poll nodes where the API is not responding |
| | Resolved | | aborrero | T367350 [k8s,infra] Verify that kyverno policies are evaluated only for namespaced resources |
| | Resolved | | aborrero | T367386 [k8s,infra] kyverno has a track record of overloading the cluster, maybe on new ways |
| | Resolved | | aborrero | T367388 [k8s,infra] consider scaling the k8s control plane |
| | Resolved | | aborrero | T367389 [k8s,infra,alerting] improve HAproxy and k8s apiserver interaction |
| | Declined | | aborrero | T367950 Decision Request - Toolforge pod security via custom admission webhook |
| | Declined | | aborrero | T367952 toolforge: drop kyverno |
| | Declined | | aborrero | T367985 toolforge: create a new custom admission webhook to handle pod security settings |
| | Resolved | | aborrero | T368044 Toolforge: redeploy kyverno after the outage |
| | Resolved | | aborrero | T368141 toolforge: kyverno: change policies to Enforce |
| | Resolved | | aborrero | T368142 Toolforge: drop PodSecurityPolicy |