
Add Istio (and related) config to allow LW isvcs to talk to ML Cassandra machines
Open, Needs Triage | Public | 5 Estimated Story Points

Description

This should mostly require a network policy, but other configuration may be needed as well.
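
For orientation, a minimal sketch of what such an egress rule can look like, written as a standard Kubernetes NetworkPolicy. This is an assumption-laden illustration, not the policy from the patches below, which generate their policies through the external-services-networkpolicy module. The policy name, namespace, pod label, and address range are all hypothetical placeholders. Port 9042 is the default Cassandra CQL native transport port.

```yaml
# Illustrative sketch only; every name and address here is a placeholder.
apiVersion: networking.k8s.io/v1
kind: NetworkPolicy
metadata:
  name: allow-isvc-to-ml-cassandra   # hypothetical policy name
  namespace: example-ml-namespace    # hypothetical namespace
spec:
  # Select the inference service pods that need Cassandra access.
  podSelector:
    matchLabels:
      app: example-isvc              # hypothetical pod label
  policyTypes:
    - Egress
  egress:
    - to:
        # Documentation-range CIDR standing in for the Cassandra hosts.
        - ipBlock:
            cidr: 192.0.2.0/24
      ports:
        # Default CQL native transport port.
        - protocol: TCP
          port: 9042
```

In a cluster with an Istio sidecar or a default-deny egress setup, a rule like this may not be sufficient on its own, which is the "other configuration" the description anticipates.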

Event Timeline

Change #1012668 had a related patch set uploaded (by Klausman; author: Klausman):

[operations/deployment-charts@master] admin_ng: Add network policy to allow LW isvcs to access ML Cassandra

https://gerrit.wikimedia.org/r/1012668

klausman set the point value for this task to 1. (Mar 19 2024, 2:35 PM)
klausman moved this task from Unsorted to In Progress on the Machine-Learning-Team board.
klausman removed a project: Epic.

Change #1012696 had a related patch set uploaded (by Klausman; author: Klausman):

[operations/puppet@production] profile::thanos: Fix broken regex for istio latency bucket RR

https://gerrit.wikimedia.org/r/1012696

Change #1012696 merged by Filippo Giunchedi:

[operations/puppet@production] profile::thanos: Fix broken regex for istio latency bucket RR

https://gerrit.wikimedia.org/r/1012696

Change #1012668 merged by jenkins-bot:

[operations/deployment-charts@master] admin_ng: Add network policy to allow LW isvcs to access ML Cassandra

https://gerrit.wikimedia.org/r/1012668

Change #1015006 had a related patch set uploaded (by Klausman; author: Klausman):

[operations/puppet@production] deployment_server: Add external service block for Cassandra/ml-cache

https://gerrit.wikimedia.org/r/1015006

Change #1015008 had a related patch set uploaded (by Brouberol; author: Brouberol):

[operations/deployment-charts@master] external-services: enable in all ml k8s clusters

https://gerrit.wikimedia.org/r/1015008

Change #1015008 merged by Brouberol:

[operations/deployment-charts@master] external-services: enable in all ml k8s clusters

https://gerrit.wikimedia.org/r/1015008

Change #1015006 merged by Klausman:

[operations/puppet@production] deployment_server: Add external service block for Cassandra/ml-cache

https://gerrit.wikimedia.org/r/1015006

Change #1015021 had a related patch set uploaded (by Brouberol; author: Brouberol):

[operations/deployment-charts@master] external-services: create namespace in aux/ml clusters

https://gerrit.wikimedia.org/r/1015021

Change #1015021 merged by Brouberol:

[operations/deployment-charts@master] external-services: create namespace in aux/ml clusters

https://gerrit.wikimedia.org/r/1015021

Mentioned in SAL (#wikimedia-operations) [2024-03-27T12:33:50Z] <brouberol> redeploying external-services in all k8s clusters to account for the newly exposed ml-cassandra cluster - T360428

Change #1015029 had a related patch set uploaded (by Klausman; author: Klausman):

[operations/deployment-charts@master] charts/kserve-inference: Wire up generated network policy for LW services

https://gerrit.wikimedia.org/r/1015029

Change #1015074 had a related patch set uploaded (by Klausman; author: Klausman):

[operations/deployment-charts@master] modules: Add new version of external-services-networkpolicy

https://gerrit.wikimedia.org/r/1015074

Change #1015077 had a related patch set uploaded (by Klausman; author: Klausman):

[operations/deployment-charts@master] modules: add v1.0.1 of external-services-networkpolicy in prep for 1015074

https://gerrit.wikimedia.org/r/1015077

Change #1015074 had a related patch set uploaded (by Klausman; author: Klausman):

[operations/deployment-charts@master] modules: Change external-services-networkpolicy to allow specifying appname

https://gerrit.wikimedia.org/r/1015074

Change #1015077 merged by jenkins-bot:

[operations/deployment-charts@master] modules: add v1.0.1 of external-services-networkpolicy in prep for 1015074

https://gerrit.wikimedia.org/r/1015077

Change #1015074 merged by jenkins-bot:

[operations/deployment-charts@master] modules: Change external-services-networkpolicy to allow specifying appname

https://gerrit.wikimedia.org/r/1015074

Change #1015029 merged by jenkins-bot:

[operations/deployment-charts@master] charts/kserve-inference: Wire up generated network policy for LW services

https://gerrit.wikimedia.org/r/1015029

Change #1015292 had a related patch set uploaded (by Klausman; author: Klausman):

[operations/deployment-charts@master] ml-servics/experimental: Fix transposed app name for netpolicy

https://gerrit.wikimedia.org/r/1015292

Change #1015292 merged by jenkins-bot:

[operations/deployment-charts@master] ml-servics/experimental: Fix transposed app name and chart version

https://gerrit.wikimedia.org/r/1015292

Change #1015333 had a related patch set uploaded (by Brouberol; author: Brouberol):

[operations/deployment-charts@master] rbac: grant RBAC perms on calico networkpolicis to the kserve-deploy clusterrole

https://gerrit.wikimedia.org/r/1015333

Change #1015333 merged by Brouberol:

[operations/deployment-charts@master] rbac: grant RBAC perms on calico networkpolicis to the kserve-deploy clusterrole

https://gerrit.wikimedia.org/r/1015333

Change #1015340 had a related patch set uploaded (by Klausman; author: Klausman):

[operations/deployment-charts@master] charts/kserve-inference: move netpol generation outside the service loop

https://gerrit.wikimedia.org/r/1015340

Change #1015340 merged by jenkins-bot:

[operations/deployment-charts@master] charts/kserve-inference: move netpol generation outside the service loop

https://gerrit.wikimedia.org/r/1015340

Change #1020194 had a related patch set uploaded (by Klausman; author: Klausman):

[operations/puppet@production] deployment_server: Change Puppet query for ML Cassandra Clusters

https://gerrit.wikimedia.org/r/1020194

Change #1021895 had a related patch set uploaded (by Klausman; author: Klausman):

[operations/deployment-charts@master] ml-services: tweak reference to ML Cassandra clusters

https://gerrit.wikimedia.org/r/1021895

Change #1020194 merged by Klausman:

[operations/puppet@production] deployment_server: Add Cassandra to autogenerated external svcs

https://gerrit.wikimedia.org/r/1020194

klausman changed the point value for this task from 1 to 5. (Tue, Apr 23, 2:44 PM)
klausman set Final Story Points to 5.

Change #1021895 merged by jenkins-bot:

[operations/deployment-charts@master] ml-services: tweak reference to ML Cassandra clusters

https://gerrit.wikimedia.org/r/1021895