We have an ML development sandbox running the WMF KServe stack on Cloud VPS.
Info: https://wikitech.wikimedia.org/wiki/User:Accraze/MachineLearning/ML-Sandbox
Currently we can run an inference service (isvc) that has only a predictor, and it works well. When we add a transformer to the isvc spec, requests fail with a 503 Service Unavailable.
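For reference, the kind of spec that triggers the problem can be sketched as below. This is a minimal illustration, not the sandbox's actual manifest: it assumes the `serving.kserve.io/v1beta1` API group and custom containers for both components, and the image names are hypothetical placeholders.

```python
import json


def build_isvc(name, namespace, predictor_image, transformer_image):
    """Build an InferenceService manifest with a predictor and a transformer.

    Assumption: the cluster serves the serving.kserve.io/v1beta1 API and both
    components run as custom containers. The images are placeholders.
    """
    return {
        "apiVersion": "serving.kserve.io/v1beta1",
        "kind": "InferenceService",
        "metadata": {"name": name, "namespace": namespace},
        "spec": {
            # With only this block present, the service responds as expected.
            "predictor": {
                "containers": [
                    {"name": "kserve-container", "image": predictor_image}
                ]
            },
            # Adding this block is what leads to the 503 on the top-level hostname.
            "transformer": {
                "containers": [
                    {"name": "kserve-container", "image": transformer_image}
                ]
            },
        },
    }


def to_json(manifest):
    """Serialize the manifest to JSON, which kubectl accepts alongside YAML."""
    return json.dumps(manifest, indent=2)
```

The output of `to_json(build_isvc(...))` can be saved to a file and applied with `kubectl apply -f`.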
Digging a bit deeper, it seems that we can reach both the predictor and transformer endpoints directly
(e.g. enwiki-articlequality-predictor-default.kserve-test.example.com).
However, when we use the standard top-level service hostname (e.g. enwiki-articlequality.kserve-test.example.com), we get a 503.
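The comparison above can be reproduced with a small probe script. This is a rough sketch under a few assumptions: the transformer hostname follows the same `-transformer-default` pattern as the predictor, requests go through an ingress that routes on the `Host` header, and the ingress URL in `main()` is a hypothetical placeholder for wherever the sandbox gateway is exposed.

```python
import urllib.error
import urllib.request


def isvc_hostnames(name, namespace, domain):
    """Return the three hostnames to compare for one InferenceService.

    Assumption: the transformer hostname mirrors the predictor pattern.
    """
    base = f"{namespace}.{domain}"
    return {
        "service": f"{name}.{base}",
        "predictor": f"{name}-predictor-default.{base}",
        "transformer": f"{name}-transformer-default.{base}",
    }


def probe(ingress_url, host, timeout=5.0):
    """GET ingress_url with the given Host header and return the HTTP status."""
    request = urllib.request.Request(ingress_url, headers={"Host": host})
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # 4xx/5xx responses are raised as HTTPError; the code is what we want.
        return err.code


def main():
    # Hypothetical ingress address; adjust to the sandbox's gateway.
    ingress_url = "http://localhost:8080/v1/models/enwiki-articlequality"
    hosts = isvc_hostnames("enwiki-articlequality", "kserve-test", "example.com")
    for role, host in hosts.items():
        try:
            print(role, host, probe(ingress_url, host))
        except OSError as err:
            # Covers connection failures (URLError) and timeouts.
            print(role, host, "unreachable:", err)
```

Calling `main()` prints one status per hostname, which makes it easy to see whether only the top-level hostname returns 503.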
This is most likely due to how our cluster-local-gateway is configured in Istio. The top-level KServe isvc should route the incoming request to the transformer, which should then call the predictor.