As far as I can see in kfserving/storage.py, it seems that the best way for us to make models available to Kfserving/Kubeflow is via Swift.
Some general questions to answer:
- Does kfserving support Swift? On paper yes since Swift supports the S3 API, but we should verify.
- Is SRE onboard with us storing models on Swift? (from the capacity perspective, bandwidth, etc..). If so, we should reach out to SRE to ask for more info :)
- Storing private models on Swift is surely good, but how should we do for public models? Should we store them in Swift, and somehow make them available via Commons or similar?
- How do we deploy models? Should we allow people to just push them to Swift?