The Huggingface server has been bumped to PyTorch 2.3.0. This also allows us to use one of the latest ROCm versions.
To do so, we will need a new PyTorch base image from production images.
The procedure to follow is the one described in the README of inference services:
- Sync the Wikimedia fork: https://github.com/wikimedia/kserve
- Install the new dependencies and update the requirements.txt file.
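
As a rough illustration of the requirements update step, the sketch below rewrites a version pin in the text of a requirements file. The helper name `bump_pin` and the sample pins other than PyTorch 2.3.0 are hypothetical and not taken from the inference services repository; the actual procedure remains the one in the README.

```python
import re


def bump_pin(requirements_text: str, package: str, version: str) -> str:
    """Pin `package` to `version`, appending the pin if the package is not listed."""
    # Matches e.g. "torch==2.2.1" or "torch>=2.0", but not "torchvision==0.17.1".
    pattern = re.compile(
        rf"^{re.escape(package)}\s*(?:[=<>!~]=?\s*[\w.*+!-]+)?\s*$",
        re.IGNORECASE,
    )
    pinned = f"{package}=={version}"
    lines = requirements_text.splitlines()
    found = False
    for i, line in enumerate(lines):
        if pattern.match(line.strip()):
            lines[i] = pinned
            found = True
    if not found:
        # Package was not listed at all, so add the pin at the end.
        lines.append(pinned)
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Hypothetical file contents, for illustration only.
    sample = "kserve==0.11.2\ntorch==2.2.1\ntorchvision==0.17.1\n"
    print(bump_pin(sample, "torch", "2.3.0"), end="")
```

The function works on text rather than on a file path, so the result can be inspected before it is written back. It only handles simple one-requirement-per-line entries without extras, environment markers, or trailing comments.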