The current status of the end-to-end model retraining is:
- Data-generation pipeline ✅
- Tone-check retraining code ✅
- CI/CD pipelines ✅
- Model retraining DAG ✅
- Export retrained model inside PVC ✅
In order to proceed with the next steps we need to export the retrained version of the model to a space (e.g. S3 bucket) outside the PVC.
After we export the new version of the retrained model we can move forward to evaluate and test it in order to proceed to the next operations for model productionisation.
Decisions need to be made:
- Where it would be better to build the functionality of exporting the model:
- Inside the retraining docker image: ml-pipelines
- Directly on DAG: Airflow-DAGs
- Permissions and infra related actions:
- Does the pod running the retraining image have R/W access to S3?
- Is it possible to export it directly from the PVC to the bucket?
- Actions on S3:
- Create a generic bucket for exporting there all models
