Labels generated by machine vision (MV) for Commons images will be requested from one or more MV providers. We need to answer the following lifecycle questions about this data:
- How long should we retain candidates? Forever, or can they be dropped after a certain time?
  - The answer will depend in part on the requirements for promotion to SDC and for model feedback.
- How does the retention period affect how candidates should be stored?
- Should previously fetched data be refreshed, and if so, when?