Update the batch upload document based on the experience from the first batch uploads.
For the generic phases and tasks we could borrow from the open standard CRISP-DM and then sort and fill in our specialized tasks; see the overview on Wikipedia and a practical guide.
Using the images from GAR as an example, identify tasks, subtasks and possible checkboxes to set up a scalable workflow for batch uploads in the Connected Open Heritage project. Preferably, a set of such tasks, subtasks etc. could be copy-pasted as a Connected-Open-Heritage-Batch-Uploads project containing e.g.
- Data cleaning and transformation
  - Check already uploaded images on Commons
  - Check and clean metadata file
  - Store raw image files safely
  - Review and link in mapping files from editors/domain experts
  ...
- Set up Institution templates on Commons
  - Cooperation template COH
  - Institution
  - Media uploaded from Institution
  ...
- Set up Institution on Wikidata
  - Institution item
  - Properties etc.
  ...
- Manual test upload of one image
  - Review by...
  ...
- Scripts and documentation
  - Clone earlier project folders
  ...
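
To make the copy-paste idea concrete, here is a minimal sketch of how such a task template could be kept as data and rendered as a checkbox list for each new batch upload. The names `WORKFLOW` and `render_checklist` are hypothetical, and the `- [ ]` checkbox syntax is an assumption about the target tracker. Only the tasks already listed above are included; the "..." placeholders are left unfilled.

```python
# Hypothetical template: phases and subtasks are copied from the list above.
# The "..." placeholders in that list are deliberately not filled in.
WORKFLOW = {
    "Data cleaning and transformation": [
        "Check already uploaded images on Commons",
        "Check and clean metadata file",
        "Store raw image files safely",
        "Review and link in mapping files from editors/domain experts",
    ],
    "Set up Institution templates on Commons": [
        "Cooperation template COH",
        "Institution",
        "Media uploaded from Institution",
    ],
    "Set up Institution on Wikidata": [
        "Institution item",
        "Properties etc.",
    ],
    "Manual test upload of one image": [
        "Review by...",
    ],
    "Scripts and documentation": [
        "Clone earlier project folders",
    ],
}


def render_checklist(workflow):
    """Render phases and subtasks as a nested list of unchecked boxes."""
    lines = []
    for phase, subtasks in workflow.items():
        lines.append(f"- [ ] {phase}")
        for subtask in subtasks:
            # Two-space indent marks the subtask as a child of the phase.
            lines.append(f"  - [ ] {subtask}")
    return "\n".join(lines)


if __name__ == "__main__":
    print(render_checklist(WORKFLOW))
```

Keeping the template as plain data means new phases or subtasks identified during later batch uploads can be added in one place and reused for the next institution.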