The first step for Structured Data on Commons is adding a multilingual plaintext caption field (we're using the 'label' field in the MediaInfo extension)
File pages need to be findable using the data in this field
This is the master ticket for the work
This is the User Story (saga) for this ticket: