We have settled on the following command to scan the backlog of files on commonswiki:
mwscript extensions/MediaModeration/maintenance/scanFilesInScanTable.php --wiki=commonswiki --use-jobqueue --sleep 30 --verbose
This script is currently being run "manually" via a tmux session on maint2002. However, this means that the script:
- Will in a weeks time be running on a no longer released version of mediawiki
- Is not resistant to interruptions, such as restarts of the maint2002 hosts or changes of the active DC.
Instead this script should be run through a puppet configuration such that the job is:
- Restarted semi-frequently to ensure updates to the script are applied
- Resistant to being randomly interrupted, as it would automatically restart