1. Develop an extension that can be invoked through the job queue when image/video/audio files are uploaded or can be called through a maintenance script to process already uploaded image/video/audio files.
2. The tool would use PhotoDNA to compare the files with hashes in the industry-wide data bases for child protection and terrorism content. The tool would pull directly from the hash corpus using the API, if feasible.
3. If a match between the corpus and the file is detected, the tool will:
* Send an email with the link to the matching media file to the Trust and Safety team to review.
* If the content is flagged as a child protection issue, the tool will automatically delete (takedown) the matched content in a parallel action to the email.
* Terrorism content will not trigger an automatic takedown.
* There should be a flag that can be set to determine if the tool is automatically taking down content or flagging so testing can be done of both methods
4. It is preferable that this functionality be built into an extension that is invoked asynchronously through the job queue after a file is uploaded. It should not prevent the file from being uploaded and processed. Processing would happen in the background.
[x] Create git project on gerrit
[x] Create a wireframe.
[] PhotoDNA integration - In Progress
[] Use JobSpecification to run requests asynchronously
[] Define the stragegy for deleting files.
[] Send emails to "Trust and Safety team"
[] Create a page on MediaWiki:Extension portal (https://www.mediawiki.org/wiki/Category:All_extensions, https://www.mediawiki.org/wiki/Extension:Name)
[] TBD