We are looking for data that tracks how much multimedia content is added to Wikipedia articles through different vectors (Visual Editor, direct Wikitext editing, or bots). We'd like to use the existing edit tags for each vector to tell them apart, and to create a dashboard that visualizes this data.
This will help inform decisions about which avenues to pursue for adding more media to articles in order to fulfill the requirements of the SDAW grant. In particular, it will help us track our progress toward the grant's "media added to 5 million content pages" requirement.
The first grant report is due June 1, 2021, so we'd like to have these measurements in advance of that.
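As a rough illustration of the tag-based split described above, here is a minimal Python sketch. It assumes each media-adding edit is already available as a record with a set of change tags and a bot flag. The record shape, the `is_bot` field, and the function names are hypothetical. The tag names `visualeditor` and `visualeditor-wikitext` are assumed to be the ones applied on the wikis of interest, and bots are assumed to be identified by a flag rather than a tag. How an edit is detected as "adding media" is out of scope here.

```python
from collections import Counter

# Assumed change tag for edits saved from the visual editor.
VE_TAG = "visualeditor"


def classify_vector(tags, is_bot=False):
    """Return the vector ("bot", "visualeditor", or "wikitext") for one edit.

    `tags` is an iterable of change-tag names attached to the edit.
    `is_bot` is a hypothetical flag supplied by the caller.
    """
    if is_bot:
        return "bot"
    if VE_TAG in set(tags):
        return "visualeditor"
    # Everything else, including edits tagged "visualeditor-wikitext",
    # is counted as direct wikitext editing in this sketch.
    return "wikitext"


def count_by_vector(edits):
    """Count media-adding edits per vector.

    `edits` is an iterable of dicts with hypothetical keys
    "tags" (list of str) and "is_bot" (bool).
    """
    return Counter(
        classify_vector(edit.get("tags", ()), edit.get("is_bot", False))
        for edit in edits
    )


if __name__ == "__main__":
    sample = [
        {"tags": ["visualeditor"], "is_bot": False},
        {"tags": ["visualeditor-wikitext"], "is_bot": False},
        {"tags": [], "is_bot": True},
        {"tags": [], "is_bot": False},
    ]
    print(count_by_vector(sample))
```

Checking the bot flag first means a bot edit is never counted under an editor vector. Whether that is the right precedence for the grant metric is a judgment call.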
Original Task:
From the parent task: What are the vectors for how multimedia content gets added to Wikipedia articles? Is it Visual Editor, direct Wikitext editing, or bots? We have dashboards for this information for edits in general but it's not granular enough for us to distinguish multimedia additions.
We'd also like to explore whether we can differentiate media that was uploaded as part of the edit versus media that is searched for via image search in VE. See @nettrom_WMF's comment in T266067#6634887:
There's an open question about whether we should differentiate between uploads and adding media. I think that depends on how uploads are logged by MediaWiki. If a user uploads two files through VE and adds them into an article, does that show up as two file uploads and one edit in the system? Are those uploads tagged in a way that makes them easy to separate from uploads outside of VE? Depending on how easy they are to identify, we might not need to do something specific about them.