User Details
- User Since
- Aug 21 2017, 4:16 PM (327 w, 1 d)
- Availability
- Available
- IRC Nick
- cormacparle
- LDAP User
- Cparle
- MediaWiki User
- CParle (WMF)
Yesterday
Mon, Nov 27
I suspect this might be because we generate our suggestions once a week, based partially on data that is itself only updated once a week.
Thu, Nov 23
Wed, Nov 22
Tue, Nov 21
Fri, Nov 10
@Sneha, @matthiasmullie and I have been talking about the checkboxes. If I click the "This is AI generated media" checkbox
Thu, Nov 9
Not really sure about either of these; let us talk to people doing similar image-analysis work and get back to you ...
This actually seems like it might be something to do with SQLAlchemy and MySQL recycling connections differently: https://stackoverflow.com/questions/29755228/sqlalchemy-mysql-lost-connection-to-mysql-server-during-query
Wed, Nov 8
Looks like it's this commit in mediawiki-core that causes the problem: d0a96db0f94fafa65ff693288d4ad6bf034378a3. @Krinkle, any ideas what might be going awry?
Note that the "mediainfoview does not exist" error is still present in the console on Commons after rolling back the train, even though the bug described by this ticket is gone.
- Initial access for a one-time study is really all we need for now, and if the data were ready for us to begin work on in early January, that'd be perfect. If we succeed in creating a model of likelihood-of-deletion that has some predictive accuracy, we can worry about keeping the model up-to-date with more recent data later on.
Tue, Nov 7
Oct 28 2023
Oct 27 2023
Oct 26 2023
The spacing below the separator line should be the same as above the line.
@Etonkovidova @Sneha there is now (since our changes) no default for regular UW, but campaigns can still set default licenses
See also T340908
Oct 25 2023
Thanks @Jdlrobson ... does that mean we can close this?
Oct 24 2023
Oct 23 2023
Oct 20 2023
Ok so I've been through the analysis, and tbh it's not always clear when something is a copyvio. Even when you have the text "copyright" in a DR it doesn't necessarily mean that the file is being deleted because of copyright issues - sometimes all of a user's uploads will get deleted and copyright will be mentioned in the DR along with COM:SCOPE, so we don't know that individual images are copyvios.
Oct 19 2023
@hashar or @jijiki, any idea who might be able to help solve this? The Flickr Foundation are working on a tool to copy freely licensed images from Flickr to Commons (called Flickypedia) and wanted to test against beta.
I'm using the following to match copyright violations:
opening_reason LIKE '%copyright violation%'
OR opening_reason LIKE '%copyvio%'
OR opening_reason LIKE '%logo%'
OR opening_reason LIKE '%no license%'
OR opening_reason LIKE '%no permission%'
OR opening_reason LIKE '%Commons:Copyright_rules_by_subject_matter%'
OR opening_reason LIKE '%Commons:Licensing%'
OR opening_reason LIKE '%COM:BOOK%'
OR opening_reason LIKE '%COM:CSD#F1%'
OR opening_reason LIKE '%COM:CSD#F2%'
OR opening_reason LIKE '%COM:CSD#F3%'
OR opening_reason LIKE '%COM:CSD#F4%'
OR opening_reason LIKE '%COM:CSD#F5%'
OR opening_reason LIKE '%COM:CSD#F6%'
OR opening_reason LIKE '%COM:DW%'
OR opening_reason LIKE '%COM:EI%'
OR opening_reason LIKE '%COM:FAIRUSE%'
OR opening_reason LIKE '%COM:L%'
OR opening_reason LIKE '%COM:PERMISSION%'
OR opening_reason LIKE '%COM:NETCOPYVIO%'
OR opening_reason LIKE '%COM:PCP%'
OR opening_reason LIKE '%COM:POSTER%'
OR opening_reason LIKE '%COM:TOO%'
OR opening_reason LIKE '%COM:TOYS%'
OR closing_reason LIKE '%copyright violation%'
OR closing_reason LIKE '%copyvio%'
OR closing_reason LIKE '%logo%'
OR closing_reason LIKE '%no license%'
OR closing_reason LIKE '%no permission%'
OR closing_reason LIKE '%Commons:Copyright_rules_by_subject_matter%'
OR closing_reason LIKE '%Commons:Licensing%'
OR closing_reason LIKE '%COM:BOOK%'
OR closing_reason LIKE '%COM:CSD#F1%'
OR closing_reason LIKE '%COM:CSD#F2%'
OR closing_reason LIKE '%COM:CSD#F3%'
OR closing_reason LIKE '%COM:CSD#F4%'
OR closing_reason LIKE '%COM:CSD#F5%'
OR closing_reason LIKE '%COM:CSD#F6%'
OR closing_reason LIKE '%COM:DW%'
OR closing_reason LIKE '%COM:EI%'
OR closing_reason LIKE '%COM:FAIRUSE%'
OR closing_reason LIKE '%COM:L%'
OR closing_reason LIKE '%COM:PERMISSION%'
OR closing_reason LIKE '%COM:NETCOPYVIO%'
OR closing_reason LIKE '%COM:PCP%'
OR closing_reason LIKE '%COM:POSTER%'
OR closing_reason LIKE '%COM:TOO%'
OR closing_reason LIKE '%COM:TOYS%'
@mfossati does this seem reasonable?
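A filter this long is easy to get out of sync between the two columns, so one option is to generate it from a single pattern list. The sketch below is only an illustration: the table name `deletion_requests`, the helper names, and the use of SQLite are all assumptions (the original query's engine isn't stated, and SQLite's LIKE is case-insensitive for ASCII, which may differ from the engine actually used).

```python
import sqlite3

# Substrings taken from the query above; each is wrapped in %...% for LIKE.
COPYVIO_PATTERNS = [
    "copyright violation", "copyvio", "logo", "no license", "no permission",
    "Commons:Copyright_rules_by_subject_matter", "Commons:Licensing",
    "COM:BOOK",
    "COM:CSD#F1", "COM:CSD#F2", "COM:CSD#F3",
    "COM:CSD#F4", "COM:CSD#F5", "COM:CSD#F6",
    "COM:DW", "COM:EI", "COM:FAIRUSE", "COM:L", "COM:PERMISSION",
    "COM:NETCOPYVIO", "COM:PCP", "COM:POSTER", "COM:TOO", "COM:TOYS",
]
REASON_COLUMNS = ["opening_reason", "closing_reason"]


def build_copyvio_filter(columns=REASON_COLUMNS, patterns=COPYVIO_PATTERNS):
    """Return (where_sql, params) with one `column LIKE ?` per column/pattern pair."""
    clauses = [f"{col} LIKE ?" for col in columns for _ in patterns]
    params = [f"%{p}%" for _ in columns for p in patterns]
    return " OR ".join(clauses), params


def matching_ids(rows):
    """Load (id, opening_reason, closing_reason) rows into a throwaway
    in-memory table (hypothetical schema) and return the ids that match."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE deletion_requests "
        "(id INTEGER, opening_reason TEXT, closing_reason TEXT)"
    )
    conn.executemany("INSERT INTO deletion_requests VALUES (?, ?, ?)", rows)
    where, params = build_copyvio_filter()
    sql = f"SELECT id FROM deletion_requests WHERE {where} ORDER BY id"
    ids = [row[0] for row in conn.execute(sql, params)]
    conn.close()
    return ids
```

Because these are plain substring matches, '%COM:L%' also matches any longer shortcut that starts with COM:L, and '%logo%' matches words that merely contain "logo", so it may be worth spot-checking a sample of the matched requests.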
Oct 17 2023
Currently the explanations get shown only for the default licence, see the screenshot
Oct 16 2023
@Sneha some questions on the text displayed to the user here
Oct 13 2023
Oct 11 2023
Oct 6 2023
Oct 5 2023
I did some preliminary work on this while finishing up T344060. Results below. Note this is only for filenames; it's a bit more work to extract descriptions.
Oct 3 2023
Conclusion
- between Jan 1 2021 and Sept 1 2023 there were 1394606 unique filenames deleted
- ~22k of these had GPS metadata
- these can be clustered into ~500 clusters with epsilon = 1km
- only 335 of the clustered deleted files had "freedom of panorama" or "FoP" in their deletion request text
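The clustering step above can be sketched roughly as follows. The comment doesn't name the algorithm; "epsilon" is the usual name for the neighbourhood radius in DBSCAN, so this sketch assumes DBSCAN over great-circle (haversine) distances. The `min_samples` value, the function names, and any coordinates passed in are illustrative assumptions, not the values used in the analysis.

```python
import math

EARTH_RADIUS_KM = 6371.0


def haversine_km(a, b):
    """Great-circle distance in km between two (lat, lon) pairs in degrees."""
    lat1, lon1 = map(math.radians, a)
    lat2, lon2 = map(math.radians, b)
    dlat, dlon = lat2 - lat1, lon2 - lon1
    h = (math.sin(dlat / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin(dlon / 2) ** 2)
    # Clamp to guard against rounding slightly above 1.0.
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(min(1.0, h)))


def dbscan(points, eps_km=1.0, min_samples=5):
    """Label each point with a cluster id (0, 1, ...) or -1 for noise.

    Brute-force O(n^2) neighbour search: fine for a demo, slow for
    tens of thousands of points.
    """
    UNVISITED, NOISE = None, -1
    labels = [UNVISITED] * len(points)

    def neighbours(i):
        # Includes point i itself, since its distance to itself is 0.
        return [j for j, q in enumerate(points)
                if haversine_km(points[i], q) <= eps_km]

    cluster = -1
    for i in range(len(points)):
        if labels[i] is not UNVISITED:
            continue
        seeds = neighbours(i)
        if len(seeds) < min_samples:
            labels[i] = NOISE  # may be relabelled later as a border point
            continue
        cluster += 1
        labels[i] = cluster
        queue = [j for j in seeds if j != i]
        while queue:
            j = queue.pop()
            if labels[j] == NOISE:
                labels[j] = cluster  # border point: joins but does not expand
            if labels[j] is not UNVISITED:
                continue
            labels[j] = cluster
            reachable = neighbours(j)
            if len(reachable) >= min_samples:
                queue.extend(reachable)  # core point: keep expanding
    return labels
```

Calling `dbscan(coords, eps_km=1.0)` on a list of (latitude, longitude) pairs returns one label per file; counting the distinct non-negative labels gives the number of clusters. For the ~22k files mentioned above, a library implementation with a spatial index would likely be more practical than this brute-force version.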
Oct 2 2023
Some extra incidental data about deletions:
Sep 27 2023
FWIW the first seems more natural to me too.
Sep 18 2023
We're disabling MachineVision on commons on Wed, and uninstalling it not long after, so I'm not sure there's a good reason to spend time doing this work
FYI this will be resolved by https://phabricator.wikimedia.org/T340540 when it's done
Preliminary findings from the spike (building on T340546)
Planning to remove it completely from commons @Amire80 - afaik that's the only place it's enabled. We're going to start off with a notice that it's disabled, then a while later actually undeploy it
Sep 14 2023
Because of @Pigsonthewing’s reservations about the WMF asking the community to evaluate depicts annotations, I took a random sample of depicts annotations added in 2023 via Special:SuggestedTags and evaluated them myself. Here are the results: