
General image classifier for commons
Open, Low, Public

Authored By
Halfak
Jan 17 2017, 8:53 PM

Description

What it does: Image classification for Commons uploads.

Wiki things it helps with:

  • Crude image categorization (human/selfie, dog, street, house, car) is easier than fine-grained categorization; the biggest requirement is a training set for the target classes, so unseen categories cannot be predicted (a rough sketch follows this list). T331134
  • Find uncategorized images
  • Find likely unwanted images (copyvio, etc.)
  • Estimate image quality T184739
  • Estimate image creation date (1920s, 2000s) which could be used to verify PD-old claims
  • AI which combines image elements with articles and suggests relevant images (combine text based) T236142
  • Automatically generate image captions and alt text
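
As a concrete illustration of the first bullet, here is a minimal sketch of crude categorization using a pretrained ImageNet classifier (Keras MobileNetV2) and a hand-written mapping from fine-grained ImageNet labels to a few crude categories; the model choice, the mapping, and the file name are assumptions, not part of any agreed design.

```
# Sketch: map a pretrained ImageNet classifier's predictions onto a few crude
# Commons-style categories. MobileNetV2 and the label mapping are assumptions.
import numpy as np
from tensorflow.keras.applications.mobilenet_v2 import (
    MobileNetV2, preprocess_input, decode_predictions)
from tensorflow.keras.preprocessing import image

# Very rough mapping from ImageNet label substrings to crude categories.
CRUDE_CATEGORIES = {
    "dog": ["retriever", "terrier", "spaniel", "poodle", "husky"],
    "car": ["sports_car", "convertible", "minivan", "cab", "jeep"],
    "house": ["castle", "church", "palace", "mobile_home", "barn"],
    "street": ["street_sign", "traffic_light", "zebra_crossing"],
}

model = MobileNetV2(weights="imagenet")

def crude_categories(path, top=5):
    """Return crude categories whose keywords appear in the top ImageNet labels."""
    img = image.load_img(path, target_size=(224, 224))
    x = preprocess_input(np.expand_dims(image.img_to_array(img), axis=0))
    preds = decode_predictions(model.predict(x), top=top)[0]
    labels = {label for (_, label, _) in preds}
    return [cat for cat, keywords in CRUDE_CATEGORIES.items()
            if any(k in label for label in labels for k in keywords)]

print(crude_categories("example.jpg"))  # hypothetical local file
```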

Things that might help us get this AI built (optional):

Related efforts:

How to move forward?

  • Spec out what needs to be done. ("We" here means Commons, WLM, WMF Research, and any other community/entity that should be involved.)
  • WMF Research sets up a collaboration to work with one student on the research component. I estimate this is 8-12 weeks of work, depending on how much detail we want to go into.
  • We work with a few developers to implement the change. This last step can be more work than usual, given that the Commons code base will change due to the Structured Data on Commons project.

See also
https://meta.wikimedia.org/wiki/WikiAI

Event Timeline

Halfak added subscribers: leila, Sadads.
Halfak added a subscriber: ellery.

@ellery & @leila, this is something that came up quite often at the Dev. Summit. Is it something you'd be interested in looking into?

Halfak triaged this task as Low priority. Feb 2 2017, 3:32 PM

@Halfak I'm going to assign this task to myself just so I can find it easily and because it's something I'm interested in working on. Realistically, I can get to this in April or May 2017, once some of the current projects are further off the ground. I'm also cc-ing a few people from the WLM international team, since this will be highly relevant to WLM's work and the international team may put development resources towards it.

I think some first steps are to define what exactly we'd like to classify, and then to tag (or, better, derive tags for) those things we want to classify. When I made my proof of concept, the main thing I ran into was downloading the images (preferably lots of images at a small size, e.g. 300x300px), which took so long that it was unfeasible to classify everything in the uncategorized images; a future step could even be to reclassify lots of already-categorized images into new categories where relevant. A sketch of fetching small thumbnails is below.
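
A minimal sketch of the thumbnail-fetching step, using the MediaWiki API's imageinfo/iiurlwidth parameters to get 300px thumbnails; the file title, output path, and user agent are placeholders.

```
# Sketch: fetch 300px-wide thumbnails for Commons files via the MediaWiki API
# (prop=imageinfo, iiurlwidth=300). Titles and paths are examples.
import requests

API = "https://commons.wikimedia.org/w/api.php"
HEADERS = {"User-Agent": "image-classifier-poc/0.1 (example contact)"}

def thumb_url(title, width=300):
    """Return the thumbnail URL for a File: page, or None if unavailable."""
    params = {
        "action": "query",
        "titles": title,
        "prop": "imageinfo",
        "iiprop": "url",
        "iiurlwidth": width,
        "format": "json",
    }
    data = requests.get(API, params=params, headers=HEADERS).json()
    for page in data["query"]["pages"].values():
        info = page.get("imageinfo")
        if info:
            return info[0].get("thumburl")
    return None

url = thumb_url("File:Example.jpg")
if url:
    with open("Example_300px.jpg", "wb") as f:
        f.write(requests.get(url, headers=HEADERS).content)
```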

Something I thought of as a possibility was to try and train models for different classification tasks and then offer these in the pywikibot framework for others to reuse.
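
A rough sketch of what that reuse could look like: iterate over a maintenance category with pywikibot and feed small thumbnails to a trained model. The classify() function and the category name are hypothetical placeholders; the exact pywikibot signatures should be checked against the installed version.

```
# Sketch: iterate over files in a maintenance category with pywikibot and feed
# their small thumbnails to a classifier. classify() is a hypothetical stand-in
# for whatever trained model is plugged in.
import requests
import pywikibot

def classify(local_path):
    """Hypothetical model call; returns a list of predicted labels."""
    raise NotImplementedError

site = pywikibot.Site("commons", "commons")
category = pywikibot.Category(site, "Category:Media needing categories")  # example

for page in category.members(namespaces=6, total=20):  # namespace 6 = File:
    filepage = pywikibot.FilePage(page)
    url = filepage.get_file_url(url_width=300)  # 300px thumbnail
    local = filepage.title(as_filename=True, with_ns=False)
    with open(local, "wb") as f:
        f.write(requests.get(url).content)
    print(filepage.title(), classify(local))
```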

I'd love to continue on the models (have to find some time of course) but struggled a bit with integrating this into existing structures.

It might be possible to use this to generate tags for something like T125273, after which those tags could be used for other purposes (categorisation, search).

Would it be a good idea to have a discussion somewhere in the next weeks to see what people have in mind for an image classifier?

Some questions that I think need answering:

  • What do we want to classify?: scope of classes and images to classify.
  • In which way do we want to integrate this? E.g. as a separate tool which runs on images to be categorized, or as something more native that adds tags to all images.
  • What are some steps we can take to facilitate the start of this? e.g.: I think it would be good to annotate some images/collect useful categories we could use as classes.

@Basvb I was just about to say this. Can you ping me off-thread so we can set up a time to chat?

Hi! I was recently at a Facebook dev summit and came across this (possibly) useful library that (I believe) Facebook uses for image recognition: https://github.com/facebookresearch/multipathnet.
https://caffe2.ai is another one that was widely mentioned.

They don't talk about this very openly, but it's employed a lot at Facebook. See https://code.facebook.com/posts/457605107772545/under-the-hood-building-accessibility-tools-for-the-visually-impaired-on-facebook/

The message about some of Google's work with Commons images, recently (4-4-2017) re-shared on wikitech-l (originally posted 11-08-2016 on commons-l), might also be relevant: https://www.youtube.com/watch?v=HgWHeT_OwHc&feature=youtu.be&t=2h1m19s and https://cloud.google.com/blog/big-data/2016/05/explore-the-galaxy-of-images-with-cloud-vision-api
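
For reference, a minimal sketch of label detection on a Commons image URL with the Cloud Vision Python client, assuming the google-cloud-vision package and configured credentials; the image URL is an example and this is not a proposed dependency.

```
# Sketch: label detection on a publicly hosted Commons image via the
# Google Cloud Vision API. Assumes google-cloud-vision is installed and
# GOOGLE_APPLICATION_CREDENTIALS is set; the URL is an example.
from google.cloud import vision

client = vision.ImageAnnotatorClient()
image = vision.Image()
image.source.image_uri = (
    "https://upload.wikimedia.org/wikipedia/commons/a/a9/Example.jpg")

response = client.label_detection(image=image)
for label in response.label_annotations:
    print(f"{label.description}: {label.score:.2f}")
```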

I'm very interested in seeing some of the work you're doing. I've personally invested some time last year to detect monument signs using opencv's templateMatch algorithms, but I haven't followed up recently on this.


Hi Strainu, whose work are you referring to? If there have been any recent developments on this, I'm very interested to hear about them. My own work hasn't continued since last year and is not much more than downloading two sets of images and following the https://www.tensorflow.org/tutorials/image_retraining tutorial. If we can think of some small task (one that doesn't require going over all images or thousands of categories), we can maybe try to get something to work; think along the lines of predicting portraits/selfies or COM:PENIS pictures for easier clean-up. Another option is the complete/large approach (detecting and labeling the objects depicted in every image on Commons), but for that some outside help would be a good idea.
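
A condensed sketch of that kind of retraining for a small binary task (e.g. portrait/selfie vs. other), using Keras transfer learning rather than the exact tutorial script; the directory layout, model choice, and hyperparameters are assumptions.

```
# Sketch: retrain a pretrained MobileNetV2 head for a binary task such as
# portrait vs. other. Expects data/train/<class_name>/*.jpg; all names and
# hyperparameters are illustrative.
import tensorflow as tf

train_ds = tf.keras.utils.image_dataset_from_directory(
    "data/train", image_size=(224, 224), batch_size=32)

base = tf.keras.applications.MobileNetV2(
    input_shape=(224, 224, 3), include_top=False, weights="imagenet")
base.trainable = False  # keep the pretrained features frozen

model = tf.keras.Sequential([
    tf.keras.layers.Rescaling(1.0 / 127.5, offset=-1),  # MobileNetV2 preprocessing
    base,
    tf.keras.layers.GlobalAveragePooling2D(),
    tf.keras.layers.Dense(1, activation="sigmoid"),
])
model.compile(optimizer="adam",
              loss="binary_crossentropy",
              metrics=["accuracy"])
model.fit(train_ds, epochs=5)
model.save("portrait_classifier.keras")
```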


The good thing about images is that they don't differ much across countries, so any work here would be great to see. I would be especially interested in seeing something done with the Panoramio images; many of those are still uncategorised and unprocessed.

An interesting test project could be the detection of images which need to be rotated. For this task, getting (endless) training data is trivial: we can assume 99.9% of images have the correct orientation, so we can take existing images as the positive class and rotate an equal number of images to use as the negative class.
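
A minimal sketch of that data-generation trick, assuming Pillow and a directory of already-downloaded thumbnails; paths and rotation angles are arbitrary choices.

```
# Sketch: build a rotation-detection training set from existing thumbnails.
# Originals are treated as the "upright" class; rotated copies form the
# "rotated" class. Directory names are examples.
import random
from pathlib import Path
from PIL import Image

SRC = Path("thumbs")            # downloaded (assumed upright) thumbnails
UPRIGHT = Path("dataset/upright")
ROTATED = Path("dataset/rotated")
UPRIGHT.mkdir(parents=True, exist_ok=True)
ROTATED.mkdir(parents=True, exist_ok=True)

for i, path in enumerate(sorted(SRC.glob("*.jpg"))):
    img = Image.open(path).convert("RGB")
    if i % 2 == 0:
        # keep half of the images as-is for the positive ("upright") class
        img.save(UPRIGHT / path.name)
    else:
        # rotate the other half by 90/180/270 degrees for the negative class
        angle = random.choice([90, 180, 270])
        img.rotate(angle, expand=True).save(ROTATED / path.name)
```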

The number of images rotated per day (currently done by SteinsplitterBot) seems to fall within 10-50. Some of these images have had an incorrect orientation for months or years; others are marked directly by their uploader.

Relevant work: http://www.cs.toronto.edu/~guerzhoy/oriviz/crv17.pdf

Any progress so far? It's been more than a year since it was proposed.

This task looks super interesting. Can you share the link to the project codebase and documentation?

leila removed leila as the assignee of this task. Mar 18 2020, 11:40 PM
leila removed a subscriber: ellery.

@Miriam can you look into who would be the best person to own this task? (I'm thinking that person is you, but I'm cautious about assigning it to you. ;)

This is probably somewhat off topic: instead of manually fine-tuning what goes into "depicts" (see Special:SuggestedTags), wouldn't it be more interesting to make this queryable by adding the suggested tags to a new property on a large scale?
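
For context, a heavily hedged, untested sketch of adding such statements through the Wikibase wbcreateclaim API via pywikibot. It uses P180 (depicts) and Q144 (dog) purely as placeholders, whereas the comment above suggests a separate new property for machine-generated tags; the pywikibot calls should be verified against the current release.

```
# Sketch (untested): add a structured-data statement to a Commons file with
# the Wikibase wbcreateclaim API via pywikibot. P180 (depicts) and Q144 (dog)
# are placeholders; the comment above proposes a new property for
# machine-generated tags instead.
import json
import pywikibot

site = pywikibot.Site("commons", "commons")
site.login()

filepage = pywikibot.FilePage(site, "File:Example.jpg")  # example title
mediainfo_id = f"M{filepage.pageid}"                     # MediaInfo entity id

request = site.simple_request(
    action="wbcreateclaim",
    entity=mediainfo_id,
    property="P180",          # "depicts", used here only for illustration
    snaktype="value",
    value=json.dumps({"entity-type": "item", "numeric-id": 144}),  # Q144 = dog
    token=site.tokens["csrf"],
)
request.submit()
```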

Miriam claimed this task.