Fri, Jan 17
Thu, Jan 16
Wed, Jan 15
Mon, Jan 13
Weekly update: paper was accepted with good reviews !! there is some work to do to respond to reviewers' comments, which we will do in the next month.
Weekly update: sent invitation emails to a few speakers and upcoming submission deadline reminder.
- downloaded the list of coco-stuff classes which include highly generic categories of people, animals, and things which exist in the visual world: https://github.com/nightrome/cocostuff
- downloaded the list of categories in Commons, with the counts of the number of images per categories.
- to create the initial seed of categories we want to consider for object categorization in Commons, I computed fasttext vectors on both COCO categories and Commons Categories, and I am checking what are the commons categories that we can use to represent COCO categories.
Weekly update: progress as expected. I had to double check the output of the citation needed models as there were some inconsistencies, which were solved by looking at some of the data. There is need for a text pre-processing pipeline which is close to the one we used for training, otherwise the model might give some unexpected output.
Weekly Update: none. Team work is on hold until ICWSM deadline.
Weekly update: project proposal is ready. Investigating possibilities with Rutgers' university. We might decide to merge collaboration with Rutgers' within the Visual Knowledge Gaps project (T242603). In that case, I will reach out to further contacts starting last week of Jan .
Wed, Jan 8
Thu, Jan 2
Visualization available at http://wikiview.net
Done, presentation "Machine Vision in Social Computing" available at: https://docs.google.com/presentation/d/1O1Ni40tovcFIh_Btj3qpWFh5Pz8j5Bh_U0u5rqfTbW0/edit?usp=sharing
The output of the first stage of this research, as well as the opportunities arisen for collaborations and internships, led to the decision of having an intermediate step for this task.
In collaboration with @AikoChou, an Outreachy intern, we will have a system that can release periodic data dumps exposing sentences in Wikipedia needing citations. This task is tracked at: T233707.
I'll leave this task open until we hear back from the Web Conference
When started this sub-task as part of our efforts on citation usage studies, we reealised that this is a huge effort that will be spun off to a completely new project
Dec 11 2019
@leila @Isaac does this look good? https://commons.wikimedia.org/wiki/File:Wiki_Workshop_2020_banner.png
Dec 2 2019
Update after changing learning rate and modifying few parameters in the training:
- we reach 91% accuracy on the validation set (2% improvement compared to previous model) and a significant loss reduction.
- Manual validation on an external dataset shows improvements on overfitting (80% of new photos and 100% of new graphics are correctly recognized)
- Problems persist for near-abstract photos and for extremely low-res photos
- All this with an training time of 18-minutes only for a dataset of 150k images!
Nov 29 2019
Nov 28 2019
Hi! Can I have access too please? My username is mirrys
Nov 7 2019
Finally I managed to do some progress. I built a simple model for testing purposes.
Oct 22 2019
Proposal submitted on September 30th: https://github.com/OlehOnyshchak/WikiImageRecommendation/blob/master/Papers/Project_Proposal/paper.pdf
Oct 21 2019
Oct 15 2019
Oct 7 2019
Oct 3 2019
@Achillesheel02 thanks for the feedback and yes, please send me to my email.
Hi @Lucideuclid - many thanks for your interest in this project :)
Sep 27 2019
Sep 24 2019
Sep 18 2019
Aug 21 2019
Examples of issues of Google Vision API, related to the fact that their models are probably trained on unevenly distributed data: https://docs.google.com/presentation/d/1qfD3q9Ij79_luAKXNdVvZoPFsUvFQKcvIGmIvWC19z0/edit?usp=sharing
Aug 14 2019
Aug 13 2019
Aug 7 2019
So Rachel said we are welcome to join the open mic session at the beginning of the hackathon. Each speaker has 1-1.5 minutes to pitch projects " that newcomers can participate in and/or needs significant collaboration from other attendees".
She suggested that, once we have the mini-projects ready, we share them on phab : https://phabricator.wikimedia.org/project/view/3922 and also on the Wikimania Hackathon telegram channel.
Aug 6 2019
@leila mini-projects for hackathon can be added to the phab workboard for the Hackathon: https://phabricator.wikimedia.org/project/view/3922/. Checking now with Rachel about the pitches.