Story idea for Blog: Wikipedia Image Captioning Competition
Closed, Resolved · Public

Description

Please provide the following information.

  • Provide a short summary of your proposed post for the Wikimedia Technical Blog. Blog readers will see this as the preview to your post:

We would like to publish a post announcing the launch of a Kaggle competition. Participants will be given a large dataset of open Wikipedia images and text, and will be asked to build systems that, given an image, retrieve the piece of text closest to that image. As part of the launch, we are also releasing (at least) 6M image files and corresponding visual features, so part of the post will cover this data release and how it empowers scientific and developer communities.

Tech community events and outreach programs

  • Which audience or audiences do you think your post is appropriate for?:

Volunteer developers, researchers, data scientists

  • Will you need assistance with writing your blog post, or do you already have a draft? If you have a draft, please provide a link here:

I don't have a draft yet, but I can make one by next week.

  • Does your post need to be published by a certain date?

The launch of the competition will be between the 5th and the 10th of September, so ideally we would time the blog post release around those dates.

https://upload.wikimedia.org/wikipedia/commons/f/f1/Wikipedia20_Knowledge.svg
(this will be the image in the competition header)

  • Do you have any other questions or comments?

Once your request is received, a technical blog admin will review it and reach out to you through Phabricator.

Event Timeline

@Miriam this sounds like a great post! Let me know when the draft is ready. Afterward, I can take a pass at editing it and post it to the blog.

@Miriam Touching base to see how you are doing with this. Let me know!

Hi @srodlund! Yes, just finished today. You can find the first draft of the blog post here: https://docs.google.com/document/d/18TSGax5Xwo3mgDeCs5XliMFZDM6rezfLRvB2yykf6iU/edit
Feel free to add comments and suggestions! Thanks a lot!

@Miriam Great! I will take an editing pass at this tomorrow and will provide suggestions!

@Miriam this is a really cool and exciting post! I have done a first pass and have provided some suggested edits and comments. Can you review these and accept or decline them? Afterward, I might suggest a couple of organizational changes (possibly adding a couple more headers), but I want to look at it with the changes in place first.

@srodlund thank you so much for your pass and for the detailed comments! You are the best :) I accepted most of your suggestions and responded to the comments.

@Miriam Perfect! I will move this over to the blog for formatting.

It should be ready to publish on Monday, 30 August. Is it okay if I publish it then, or do you want to wait a couple of days to have it closer to the contest opening? I'll be leaving on 3 Sept for the US Labor Day holiday weekend, so we'll want to publish before then at the latest.

Hi Sarah, the competition is launching on September 9th. Would it be possible to wait until then for publication? Thanks again!

Yes! I'll schedule it for then!

@Miriam This has been posted! Will you take a look and let me know if you think anything needs to be changed?

https://techblog.wikimedia.org/2021/09/09/the-wikipedia-image-caption-matching-challenge-and-a-huge-release-of-image-data-for-research/

Once you have had a chance to review it, I'll share the post on Twitter.

Just a note ~ I updated and republished this post. Please take a look and let me know if there should be additional changes.