Gamified Knowledge Base Completion Plugin for Wikibase/Wikidata
Open, Needs Triage, Public

Description

Brief summary

Open data collections such as Wikidata are created and maintained by volunteers and thrive on community knowledge. Missing or new information has to be added by community members; otherwise the knowledge base will stagnate and become obsolete over time. Hence, completing a knowledge base is a crucial task within any community-driven initiative. Consequently, helping people to integrate their knowledge into the knowledge base will benefit the growth, correctness, and topicality of Wikidata.

In our earlier research [1] we showed that it is possible to find outliers in graph-based knowledge bases, which need to be checked by experts to ensure data quality. Our recent implementation of a tool called Wikidatacomplete (cf. [2]) shows how facts are extracted from text and offered to users for validation. After this step, the validated fact is pushed to Wikidata for later integration. Hence, identifying where Wikidata needs completion is possible and has already proven its value.

We propose to implement a Wikibase plugin dedicated to facilitating the Wikidata completion process. While the user navigates through Wikidata, the plugin will show facts extracted from textual sources as well as other knowledge bases (e.g., Wikipedia) that need to be validated. In other words, the plugin shows users suggestions of facts that should be added or changed within the Wikidata knowledge base. The suggestions will be computed using previously developed services.
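
As a rough illustration only (not part of the existing services), the core of such a suggestion step could be a filter that keeps the candidate facts an entity does not yet state. The sketch below assumes the entity is given in the Wikibase JSON layout (claims keyed by property ID); the function name findMissingFacts and the shape of the candidate list are hypothetical, and only item-valued statements are compared.

```javascript
// Hypothetical helper: returns the candidate facts that the entity does not yet state.
// `entity` follows the Wikibase JSON layout (claims keyed by property ID).
// The shape of `candidates` ({ property, value }) is an assumption for this sketch.
function findMissingFacts(entity, candidates) {
  const claims = entity.claims || {};
  return candidates.filter(({ property, value }) => {
    const statements = claims[property] || [];
    // A candidate counts as present if any statement's main value is the same item ID.
    const present = statements.some((statement) => {
      const datavalue = statement.mainsnak && statement.mainsnak.datavalue;
      return Boolean(datavalue && datavalue.value && datavalue.value.id === value);
    });
    return !present;
  });
}

// Hand-written entity fragment; the IDs are used for illustration only.
const entity = {
  id: "Q42",
  claims: {
    P31: [
      {
        mainsnak: {
          snaktype: "value",
          property: "P31",
          datavalue: {
            type: "wikibase-entityid",
            value: { "entity-type": "item", id: "Q5" },
          },
        },
      },
    ],
  },
};

const candidates = [
  { property: "P31", value: "Q5" }, // already stated in the fragment
  { property: "P106", value: "Q36180" }, // not stated in the fragment
];

const missing = findMissingFacts(entity, candidates);
console.log(missing);
```

In a real plugin the candidates would come from the extraction services and each suggestion would still be validated by the user before anything is written to Wikidata.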

Additionally, a badge Web service interface needs to be integrated so that users can embed their badges into their social network profiles (e.g., Wikidata user page, GitHub profile, LinkedIn profile) to show their dedication and motivate other users to contribute, too. A rule-based system for earning badges needs to be implemented.
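
A minimal sketch of what a rule-based badge system might look like is given below. Everything in it is an assumption made for illustration: the badge names, the thresholds, and the fields of the stats object are hypothetical and not taken from an existing service.

```javascript
// Hypothetical badge rules: each rule names a badge and a predicate over user statistics.
// Badge names, thresholds, and the `stats` fields are assumptions for this sketch.
const BADGE_RULES = [
  { badge: "First Validation", earned: (stats) => stats.validatedFacts >= 1 },
  { badge: "Fact Checker", earned: (stats) => stats.validatedFacts >= 50 },
  { badge: "Steady Contributor", earned: (stats) => stats.activeDays >= 7 },
];

// Returns the names of all badges whose rule is satisfied by the given statistics.
function earnedBadges(stats, rules = BADGE_RULES) {
  return rules.filter((rule) => rule.earned(stats)).map((rule) => rule.badge);
}

const badges = earnedBadges({ validatedFacts: 12, activeDays: 9 });
console.log(badges);
```

Keeping the rules as plain data plus a predicate means new badges can be added without touching the evaluation function.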

Skills required

  • JavaScript
  • basic knowledge of PHP might be useful

Mentors

@DD063520 @AnBo-de @Gabinguo @Aleksandr.perevalov

Microtasks

  • Get familiar with data structures available in Wikidata
  • Select 3 Wikidata entities and manually find missing facts based on external data sources
  • Understand the Wikidatacomplete UI and APIs
  • Activate the Recoin Wikibase plugin in your Wikibase account to see a similar suggestion mechanism
  • Set up the MediaWiki development environment
  • Understand how a Wikibase plugin provides suggestions: install the Recoin Wikibase plugin in your development environment and analyze the source code to learn how a plugin works

[1] Didier Cherix, Ricardo Usbeck, Andreas Both, and Jens Lehmann (2014). Lessons learned—the case of crocus: Cluster-based ontology data cleansing. In European Semantic Web Conference (pp. 14-24). Springer, Cham.
[2] Bernhard Kratzwald, Guo Kunpeng, Stefan Feuerriegel, and Dennis Diefenbach. IntKB: A Verifiable Interactive Framework for Knowledge Base Completion. International Conference on Computational Linguistics (COLING), 2020

Remark: A long version of the project description is available here.

Event Timeline

This would be very useful imho. We have a similar tool, the Primary Sources Tool (https://www.wikidata.org/wiki/Wikidata:Primary_sources_tool). It is unfortunately not in a very good state at the moment. Maybe it's worth looking at reviving that.

Gopavasanth added a subscriber: Gopavasanth.

@AnBo-de Thanks for bringing this project here!

I'm sure you are aware of the GSoC timeline: https://developers.google.com/open-source/gsoc/timeline and the mentor guidelines: https://www.mediawiki.org/wiki/Google_Summer_of_Code/Mentors

Also, if your project is ready to be featured, could you list it here as soon as possible, following the format in https://www.mediawiki.org/wiki/Google_Summer_of_Code/2020/Ideas_for_projects? Thanks :)

@Gopavasanth

Hi!

Yes, we are aware of the timeline and the mentor guidelines ...

I added the project here: https://www.mediawiki.org/wiki/Google_Summer_of_Code/2021#Ideas_for_projects

Can you check if everything is ok?

Salut
D063520

I am facing some issues with the setup. What should I do?

@Bavisettinarayan hi, I'm not totally sure which installation guide you are following. But I would recommend that you use this docker container:
https://github.com/wmde/wikibase-docker
I set it up multiple times myself. The instructions are here:
https://github.com/wmde/wikibase-docker/blob/master/README-compose.md
Once you are done you should have a full MediaWiki + Wikibase environment.
Next step should be to install this extension:
https://www.mediawiki.org/wiki/Extension:Gadgets
Hope it helps!
Salut
D063520

I have also reviewed the project, idea is super cool and it fits the scope of the GSoC program.

Thank you for verifying the project scope @srishakatux !

@DD063520 Thanks for adding your project here. Looks good :-)

no, this is the github repo for setting up mediawiki + wikibase ... so basically the infrastructure you need to develop the plugin .....

@DD063520 Hi, while getting the dump for extensions I'm getting this and then nothing happens.


This is after setting up my environment, following the instructions from here:
https://github.com/wmde/wikibase-docker/blob/master/README-compose.md

Hi, this looks normal, the wikibase is empty, when you set it up it is a fresh empty install ....
But besides that, what would you like to do?

Hey @DD063520, thanks for replying. I want to get the extensions folder described in https://www.mediawiki.org/wiki/Extension:Gadgets, so do I need to create a fresh directory for extensions? I cannot see any existing extensions directory.

so .... I think the documentation here:
https://www.mediawiki.org/wiki/Extension:Gadgets
assumes a plain MediaWiki installation. But you have multiple containers, so you have to go into the one where the MediaWiki instance is running:

docker ps

then you get a list, and one entry should look like:

88d6f8e101d3 wikibase/wikibase:1.35-bundle "/bin/bash /entrypoi…" 6 weeks ago Up 5 days 0.0.0.0:8383->80/tcp euknowledgegraph_wikibase-135_1

you can go into it, i.e. have a bash attached to the machine by:

sudo docker exec -i -t 88d6f8e101d3 /bin/bash

there you should also find your extensions folder ...

have fun ; )
D063520

@D063520 Sorry for the late reply! And thank you so much, it worked! :)

@D063520 I have installed the extension. Is it correct?

looks good! but I must admit that I never did it ; ), installed many other extensions though ; )

One question: where do we have to report that the microtasks given to us are completed?

@srishakatux As you wrote most of the information at https://www.mediawiki.org/wiki/Google_Summer_of_Code/Mentors I would like to ask you: Is there a statement defining where the students should document their progress and completion of the given microtasks?



Since this morning the Wikibase docker containers have stopped working. I uninstalled Docker Desktop for Windows twice and reinstalled the compose images and also the docker image from the website. When I try to open port 8181, I get "localhost refused to connect". I also tried to run the image on my second PC and still faced the same issue. I'm using Windows 10 with WSL2 running Ubuntu 20.04 LTS.
Can anyone help me fix this issue?
Update: It was fixed by taking a new image from the MediaWiki docker website.