Wiki Loves Open Data offers a framework and is the result of a collaboration with the Wikidata team and community, including some chapters and projects.
BUT even if WMF Strategic Partnerships and some chapters are in talks with organizations, we cannot count that as “ongoing projects” yet.
- @SVentura (Strategic Partnerships) started promising talks with World Bank, OECD, and others, but the requirement for CC0 licensing is the main obstacle for quick collaborations.
- The involvement of @Lydia_Pintscher (Wikidata), @Wittylama (Europeana), @Susannaanas (WMFI), @Yair_rand (volunteer developer) and @johl (WMDE) among others has been very valuable and puts this first step in a promising direction inspired by the GLAM precedent.
The problem (previous description)
We are missing a community framework allowing Wikidata content and tech contributors, data engineers, and open data organizations to collaborate effectively on this use case:
- Open data organization has a subset of interesting data that could be used to improve Wikimedia wikis after being added to Wikidata.
What are the specific problems that Wikimedia volunteers and/or interested open data organizations are facing?
- What makes an open data organization? We need a technical definition of "open data" compatible with Wikimedia, useful for organizations to check whether it applies to them or not.
- What types of contributions are welcomed? Open data orgs might want to dump all of their data somewhere. Wikidata might just accepts a few data types. How to know what is possible, what is useful?
- How does the process work? Everybody knows Wikipedia, less know Wikidata, even less know how to contribute to Wikidata, and even less how to show that data in Wikipedia, which frequently is the ultimate goal of the open data owners.
- How does licensing and attribution work? Licenses allowed in Wikidata, how are attributions shown, what happens when others reuse that data.
- How is the data updated? Are there any expectations on open data organizations in updating the data contributed? Also, what happens with changes made by other contributors to that data? How to upstream changes? How to avoid that they are just overridden in a next update?
- How to contribute resources? How to make effective use of resources available, i.e. a team in an open data organization is willing to work on a Wikidata project, there is a possibility to fund a data engineer in residence, to organize workshops and sprints (datathons?)
- What precedents and ongoing projects are there? Especially at the beginning, any previous / ongoing experience is going to be very useful for new open data organizations and Wikimedia contributors willing to get involved.
- How to track all of the relevant conversations about a particular objective over time? Contrary to above, not everyone is familiar with Wikipedia social structures and tools. Use of watchlists, etc. can be taught, tools could be improved.
- How to visualize a data model that spans multiple entity types within the context of wikidata?
- What else?
Out of scope
Use cases that we are NOT pursuing in this quarterly goal:
- Open data organization has a humongous amount of data to be injected entirely to Wikidata.
- Wikidata contributors start compiling a directory with the Sum of all Data.
- Wikidata/Commons contributors start building a Wikimedia version of http://datahub.io/
Wiki Loves Open Data, A basic framework agreed with the Wikidata community and documented, offering a process that addresses the questions asked above. Imagine the GLAM framework applied to data.
GLAM has created documentation, campaigns, tools, success stories, a network of volunteers and cultural organizations, and even some new jobs. Today, a Wikimedian living in a place with an interesting gallery/library/archive/museum (or someone working in any of these institutions) has a framework and a support network to learn how to establish a first contact and organize a first activity. Let's try to build a framework allowing an easy start for open data projects.
This framework needs to be tested and improved with real collaborations with some alpha-testing open data organizations that will need to be very patient and understanding with us. For that we will need experienced Wikidata contributors and consolidated Wikimedia teams able to handle the relationship with these organizations and work (or find the resources to work) on the technical solutions to their problems.
This goal aims to start walking in the right direction until reaching a first milestone that we can be happy about.
- ProteinBoxBot: bots for populating wikidata with trusted biomedical information and for using that information to drive applications such as Wikipedia.
|Open data organization||Wikimedia team mediating||Contact person(s)|
|World University and School||@Scott_WUaS|
- Initial announcement in Wikidata mailing list
- Echo of the announcement in Wikidata project chat
- Planned by the end of July: https://www.wikidata.org/wiki/Wikidata:Wiki_Loves_Open_Data (see T104701)
Measurement of success
- Publication of basic documentation and community processes for open data engineers and organizations willing to contribute to Wikidata.
- Ongoing projects with 1 open data org.
- Wikidata team
- Wikidata community
- Strategic Partnerships team
Relation with WMF Call to Action
EXPERIMENT: support innovation & new knowledge