Page MenuHomePhabricator

project Wikipedia Cultural Diversity Observatory (WCDO) / datasets, visualizations and vital articles lists
Closed, ResolvedPublic

Description

This project will be presented at Hackathon 2018, Barcelona.

What it is:
The Wikipedia Cultural Diversity Observatory (WCDO) is a project to raise awareness on Wikipedia’s current state of cultural diversity, providing datasets, visualizations and statistics, and pointing out solutions to improve intercultural coverage.
Full description: https://meta.wikimedia.org/wiki/Grants:Project/Wikipedia_Cultural_Diversity_Observatory_(WCDO)

Main goal:
WCDO aims to correct this underrepresentation, providing focused data and strategic recommendations for cross-language collaboration so that each language edition will have a truly international representation of the world’s knowledge.

The main goal is to make each language edition much more culturally diverse, and for this, I consider that each of them should have a minimum representation of the other cultures (e.g. 100 articles about every other culture make a total of 30,000 articles, which is not unachievable).

Project's Rosetta Stone:
The project relies on a language - territories mapping for each Wikipedia language edition. This database contains the specific regions or countries where the language is native or official, their Qitem, demonym, ISO 3166, ISO 3166-2, among others.
You can check it here: https://github.com/marcmiquel/WCDO/blob/master/language_territories_mapping/Wikipedia_language_territories_mapping_quality.xlsx

If the information from this database has inconsistencies, the content selection will not work properly.

Please, ping me if there is any error, either here or at my e-mail: marcmiquel@gmail.com

Outcomes:
1- CCC Datasets for each Wikipedia language edition.
2- Lists of Vital CCC articles that every language should have.
3- Data visualizations.
4- Academic Research publication.

Current state of development:
The project is currently at the second-third phase (delivering the data visualizations and the lists of top priority articles).

Check the git-hub with documentation / method code:
https://github.com/marcmiquel/WCDO

Technical issues to pay attention to:
Performance and bottlenecks: https://phabricator.wikimedia.org/T192825
The project is now hosted in a VPS: https://phabricator.wikimedia.org/T189058

Collaborators:
The project is conducted by Marc Miquel (researcher and activist), in charge of planning, developing and communicating research in order to make them useful to the communities.
There are already more than 30 collaborators.

Anyone who wants to use CCC to create visualizations is welcomed. This is the project for anyone who wants to compare Wikipedia language editions in any specific cultural aspect and communicate to Wikimedians.

Presentations:
The project has already been presented at Pre-Hackathon 2018, Wikiindaba (e.g. ).

UPCOMING:
The project will be presented at Hackathon 2018, Barcelona.

Event Timeline

Vvjjkkii renamed this task from project Wikipedia Cultural Diversity Observatory (WCDO) / datasets, visualizations and vital articles lists to vidaaaaaaa.Jul 1 2018, 1:11 AM
Vvjjkkii removed marcmiquel as the assignee of this task.
Vvjjkkii triaged this task as High priority.
Vvjjkkii updated the task description. (Show Details)
Vvjjkkii removed a subscriber: Aklapper.
CommunityTechBot renamed this task from vidaaaaaaa to project Wikipedia Cultural Diversity Observatory (WCDO) / datasets, visualizations and vital articles lists.Jul 2 2018, 4:25 PM
CommunityTechBot assigned this task to marcmiquel.
CommunityTechBot raised the priority of this task from High to Needs Triage.
CommunityTechBot updated the task description. (Show Details)
CommunityTechBot added a subscriber: Aklapper.

Assuming this session took place