Wikicurricula is a visualization tool that shows metrics about Wikipedia articles that are relevant to a country's national school curriculum.
The visualization was developed by Wikimedia Italy's project [[https://meta.wikimedia.org/wiki/Wikipedia_e_scuola_italiana | Wikipedia e Scuola italiana]]. It consists of a static website that visually represents the data stored in a TSV file, plus a Python script that generates this TSV file by calling several APIs.
[[https://wikicurricula.wikimedistas.uy|The implementation of this tool in Uruguay]] additionally takes advantage of [[https://www.wikidata.org/wiki/Wikidata:Wikidata_for_Education | Wikidata for Education]], Wikimedia's initiative to digitize curriculum data. Currently, this process is done manually: we run a [[https://w.wiki/7P$C|query at Wikidata Query Service]] to obtain the curriculum topics, download the data, clean it, and run the Python script.
**The main goal of this project is to use this visualization tool as an interactive and engaging way to visualize curriculum structured data on Wikidata.**
For this, we need to:
1. Create a boilerplate project that is easily reusable for new countries and languages. Document the process of setting it up.
2. Make the integration between Wikidata and the visualization tool more seamless.
3. Improve the user interface.
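
As a starting point for item 2, the manual "query, download, clean" steps could be scripted roughly as below. This is only a sketch under stated assumptions: the function names, the variable names `topic`, `topicLabel` and `article`, and the sample rows are hypothetical and not taken from the existing bot; the real curriculum query (linked under Useful links) would be passed in as text. The endpoint is the public Wikidata Query Service SPARQL endpoint, and the response shape is the standard SPARQL JSON results format.

```python
import csv
import io
import json
import urllib.parse
import urllib.request

WDQS_ENDPOINT = "https://query.wikidata.org/sparql"


def fetch_sparql_json(query: str, user_agent: str) -> dict:
    """Run a SPARQL query against the Wikidata Query Service and return parsed JSON."""
    params = urllib.parse.urlencode({"query": query, "format": "json"})
    request = urllib.request.Request(
        WDQS_ENDPOINT + "?" + params,
        headers={"User-Agent": user_agent},  # a descriptive User-Agent is expected
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)


def bindings_to_tsv(results: dict) -> str:
    """Flatten SPARQL JSON results into TSV text, one row per result binding."""
    columns = results["head"]["vars"]
    buffer = io.StringIO()
    writer = csv.writer(buffer, delimiter="\t", lineterminator="\n")
    writer.writerow(columns)
    for binding in results["results"]["bindings"]:
        # Unbound OPTIONAL variables are absent from the binding: write "" instead.
        # Stripping whitespace stands in for the manual cleaning step (assumption).
        writer.writerow(
            [binding.get(col, {}).get("value", "").strip() for col in columns]
        )
    return buffer.getvalue()


# Hand-written sample in SPARQL JSON results format; placeholder IDs, not real data.
SAMPLE = {
    "head": {"vars": ["topic", "topicLabel", "article"]},
    "results": {"bindings": [
        {"topic": {"type": "uri", "value": "http://www.wikidata.org/entity/Q_EXAMPLE_1"},
         "topicLabel": {"type": "literal", "value": " Example topic "},
         "article": {"type": "uri", "value": "https://es.wikipedia.org/wiki/Ejemplo"}},
        {"topic": {"type": "uri", "value": "http://www.wikidata.org/entity/Q_EXAMPLE_2"},
         "topicLabel": {"type": "literal", "value": "Topic without article"}},
    ]},
}

if __name__ == "__main__":
    print(bindings_to_tsv(SAMPLE))
```

In a real run, `bindings_to_tsv(fetch_sparql_json(query_text, user_agent))` would replace the manual download, and the resulting TSV could then be handed to the data-gathering script. Whether the script expects exactly these columns is something to check against the boilerplate.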
**Useful links**
//Boilerplate//
* The repo we'll be working on: https://github.com/wikicurricula-uy/wikicurricula-boilerplate
//Italy - Wikipedia e Scuola italiana//
* Visualization: https://itwiki-scuola-italiana.toolforge.org/
* Codebase: https://github.com/giovannipro/giovannipro.github.io/tree/master/wikipedia-scuola-italiana
* Data gathering script: https://github.com/wikicurricula-uy/curriculaBot/blob/d90de51751d7e5e83f12f9a7fa7c1f1660483aa3/bot.py
* Meta page: https://meta.wikimedia.org/wiki/Wikipedia_e_scuola_italiana
//Uruguay - Wikicurrícula Uy//
* Visualization: https://wikicurricula.wikimedistas.uy/
* Codebase: https://github.com/wikicurricula-uy/wikicurricula-uy.github.io
* Curriculum topics query: https://w.wiki/7P$C
* Data gathering script: https://github.com/wikicurricula-uy/curriculaBot
//Ghana//
* Curriculum topics query: https://w.wiki/7Zge
//Wikidata for Education//
* Meta page: https://www.wikidata.org/wiki/Wikidata:Wikidata_for_Education
**Skills**
//**Required**//
* Basic web development skills.
* Intermediate proficiency in any programming language, preferably Python
* Being comfortable around databases, spreadsheets and datasets
* Basic Git usage
//**Appreciated**//
* Familiarity with SQL/SPARQL/Wikidata Query Service
* Experience with [[https://d3js.org/|d3js]] library
* Basic understanding of Spanish or Italian
**Microtasks**
[] Make a local install of [[https://github.com/wikicurricula-uy/wikicurricula-boilerplate|Wikicurrícula Boilerplate]]. **Write down the instructions** in the README file. The visualization is a static HTML+CSS+JS site. The data-gathering folder contains scripts that are run manually.
[] Translate the boilerplate's interface to English
[] Make a [[https://github.com/wikicurricula-uy/wikicurricula-boilerplate/blob/main/data-gathering/bot.py|data gathering script]] that can work with articles from the English Wikipedia
[] Write instructions in the README file on how to feed the visualization with data from a new curriculum. The instructions should include building and running the Wikidata query.
[] In the Spanish version of the data-gathering script, the feature that counts the templates (//avisi//) added to an article is not working. Fix and enable this feature.
[] Implement - and if possible, deploy! - Wikicurricula for [[https://w.wiki/7Zge|Ghana's national curriculum]], with reference to the English Wikipedia. Document the process in the boilerplate project, or improve the documentation that has been previously written. Which parts of this process are the most cumbersome? What can we change in the code, or automate, so that new implementations are easier? If you come up with any ideas, [[https://github.com/wikicurricula-uy/wikicurricula-boilerplate/issues|write a GitHub issue]].
[] In the visualization, make it possible to order the articles on the x axis by number of references
[] Create a view that shows the curriculum topics that don't have an article in the analyzed Wikipedia (topics of Uruguay's curriculum that don't have an article in the Spanish Wikipedia, topics of Ghana's curriculum that don't have an article in the English Wikipedia)
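
For the last microtask, the data side of such a view might look like the sketch below. It assumes (this is not confirmed by the existing queries) that the curriculum query exposes the Wikipedia sitelink through an OPTIONAL variable, here hypothetically named `article`, next to a label variable named `topicLabel`. In SPARQL JSON results, an unbound OPTIONAL variable is simply missing from the row, which is what the filter relies on. The sample rows are invented for illustration.

```python
def topics_without_article(bindings, article_var="article", label_var="topicLabel"):
    """Return the sorted, de-duplicated labels of topics whose row has no article URL."""
    missing = set()
    for row in bindings:
        # A topic counts as missing when the article variable is unbound or empty.
        article_url = row.get(article_var, {}).get("value", "")
        if not article_url:
            missing.add(row.get(label_var, {}).get("value", ""))
    return sorted(missing)


# Invented rows in SPARQL JSON "bindings" shape (not real curriculum data).
SAMPLE_BINDINGS = [
    {"topicLabel": {"type": "literal", "value": "Topic B"}},
    {"topicLabel": {"type": "literal", "value": "Topic A"},
     "article": {"type": "uri", "value": "https://en.wikipedia.org/wiki/Topic_A"}},
    {"topicLabel": {"type": "literal", "value": "Topic C"}},
    {"topicLabel": {"type": "literal", "value": "Topic B"}},  # duplicate row
]

if __name__ == "__main__":
    for label in topics_without_article(SAMPLE_BINDINGS):
        print(label)
```

The resulting list could be written to its own TSV file and rendered by the static site as a separate page; how it is displayed is left open here.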
**Ready to start?**
* Make your pull requests to the [[https://github.com/wikicurricula-uy/wikicurricula-boilerplate|Boilerplate repo]].
* Several files, such as voci_2023.tsv, subjects.tsv and query.tsv, contain hardcoded data that should not be included in the boilerplate. They have been included in the repo for your reference, but feel free to remove them and add them to .gitignore once we have proper documentation.
* Remember that the goal of this project is to create a boilerplate that can be easily adapted to the needs of other countries and languages. Clear documentation, descriptive names and comments are more important than the features themselves.
* Questions? Email datos at wikimedistas.uy or message @Pirakalamina on Telegram