Wikipedia's automatic citation generator (Citoid) creates formatted citations from given web sources. However, sometimes metadata are not accurately retrieved. One way to fix this is with Web2Cit, a tool to collaboratively improve automatic citations in Wikipedia.
The Web2Cit community has already indicated (both manually and automatically) the expected citation metadata for a series of webpages, some of which do not match their automatic citations. In this workshop we will introduce the basics of Web2Cit, and show how to use it to fix automatic citations in Wikipedia for these sources, or other relevant to your communities.
- Title of session: How to fix automatic citations in Wikipedia with Web2Cit
- Session description: We will introduce the basics of Web2Cit and show how to use it to fix automatic citations for previously identified sources or others which may be relevant to your local communities.
- Username for contact: @diegodlh & @Nidiah
- Session duration (25 or 50 min): 50 min
- Session type (presentation, workshop, discussion, etc.): workshop
- Language of session (English, Arabic, etc.): English or Spanish (depending on the audience)
- Prerequisites (some Python, etc.): some XPath and JSON might be useful
- Any other details to share?:
- Interested? Add your username below:
If you attended last year's hackathon Web2Cit session, this one may still be relevant. Web2Cit development continued since last year's hackathon session. Most notable changes include:
- Translation test support, to indicate expected output for specific webpages, including tests automatically generated from data from our research team.
- Web2Cit monitor, which regularly checks translation tests and publishes test results.
- JSON-LD support, a popular format to embed metadata in webpages.
Proposed agenda
- Web2Cit basics and installation.
- Identify a problematic webpage (either bring one relevant to your community or choose one from the Web2Cit monitor list).
- Indicate or check the expected output on a translation test.
- Configure a translation template to match the expected output.