- Context.
The #1Lib1Ref (One Librarian, One Reference) campaign encourages librarians, researchers, and volunteers worldwide to improve the reliability of Wikipedia by adding citations to articles that lack references. One of the most impactful ways to contribute is by focusing on highly visible articles that lack proper citations.
- Description.
I'm looking for a list of the top 100 Wikipedia articles (ordered by visibilty) that meet two key criteria:
- They lack references (i.e., they have no references).
- They are highly visible (i.e., they receive significant traffic and are frequently viewed by readers).
- Expected Deliverable.
An actionable code that users could replicate (ideally a Python notebook - and even better: a tool)
If this not possible, the resulting data frames for the following languages (en, es, fr, id, sr, hr, id, pl, ro) is also good for now.
The information delivered last time is desirable (revision_timestamp page_id page_title revision_id page_length num_refs num_wikilinks num_categories num_media num_headings num_views)
- Estimated Effort.
I believe that since this has already been delivered once, it shouldn't take more than a week.
- Priority It should be available before the campaign starts in May
I need this task resolved in:
- 1 month.
- 3 months.
- 6 months.
- Whenever you get to it :-)
- Other. Do you have any other questions or comments ?
For use by WMF Research team; please leave everything below as it is:
- Does the request serve one of the existing Research team's audiences? If yes, choose the primary audience. (1 of 4)
- What is the type of work requested?
- What is the impact of responding to this request?
- Support a technology or policy need of one or more WM projects
- Advance the understanding of the WM projects.
- Something else. If you choose this option, please explain briefly the impact below.