This is the third task for T300207, What's in a name? Automatically identifying first and last author names for Wikicite and Wikidata, aimed at getting you familiar with structured citations.
- Ask @Mike_Peel and @Pigsonthewing for an item about a scientific article to work on. This may be the same as one you've worked on in the previous tasks, or may be a different one. You can request an item on this task page, and one of us will reply.
- Load the item into the code you wrote for Task 2, and print out the author information from it
- The item will also have a link to the journal, where bibtex or RIS information will be available about the citation. Load this in to your code (use https://docs.python.org/3/howto/urllib2.html ) and print out the author information it contains.
- Try to match up the authors in the Wikidata item with those in the Bibtex or RIS file, and print the information about each author from both sources together.
- Identify which is the first part of the author citation, and which is the last part. If you can, write these into the Wikidata item, using the P9687 and P9688 properties ('author first names' and 'author last names' respectively).
Save your code to a repository, or create a page like https://www.wikidata.org/wiki/User:Mike_Peel/Outreachy_3 (under your username)
Once you are happy, send me a link to your page (by email, on my talk page, or replying to this ticket as you prefer). Make sure to also register it as a contribution on the Outreachy website ( https://www.outreachy.org/outreachy-may-2022-internship-round/communities/wikimedia/whats-in-a-name-automatically-identifying-first-an/contributions/ )!
Hints:
- You can find examples in https://github.com/mpeel/wikicode/blob/master/example.py
- And more at https://www.mediawiki.org/wiki/Manual:Pywikibot/Create_your_own_script