Page MenuHomePhabricator

Wikipedia pages parsed as website, not encyclopedia
Closed, ResolvedPublic

Description

During the transition of Citoid to Zotero v2 (T197242: Transition citoid to use Zotero's translation-server-v2), we noticed that Wikipedia pages are now being parsed as website articles rather than encyclopedia entries. It seems like Zotero is not used for Wikipedia articles any more. For instance, Darth Vader used to have the following output with Zotero v1:

[
  {
    "itemType": "encyclopediaArticle",
    "notes": [],
    "tags": [],
    "title": "Darth Vader",
    "rights": "Creative Commons Attribution-ShareAlike License",
    "encyclopediaTitle": "Wikipedia",
    "url": "https://en.wikipedia.org/w/index.php?title=Darth_Vader&oldid=871815944",
    "extra": "Page Version ID: 871815944",
    "language": "en",
    "date": "2018-12-03",
    "abstractNote": "Darth Vader is a fictional character in the Star Wars franchise. He is the main villain of the original trilogy, but, as Anakin Skywalker, is the main protagonist of the prequel trilogy. Star Wars creator George Lucas has collectively referred to the first six episodic films of the franchise as \"the tragedy of Darth Vader.\"Originally a Jedi prophesied to bring balance to the Force, Anakin Skywalker is lured to the dark side of the Force by Palpatine, who is secretly a Sith Lord. After fighting a lightsaber battle with his former mentor Obi-Wan Kenobi in which he is dismembered, Vader is transformed into a cyborg. He then serves the Galactic Empire as Darth Vader until he redeems himself by saving his son, Luke Skywalker, and killing Palpatine, sacrificing his own life in the process. He is also the father of Princess Leia Organa, the secret husband of Padmé Amidala, and grandfather of Kylo Ren, the main villain of the Star Wars sequel trilogy.\nThe character has been portrayed by numerous actors. His cinematic appearances span the first six Star Wars films, as well as Rogue One, and his character is referenced in both The Force Awakens and The Last Jedi. He also appears in numerous iterations of the Star Wars Expanded Universe including television series (most substantially The Clone Wars), video games, novels, literature and comic books.\nDarth Vader has become one of the most iconic villains in popular culture, and has been listed among the greatest villains and fictional characters ever. The American Film Institute listed him as the third greatest movie villain in cinema history on 100 Years... 100 Heroes and Villains, behind Hannibal Lecter and Norman Bates. His role as a tragic hero in the prequel trilogy met with mixed reviews.",
    "libraryCatalog": "Wikipedia",
    "accessDate": "2018-12-04",
    "source": [
      "Zotero"
    ]
  }
]

After switching to Zotero v2 we get the following:

[
  {
    "url": "https://en.wikipedia.org/wiki/Darth_Vader",
    "itemType": "webpage",
    "title": "Darth Vader - Wikipedia",
    "language": "en",
    "accessDate": "2018-12-04",
    "websiteTitle": "en.wikipedia.org",
    "source": [
      "citoid"
    ]
  }
]

Event Timeline

mobrovac created this task.

Mentioned in SAL (#wikimedia-operations) [2018-12-04T10:43:01Z] <mobrovac@deploy1001> Started deploy [restbase/deploy@8abcbda]: Disable Citoid test for switching it to Zotero v2 - T211088 T197242

Mentioned in SAL (#wikimedia-operations) [2018-12-04T11:03:59Z] <mobrovac@deploy1001> Finished deploy [restbase/deploy@8abcbda]: Disable Citoid test for switching it to Zotero v2 - T211088 T197242 (duration: 20m 59s)

This was due to Citoid not being able to communicate with Zotero. Fixed now.