Page MenuHomePhabricator

Investigation: See if Earwig's API is using the correct revision of the Wikipedia page for comparisons
Closed, ResolvedPublic1 Estimated Story Points

Description

Sometimes the comparisons returned by Earwig's tool don't highlight any matching content. This is either because the source has changed or the article has changed. Let's make sure the article changing isn't an issue and that Earwig's API is using the content of the revision specified (rather than the most recent version of the article).

Event Timeline

See https://api.ithenticate.com/en_us/dv/6277?lang=en_us&o=23281859 for an example of a case highlighted by Turnitin, but that doesn't show any matches in CopyPatrol. Revision that should be compared is https://en.wikipedia.org/w/index.php?title=List_of_Star_Wars_spacecraft&diff=725593921.

kaldari triaged this task as Medium priority.Jun 16 2016, 5:45 PM
kaldari moved this task from New & TBD Tickets to Needs Discussion on the Community-Tech board.
kaldari set the point value for this task to 1.Jun 16 2016, 5:51 PM
kaldari renamed this task from Investigation: Make sure Earwig's API is using the correct revision of the Wikipedia page to Investigation: See if Earwig's API is using the correct revision of the Wikipedia page for comparisons.Jun 16 2016, 5:53 PM

I can confirm Earwig's API is using the supplied revision. I fixed one copyright violation, and Earwigs API still showed the content from the original edit. Then I revision-deleted it, under enwiki's revdel policy, tried Earwig's API again and got the error The given revision ID doesn't seem to exist. This tells me it's doing what it's supposed to, so our problem must lie with the original source/diff comparison in EranBot.