Adjust proxied links in mainspace
Closed, Resolved (Public)

Description

When users cite content from The Wikipedia Library which they access via our proxy server, they might simply copy the URL in their address bar, which will include our EZProxy URL. We could invest in a bot to clean up these URLs, replacing them with the accessible versions.

For Rock's Backpages, as an example, a proxied URL looks like https://www-rocksbackpages-com.wikipedialibrary.idm.oclc.org/Library/Article/camel-over-the-moon. For any user who isn't authorized to access Rock's Backpages via TWL, this URL results in a Library Card authentication error. Instead, the user should have added the direct Rock's Backpages URL, https://www.rocksbackpages.com/Library/Article/camel-over-the-moon, which displays an abstract and the option to log in with various other methods.
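The rewrite in the example above can be sketched in a few lines of Python. This is only an illustration, not how any existing bot does it: the function name `unproxy` and the constant `PROXY_SUFFIX` are made up here, and the sketch assumes that every hyphen in the proxied hostname stands for a dot in the original hostname. That assumption holds for the Rock's Backpages example but would mangle a site whose real hostname contains hyphens, so a production tool would need a more careful decoding rule.

```python
from urllib.parse import urlsplit, urlunsplit

# Assumed suffix shared by OCLC-hosted EZProxy hostnames (taken from the example URL).
PROXY_SUFFIX = ".idm.oclc.org"


def unproxy(url: str) -> str:
    """Return the direct form of an OCLC EZProxy URL, or the input unchanged."""
    parts = urlsplit(url)
    host = parts.hostname or ""
    if not host.endswith(PROXY_SUFFIX):
        return url  # not a proxied link, leave it alone

    # What remains is "<encoded-host>.<library-id>", e.g.
    # "www-rocksbackpages-com.wikipedialibrary".
    prefix = host[: -len(PROXY_SUFFIX)]
    encoded_host, sep, _library_id = prefix.rpartition(".")
    if not sep or not encoded_host:
        return url  # unexpected shape, do not guess

    # ASSUMPTION: hyphens in the encoded host always represent dots.
    original_host = encoded_host.replace("-", ".")
    if parts.port:
        original_host = f"{original_host}:{parts.port}"

    return urlunsplit(
        (parts.scheme, original_host, parts.path, parts.query, parts.fragment)
    )


if __name__ == "__main__":
    print(unproxy(
        "https://www-rocksbackpages-com.wikipedialibrary.idm.oclc.org"
        "/Library/Article/camel-over-the-moon"
    ))
```

Only the hostname is rewritten; the path, query string, and fragment are passed through untouched, so hyphens in the article slug survive.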

We could explore a number of options for solving this problem:

  • Providing users with more guidance about how to avoid adding a proxied URL.
  • Creating a bot from scratch capable of replacing a proxied URL with its original form.
    • This could also potentially un-proxy URLs from other libraries.
  • Working with an existing bot (such as User:Citation bot) to integrate this workflow.

All OCLC EZProxy-proxied URLs from the library on the English Wikipedia can be viewed at https://en.wikipedia.org/w/index.php?target=https%3A%2F%2F*.wikipedialibrary.idm.oclc.org&title=Special%3ALinkSearch

This problem isn't just limited to The Wikipedia Library - https://en.wikipedia.org/w/index.php?title=Special:LinkSearch&limit=500&offset=0&target=https%3A%2F%2F%2A.idm.oclc.org shows that there are hundreds of URLs added via access from other libraries using OCLC EZProxy.
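For a bot, the same listing is available in machine-readable form through the MediaWiki Action API's `list=exturlusage` module, which mirrors Special:LinkSearch. The sketch below only builds the request URL and performs no network access; the helper name `linksearch_api_url` and its defaults are illustrative assumptions, and restricting to namespace 0 is a choice made here to match "mainspace" in the task title.

```python
from urllib.parse import urlencode

# English Wikipedia Action API endpoint.
API_ENDPOINT = "https://en.wikipedia.org/w/api.php"


def linksearch_api_url(pattern: str = "*.idm.oclc.org", limit: int = 500) -> str:
    """Build an exturlusage query URL for external links matching `pattern`."""
    params = {
        "action": "query",
        "list": "exturlusage",
        "euquery": pattern,      # host pattern, same syntax as Special:LinkSearch
        "euprotocol": "https",   # protocol is passed separately from the pattern
        "eunamespace": 0,        # article namespace only (assumption: mainspace)
        "eulimit": limit,
        "format": "json",
    }
    return f"{API_ENDPOINT}?{urlencode(params)}"


if __name__ == "__main__":
    # All OCLC EZProxy links, then only The Wikipedia Library's.
    print(linksearch_api_url())
    print(linksearch_api_url("*.wikipedialibrary.idm.oclc.org"))
```

Fetching the URL and following the continuation token in the response is left out to keep the sketch free of network dependencies.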

Event Timeline

Samwalton9-WMF renamed this task from Proxied links in mainspace to Handle proxied links in mainspace. Dec 9 2019, 10:19 AM
Samwalton9-WMF updated the task description.

Many vendors (Gale, EBSCO, et al.) explicitly mention a proxy link as their preferred citation. If we ever take ProQuest into our fold, we need to either allow proxy links or nothing at all.

Thanks for that - we'll keep this in mind :)

Seeing that this might already be an issue from the soft launch - a couple of Rock's Backpages links now exist with our proxy prefix in mainspace: https://en.wikipedia.org/w/index.php?target=https%3A%2F%2F*.wikipedialibrary.idm.oclc.org&title=Special%3ALinkSearch

These links are totally useless to readers since they attempt to resolve via the proxy and give a 403.

Samwalton9-WMF renamed this task from Handle proxied links in mainspace to Adjust proxied links in mainspace. May 12 2020, 8:49 AM
Samwalton9-WMF triaged this task as Medium priority.
Samwalton9-WMF updated the task description.
Samwalton9-WMF added a subscriber: Smith609.

@Smith609 Does Citation bot (or any other bot you know of) currently do anything to adjust proxied URLs, per the task description?

I don't think it does at present, but it should be straightforward to implement. I suggest proposing the action at https://en.wikipedia.org/wiki/User_talk:Citation_bot, with a generalized description of the problem (i.e. this is a bigger request than just Rock's Backpages, or OCLC). GlazerMann, who is doing most of the work on the bot at present, will be able to feed back as to whether he has the capacity to implement this in the bot.

Samwalton9-WMF claimed this task.
Samwalton9-WMF moved this task from Incoming tasks to Done on the Library-Card-Platform board.

This should be resolved for OCLC proxy URLs thanks to https://github.com/ms609/citation-bot/pull/2877 :)