aolnews.com not being rescued
Closed, Resolved · Public

Description

Example URL:

https://tools.wmflabs.org/iabot/index.php?page=manageurlsingle&url=http%3A%2F%2Fwww.aolnews.com%2F2010%2F02%2F22%2Fthe-undertaker-burned-during-accident-at-elimination-chamber%2F

When IABot is run on the target article, it does not rescue the link even though the link is dead. This is probably a soft 404. All aolnews.com links should be treated as dead. Is there a way to do that?

First reported here:

https://en.wikipedia.org/wiki/Wikipedia:Bot_requests#Broken_links_to_aolnews.com

Event Timeline

Restricted Application added a subscriber: Cyberpower678.

They are now 301 redirects to http://www.aol.com/:

$ wget -S http://www.aolnews.com/2010/02/22/the-undertaker-burned-during-accident-at-elimination-chamber/
--2017-09-02 01:57:23-- http://www.aolnews.com/2010/02/22/the-undertaker-burned-during-accident-at-elimination-chamber/
Resolving www.aolnews.com... 195.93.85.44
Connecting to www.aolnews.com|195.93.85.44|:80... connected.
HTTP request sent, awaiting response...

HTTP/1.1 301 Moved Permanently
Date: Fri, 01 Sep 2017 23:57:24 GMT
Server: Apache
Location: http://www.aol.com/
Content-Length: 227
Keep-Alive: timeout=15, max=9996
Connection: Keep-Alive
Content-Type: text/html; charset=iso-8859-1

Location: http://www.aol.com/ [following]
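
The response above shows a common soft-404 pattern: a deep article URL answered with a redirect to a bare home page. The following is a minimal sketch of a heuristic for spotting that pattern, not a description of how IABot actually decides. The function name `looks_like_soft_404` and the set of redirect codes are assumptions made for illustration.

```python
from urllib.parse import urljoin, urlsplit

# Status codes treated as redirects in this sketch (an assumption, not IABot's list).
REDIRECT_CODES = {301, 302, 303, 307, 308}


def looks_like_soft_404(original_url, status, location):
    """Hypothetical heuristic: a deep link redirected to a site root is probably dead."""
    if status not in REDIRECT_CODES or not location:
        return False
    source = urlsplit(original_url)
    # Resolve relative Location values against the original URL.
    target = urlsplit(urljoin(original_url, location))
    source_is_deep = source.path.strip("/") != ""
    target_is_root = target.path.strip("/") == "" and not target.query
    return source_is_deep and target_is_root


# Values taken from the wget output above.
url = ("http://www.aolnews.com/2010/02/22/"
       "the-undertaker-burned-during-accident-at-elimination-chamber/")
print(looks_like_soft_404(url, 301, "http://www.aol.com/"))  # True
```

A heuristic like this can misfire on sites that legitimately redirect old pages to a home page, so it is best read as a hint that a link needs checking rather than proof that it is dead.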

The link you have provided shows that the URL is already archived on said article. IABot will not do anything to it.

Sorry, missed that :) Here's another one, for Evander Holyfield:

https://tools.wmflabs.org/iabot/index.php?page=manageurlsingle&url=http%3A%2F%2Fwww.aolnews.com%2F2011%2F02%2F03%2Fevander-holyfields-cut-postpones-brian-nielsen-fight-to-may-7%2F

Running Analyze Page doesn't rescue it. It's probably safe to assume all aolnews.com links are dead, even though they redirect to aol.com.
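
The management-interface links quoted in this thread carry the target URL percent-encoded in the `url` query parameter. As a small sketch of how such a link can be assembled, the snippet below rebuilds the Evander Holyfield link from its parts using only the standard library. The helper name `manage_url_link` is hypothetical, and the base address and parameter names are simply copied from the links above.

```python
from urllib.parse import urlencode

# Base address as it appears in the links in this thread.
IABOT_INDEX = "https://tools.wmflabs.org/iabot/index.php"


def manage_url_link(target_url):
    """Hypothetical helper: build a manageurlsingle link for one target URL."""
    # urlencode percent-encodes ":" and "/" in the value, matching the links above.
    query = urlencode({"page": "manageurlsingle", "url": target_url})
    return f"{IABOT_INDEX}?{query}"


link = manage_url_link(
    "http://www.aolnews.com/2011/02/03/"
    "evander-holyfields-cut-postpones-brian-nielsen-fight-to-may-7/"
)
print(link)
```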

You can use the domain data tool to set all the URLs in the domain aolnews.com to dead.

OK, I set the domain to dead and reran the bot on the 440 articles:

https://tools.wmflabs.org/iabot/index.php?page=viewjob&id=682

Seems to be working!