Page MenuHomePhabricator

IABot inserts links to archive snapshots with broken document viewer
Closed, InvalidPublic

Description

There seems to be a problem. Text below copy/pasted from my talk page

Hi, you added a lot of archived links to old scanned journal links on sites like Biodiversity Library and Archive.org with this[https://en.wikipedia.org/w/index.php?title=Lord_Howe_swamphen&curid=532471&diff=838305258&oldid=835156983] edit, but when I click on the archive links added, they are unable[https://web.archive.org/web/20180426051443/https://www.biodiversitylibrary.org/page/6381651#page/742/mode/1up] to actually show the scanned pages. Don't know whether the software used on those pages prevent such archiving, but in any case, it doesn't seem to work, so maybe it is a waste of work? [[User:FunkMonk|FunkMonk]] ([[User talk:FunkMonk|talk]]) 10:20, 26 April 2018 (UTC)
: Sorry, I only ran a bot. I was unaware of these issues. Should I self-revert, or leave a note to the bot owner pointing out this problem? [[User:Lingzhi|Lingzhi]] ♦ [[User talk:Lingzhi|(talk)]] 10:24, 26 April 2018 (UTC)
::I'm not sure what to do, actually, it's the first time I notice there is an issue. But yeah, it seems the bot should maybe skip sites that use whatever those "page turning" softwares are. The edit should probably be reverted, though it seems at least the first link (to an old html site) worked. [[User:FunkMonk|FunkMonk]] ([[User talk:FunkMonk|talk]]) 10:28, 26 April 2018 (UTC)
::I'll revert and notify, then. Sorry for the inconvenience! [[User:Lingzhi|Lingzhi]] ♦ [[User talk:Lingzhi|(talk)]] 10:32, 26 April 2018 (UTC)

Event Timeline

Cirdan renamed this task from was asked to self-revert IABOT edits to IABot inserts links to archive snapshots with broken document viewer.Apr 26 2018, 5:41 PM
Cirdan triaged this task as Low priority.

Thanks for reporting this issue. However, currently the best you can do is to remove the broken archive URLs from the IABot's database so that they are not added again. You can do this yourself on the IABot's management interface.

In general, you should always verify that an archive URL added by IABot leads to a readable/usable snapshot.

Cirdan lowered the priority of this task from Low to Lowest.Jun 2 2018, 8:08 AM
Vvjjkkii renamed this task from IABot inserts links to archive snapshots with broken document viewer to m6daaaaaaa.Jul 1 2018, 1:13 AM
Vvjjkkii removed Cyberpower678 as the assignee of this task.
Vvjjkkii raised the priority of this task from Lowest to High.
Vvjjkkii updated the task description. (Show Details)
Vvjjkkii added a subscriber: Cyberpower678.
Green_Cardamom renamed this task from m6daaaaaaa to IABot inserts links to archive snapshots with broken document viewer.Jul 1 2018, 4:59 AM
Green_Cardamom assigned this task to Cyberpower678.
Green_Cardamom lowered the priority of this task from High to Lowest.
Green_Cardamom updated the task description. (Show Details)
Green_Cardamom removed a subscriber: Cyberpower678.

So this is not something IABot is going to be able to detect, but perhaps the Wayback Machine can detect these and filter them out of the resultset when IABot queries it. I have forwarded this report to Internet Archive.