IABot is inserting duplicate archives in cases where the archive URL is mentioned in a source before the actual URL. This is because the archive URL is always assumed to be after the original. Cyberbot needs to be able to handle these cases too.
Description
Description
Status | Subtype | Assigned | Task | ||
---|---|---|---|---|---|
Resolved | Cyberpower678 | T120433 Migrate dead external links to archives | |||
Resolved | Cyberpower678 | T141347 Create and test v1.2 of InternetArchiveBot (tracking) | |||
Resolved | Cyberpower678 | T141213 Archives of a URL mentioned before the URL, not after, will result in the archive getting ignored. |
Event Timeline
Comment Actions
This is a rather complicated bug to fix. The bot's been built around the fact that the archive usually comes after the initial URL.