
Convert archive.today to archive.org
Open, Needs Triage · Public

Description

In this diff, archive.today was converted to archive.org:

https://en.wikipedia.org/w/index.php?diff=940731108

This should not happen, for a couple of reasons. Archive.today uses a different capture and rendering engine that is sometimes more accurate than Wayback's, and it is often the go-to site when Wayback doesn't work or doesn't work well. There is also the issue of content drift: converting from one provider to another shouldn't be done arbitrarily, without a compelling reason, as it can cause content drift. Users who create archive.today links may do so for a reason, and their decisions should be given priority unless there is a reason to change via bot (site is going down, has known bad archives, link not working).

Event Timeline

Restricted Application added a project: Internet-Archive. Feb 14 2020, 3:26 PM
Restricted Application added a subscriber: Cyberpower678.

Is any progress being made on this?

@Kailash29792 Are you still seeing problems? I tried running the bot on this test page and it left the link alone, which is not right either but better than before: https://en.wikipedia.org/wiki/User:GreenC/testcases/generic

Sadly, I am still experiencing such problems. Here is proof.

There are seven known archive.today domains: archive.is, .fo, .li, .today, .vn, .md, and .ph

The bot does not recognize all of them, so when it meets one it doesn't know, it treats the URL as invalid and converts it to Wayback.

The problem can be seen here:
https://github.com/cyberpower678/Cyberbot_II/blob/master/IABot/Core/APII.php
Line # 2493
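
To illustrate the kind of check that is needed, here is a minimal sketch in Python. It is not IABot's code (IABot is PHP, and I am not reproducing the regex at that line); the function name `is_archive_today` and the constant are hypothetical, and the domain list is simply the seven named above.

```python
import re
from urllib.parse import urlsplit

# The seven domains listed in this thread. This is an assumption taken from
# the comment above, not from IABot's source.
ARCHIVE_TODAY_TLDS = ("is", "fo", "li", "today", "vn", "md", "ph")

# Match the whole host name so look-alikes such as archive.is.example.com fail.
_HOST_RE = re.compile(
    r"^(?:www\.)?archive\.(?:%s)$" % "|".join(ARCHIVE_TODAY_TLDS),
    re.IGNORECASE,
)


def is_archive_today(url: str) -> bool:
    """Return True if the URL's host is one of the known archive.today domains.

    Hypothetical helper for illustration. URLs without a scheme have no
    parsed host and are reported as False.
    """
    host = urlsplit(url).hostname or ""
    return bool(_HOST_RE.match(host))
```

With a check like this, a link on archive.md or archive.ph would be recognized as a valid archive.today link rather than falling through to the Wayback conversion.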

@Cyberpower678 this looks like a simple fix? The alternative is that I write a bot that searches for the missing three (.md, .ph, .vn) and converts them to .today, but it would only work on Enwiki, and it seems easier to add the lines to IABot.

Green_Cardamom added a comment. Edited Tue, Mar 17, 5:43 PM

I wrote a small bot that will run perpetually (3x a week) on enwiki until the IABot bug is fixed. It converts archive.(ph|md|vn) to archive.today, but only when the URL is in short form, not long form. There look to be about 700 links to begin with, and perhaps 0-5 new ones each day. It is on Tools in /data/project/botwikiawk/atoday
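
For anyone curious what the conversion amounts to, here is a rough sketch in Python. It is not the bot's actual code. It assumes that "short form" means a single alphanumeric ID after the domain (e.g. archive.ph/AbCdE) and that anything with a timestamp and embedded source URL is long form; the names `SHORT_FORM_RE` and `to_archive_today` are hypothetical.

```python
import re

# Assumption: a short-form URL is scheme + one of the three unrecognized
# domains + a single alphanumeric ID, with an optional trailing slash.
SHORT_FORM_RE = re.compile(
    r"^(?P<scheme>https?://)(?:www\.)?archive\.(?:ph|md|vn)/(?P<id>[A-Za-z0-9]+)/?$"
)


def to_archive_today(url: str) -> str:
    """Rewrite a short-form archive.(ph|md|vn) URL to archive.today.

    Long-form URLs and URLs on any other domain are returned unchanged.
    """
    m = SHORT_FORM_RE.match(url)
    if m is None:
        return url
    return f"{m.group('scheme')}archive.today/{m.group('id')}"
```

Under those assumptions, `to_archive_today("https://archive.ph/AbCdE")` gives `https://archive.today/AbCdE`, while a long-form link such as `https://archive.ph/2020.01.01-000000/http://example.com` is left alone.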

Is there any place where there's a discussion of "best practices" for creating archive.today links?