Page MenuHomePhabricator

Pandora link broken
Closed, ResolvedPublic

Description

Diffs:

https://en.wikipedia.org/w/index.php?title=Gone_(NSYNC_song)&diff=prev&oldid=771665776
https://en.wikipedia.org/w/index.php?title=Bail_Act_1978&diff=prev&oldid=771666453

The original link is fine and should not have been changed the new link is broken.

The URL formats for Australia are listed here:

https://en.wikipedia.org/wiki/Wikipedia:List_of_archives_on_Wikipedia

Are they all known to IABot?

Event Timeline

Restricted Application added a subscriber: Aklapper. · View Herald Transcript

Yes they are all known. I used that page to compile the newest additions to the archive validation subroutine. I try to sanitize them to a single format when they're used to make them more consistent. The test URL I used I converted to all of the formats available and it worked in all instances. I didn't expect it would break other URLs. :(

This is just a testament to how complex such a bot is if you want it to be accurate.

The Australia links can't be sanitized. That could be true for others also. We know Wayback can be sanitized, but service each has its own requirements.

I understand about the many issues. WaybackMedic has to deal with these issues also. All the bugs are rare.. but there there are many many many rare bugs in total (no fault of IABot it's the nature of the problem). The solution is constant monitoring of diffs and running in supervised mode until you can't find bugs. Run small batches at a time - move to the next batch when the bugs in the last batch are fixed. This is a lot of work and time unfortunately but it's the only way.

So far every service I have encountered is either focused on one way to format a snapshot link or provides links in different formats to the same snapshot. Pandora is the first to not do this.

How odd. I've been able to convert the pandora links to different formats and load them all on the List of archives page, with exception to the webarchive subdomain. That appears to be a different service.

That could be it looked like different sub-divisions of the Australian government.

I'm about ready to throw a hammer somewhere. I hate this archiving service. :/

I figured it has to do with being on the opposite side of the planet.

I figured it has to do with being on the opposite side of the planet.

Best services on the northern hemisphere, while the Southern Hemisphere has the worst? I can agree to that. Their toilets flush in the wrong direction anyways. XD

I swear if I hear another bug come up about this service, I swear I will tear down that datacenter. :p