
Ref mangled with certain garbage data input
Closed, Resolved (Public)

Description

Diff:

https://en.wikipedia.org/w/index.php?title=Michelle_Rodriguez&type=revision&diff=776434029&oldid=776216267

I come across 1 or 2 of these in every 10,000-article batch. It's always the same garbage input and the same result: the ref contains two copies of the same URL, one inside a {{cite web}} and one as a bare link trailing at the end. This throws IABot for a loop, causing it to embed a {{cite web}} inside another {{cite web}}.

Event Timeline

Restricted Application added a subscriber: Aklapper.
Cyberpower678 moved this task from Inbox to v1.3 on the InternetArchiveBot board.
Cyberpower678 moved this task from Unsorted to Bugs on the InternetArchiveBot (v1.3) board.

There's an issue with the custom str_replace function.

It's as I feared. I have to rewrite the function to fix this, which could take some time.
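
For readers following along, here is a minimal sketch of how a replacement keyed only on the URL string could produce the nesting described in this task, and how a replacement tied to a specific offset avoids it. This is not IABot's code (the bot is written in PHP, and its custom str_replace is not shown in this task); the function names, the example URL, and the wikitext are all hypothetical, and the assumption that the bug comes from matching the duplicate URL is a guess based on the description.

```python
def replace_all(text: str, old: str, new: str) -> str:
    """Naive replacement: rewrites every occurrence of `old` in `text`."""
    return text.replace(old, new)


def replace_at(text: str, old: str, new: str, offset: int) -> str:
    """Replace only the occurrence of `old` that starts at `offset`."""
    if not text.startswith(old, offset):
        raise ValueError("expected string not found at the given offset")
    return text[:offset] + new + text[offset + len(old):]


# Hypothetical input: the same URL appears inside a {{cite web}} and again
# as a bare link trailing at the end of the ref.
URL = "http://example.com/page"
wikitext = "<ref>{{cite web |url=" + URL + " |title=Example}} " + URL + "</ref>"

# Hypothetical replacement the bot wants to put in place of the bare link.
replacement = "{{cite web |url=" + URL + " |title=Archived copy}}"

# Naive: the URL inside the existing template is rewritten as well,
# which embeds a {{cite web}} inside another {{cite web}}.
naive = replace_all(wikitext, URL, replacement)

# Offset-aware: only the trailing bare link is rewritten.
bare_link_offset = wikitext.rindex(URL)
targeted = replace_at(wikitext, URL, replacement, bare_link_offset)

print(naive)
print(targeted)
```

The offset-aware version relies on the caller knowing where the bare link starts, which a parser would normally record when it first finds the link. Raising an error when the text at that offset does not match keeps a stale offset from silently corrupting the page.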