Page MenuHomePhabricator
Feed Advanced Search

Dec 18 2023

Mikeblas created T353676: The bot shouldn't cause new referencing errors.
Dec 18 2023, 9:52 PM · InternetArchiveBot

Aug 28 2023

Mikeblas updated the task description for T345105: InternetArchiveBot makes bad syntax worse.
Aug 28 2023, 6:08 PM · InternetArchiveBot
Mikeblas created T345105: InternetArchiveBot makes bad syntax worse.
Aug 28 2023, 6:08 PM · InternetArchiveBot

Jul 13 2023

Mikeblas created T341833: InternetArchiveBot "rescued" a footnote definition by removing it entirely, leaving the article with an undefined footnote error.
Jul 13 2023, 9:38 PM · InternetArchiveBot

Oct 7 2022

Mikeblas created T320219: Bot inconsistently altering references on enwiki.
Oct 7 2022, 1:39 AM · InternetArchiveBot

Jan 28 2021

Mikeblas added a comment to T272432: General fixes combining duplicate unnamed references picks name that conflicts with template.

The tracking category is visible in preview, as is the red CS1 error text.

Jan 28 2021, 8:00 PM · AutoWikiBrowser

Aug 10 2020

Mikeblas added a comment to T210871: Citoid is overwriting editor provided values without notification (was "Bloomberg - Are you a robot?").

This bug continues to spread bogus links and titles in references throughout the corpus. Sure, that's a problem caused by the users who trust the script too much and don't review the changes they're making, but it could be mitigated very simply with a fix in this code -- why not speccificly block "are you a robot?" titles from being created until a better fix can be developed?

Aug 10 2020, 3:11 PM · Citoid

Sep 15 2019

Mikeblas added a comment to T232922: IABot makes edits that cause new errors in edited articles.

Unfortunately, no metrics of the bot's activity are visible to me. Since we know that it's not checking it's own work, how are measuring its error rate?

Sep 15 2019, 3:48 PM · InternetArchiveBot
Mikeblas added a comment to T232922: IABot makes edits that cause new errors in edited articles.

If its behavior can't be fixed, then it shouldn't exercise that behavior -- that is, it should not longer make changes to any reference, since it can't know if it is making the article (or the reference) any better or worse. The automated unpredictable breaking of articles isn't desirable. I don't think machine learning is necessary; just defensive programming in the face of dirty input. Being more efficient at breaking articles is not a feature compared to not breaking articles more slowly.

Sep 15 2019, 3:32 PM · InternetArchiveBot
Mikeblas added a comment to T232922: IABot makes edits that cause new errors in edited articles.

Why can't IABot know that there are multiple definitions of the reference it's about to change?

Sep 15 2019, 2:56 PM · InternetArchiveBot
Mikeblas added a comment to T232922: IABot makes edits that cause new errors in edited articles.

This IABot edit to the Abasy article shows a slightly different pattern. Duplicate (but identical and safe) definitions were present in the refs= list, but IABot decided to shorten one to a self-closing tag, and that caused a duplicate ref def error.

Sep 15 2019, 2:29 PM · InternetArchiveBot
Mikeblas added a comment to T232922: IABot makes edits that cause new errors in edited articles.

Considering the "ARIA News 28 Oct" reference in the "ARIA Music Awards of 2014" article, there was previously no duplicate reference. This can be seen by viewing the article revision before InternetArchiveBot made its edits. In that version, there is on error listed in the "References" section of the article. In that revision, the "ARIA News 28 Oct" is defined twice. One definition is in the body of the article, the other is given as a parameter to the refs= parameter of the {{reflist}} template. These references aren't duplicate as far as the rendering engine is concerned because, even tho they have the same name, their content is identical, character-for-character even including case and white space.

Sep 15 2019, 2:42 AM · InternetArchiveBot

Sep 14 2019

Mikeblas created T232922: IABot makes edits that cause new errors in edited articles.
Sep 14 2019, 3:33 PM · InternetArchiveBot

Aug 9 2019

Mikeblas added a comment to T224344: bot causes duplicate reference definitions.

You've said "no further report is needed", but the bot is still causing duplicate references in its edits. Does your comment meant that you're not interested in fixing this problem, or that you believe to be fixed -- even though it actually isn't? Or, maybe something else ... ?

Aug 9 2019, 4:24 PM · InternetArchiveBot

Jul 30 2019

Mikeblas added a comment to T224344: bot causes duplicate reference definitions.

Another example is here:
https://en.wikipedia.org/w/index.php?title=Everyday_(Ariana_Grande_song)&type=revision&diff=908535879&oldid=902472967&diffmode=source

Jul 30 2019, 5:25 PM · InternetArchiveBot

Jul 29 2019

Mikeblas added a comment to T224344: bot causes duplicate reference definitions.

Here is another problematic edit:
https://en.wikipedia.org/w/index.php?title=Firework_(song)&type=revision&diff=908421990&oldid=904848422&diffmode=source

Jul 29 2019, 6:34 PM · InternetArchiveBot

Jul 21 2019

Mikeblas added a comment to T224344: bot causes duplicate reference definitions.

Another faulty edit is shown here:
https://en.wikipedia.org/w/index.php?title=Debashree_Roy&diff=next&oldid=906145012&diffmode=source

Jul 21 2019, 5:28 PM · InternetArchiveBot

Jul 6 2019

Mikeblas added a comment to T224344: bot causes duplicate reference definitions.

That seems like a fine solution to me. If the bot can't handle badly formatted input, it should do what it can to detect that condition before it performs an edit. I think your proposal would do that.

Jul 6 2019, 10:55 PM · InternetArchiveBot

Jun 18 2019

Mikeblas added a comment to T224344: bot causes duplicate reference definitions.

Another faulty edit is here: https://en.wikipedia.org/w/index.php?title=British_Rail_Class_314&type=revision&diff=902025852&oldid=900760109&diffmode=source

Jun 18 2019, 9:46 PM · InternetArchiveBot

Jun 7 2019

Mikeblas added a comment to T224344: bot causes duplicate reference definitions.

> does not mean it's valid wikitext

Jun 7 2019, 5:20 PM · InternetArchiveBot
Mikeblas added a comment to T224344: bot causes duplicate reference definitions.

Here's another edit where IABot created a duplicate reference:

Jun 7 2019, 3:52 AM · InternetArchiveBot

Jun 4 2019

Mikeblas added a comment to T224344: bot causes duplicate reference definitions.

> The MediaWiki parser is quite generous,

Jun 4 2019, 9:31 AM · InternetArchiveBot

May 27 2019

Mikeblas added a comment to T224344: bot causes duplicate reference definitions.

I can't understand why you're saying that the article has invalid wikitext. The page rendered correctly and without error before InternetArchiveBot made its edits. To be clear, it was only after InternetArchiveBot made its edit that the page rendered with an error.

May 27 2019, 4:14 PM · InternetArchiveBot
Mikeblas added a comment to T224344: bot causes duplicate reference definitions.

References in wikipedia can be reused. We can say <ref name="AnchorName">The Pittsburgh Press</ref> to define a reference named "AnchorName", then use only <ref name="AnchorName"/> when we want to repeat that same reference elsewhere in the article.

May 27 2019, 3:48 AM · InternetArchiveBot

May 25 2019

Mikeblas added a comment to T224344: bot causes duplicate reference definitions.

This edit is another recent example: https://en.wikipedia.org/w/index.php?title=Beast_(Canadian_band)&diff=prev&oldid=895490927&diffmode=source

May 25 2019, 6:41 PM · InternetArchiveBot
Mikeblas created T224344: bot causes duplicate reference definitions.
May 25 2019, 6:40 PM · InternetArchiveBot

Oct 15 2018

Mikeblas added a comment to T205803: Duplicate reference name errors in English Wikipedia caused by MediaWiki templatestyle handling?.

I'm sure there's a lot that I don't know about the developmnet process used by this project. For example, I don't understand what "a patch was uploaded" means. I guess it doesn't mean that a fix was actually deployed, because if the fix was live, the kludges wouldn't be necessary.

Oct 15 2018, 11:12 AM · MW-1.32-notes (WMF-deploy-2018-10-16 (1.32.0-wmf.26)), Cite, TemplateStyles
Mikeblas added a comment to T205803: Duplicate reference name errors in English Wikipedia caused by MediaWiki templatestyle handling?.

Over the last 12 hours (or so) there has been a significant drop in the number of topics in Category:Pages with duplicate reference names. I don't see here any notes that indicate an intentional fix was made. But I also note that all I know about the mechanisms involved came from researching this issue.

Oct 15 2018, 9:48 AM · MW-1.32-notes (WMF-deploy-2018-10-16 (1.32.0-wmf.26)), Cite, TemplateStyles

Oct 1 2018

Mikeblas added a comment to T205803: Duplicate reference name errors in English Wikipedia caused by MediaWiki templatestyle handling?.

The number of articles listed in Category:Pages with duplicate reference names continues to grow, I guess as the safe copies of the cache expired.

Oct 1 2018, 1:51 PM · MW-1.32-notes (WMF-deploy-2018-10-16 (1.32.0-wmf.26)), Cite, TemplateStyles

Sep 30 2018

Mikeblas created T205803: Duplicate reference name errors in English Wikipedia caused by MediaWiki templatestyle handling?.
Sep 30 2018, 4:01 PM · MW-1.32-notes (WMF-deploy-2018-10-16 (1.32.0-wmf.26)), Cite, TemplateStyles