Page MenuHomePhabricator

Harej (James Hare)
Developer

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Thursday

  • Clear sailing ahead.

User Details

User Since
Oct 16 2014, 12:59 AM (401 w, 5 d)
Availability
Available
IRC Nick
harej
LDAP User
Harej
MediaWiki User
Harej [ Global Accounts ]

Software engineer at Internet Archive. I do other stuff too.

Recent Activity

Yesterday

Harej added a project to T290896: URL is cut off when parsing, resulting in links being treated as dead when they are not: IABot-Priority-Eastern-Europe-Former-USSR.
Mon, Jun 27, 10:40 PM · IABot-Priority-Eastern-Europe-Former-USSR, InternetArchiveBot
Harej added projects to T305243: Internet Archive Bot occasionally duplicates its own templates: IABot-Priority-2022, IABot-Priority-Eastern-Europe-Former-USSR.
Mon, Jun 27, 10:38 PM · IABot-Priority-Eastern-Europe-Former-USSR, IABot-Priority-2022, InternetArchiveBot
Harej added projects to T304278: Bot deletes content on Ukrainian Wikipedia article [ukwiki]: IABot-Priority-2022, IABot-Priority-Eastern-Europe-Former-USSR.
Mon, Jun 27, 10:38 PM · IABot-Priority-Eastern-Europe-Former-USSR, IABot-Priority-2022, InternetArchiveBot
Harej edited Description on IABot-Priority-2022.
Mon, Jun 27, 10:37 PM
Harej added projects to T283964: InternetArchiveBot alters whitespace in template calls, affecting parameter alignment [bgwiki]: IABot-Priority-2022, IABot-Priority-Eastern-Europe-Former-USSR.
Mon, Jun 27, 10:36 PM · IABot-Priority-Eastern-Europe-Former-USSR, IABot-Priority-2022, InternetArchiveBot
Harej added a project to T265488: IABot bug observed in Russian Wikipedia: IABot-Priority-Eastern-Europe-Former-USSR.
Mon, Jun 27, 10:34 PM · IABot-Priority-Eastern-Europe-Former-USSR, Russian-Sites, InternetArchiveBot
Harej added a project to T265813: Wrong processing of book templates in Russian Wikipedia: IABot-Priority-Eastern-Europe-Former-USSR.
Mon, Jun 27, 10:33 PM · IABot-Priority-Eastern-Europe-Former-USSR, InternetArchiveBot (v3.0), Russian-Sites
Harej added projects to T310212: Polish Wikipedia: Adding archive to templates with archive.is/org links in url does not look correct : IABot-Priority-2022, IABot-Priority-Eastern-Europe-Former-USSR.
Mon, Jun 27, 10:31 PM · IABot-Priority-Eastern-Europe-Former-USSR, IABot-Priority-2022, InternetArchiveBot
Harej added projects to T202804: Deploy InternetArchiveBot on the Finnish Wikipedia (fiwiki): IABot-Priority-2022, IABot-Priority-Eastern-Europe-Former-USSR.
Mon, Jun 27, 10:26 PM · IABot-Priority-Eastern-Europe-Former-USSR, IABot-Priority-2022, InternetArchiveBot
Harej created IABot-Priority-Eastern-Europe-Former-USSR.
Mon, Jun 27, 10:25 PM
Harej added a project to T224689: IABot 2.0beta14 encodes cyrillic characters in URLs, making links incomprehensible for humans: IABot-Priority-2022.
Mon, Jun 27, 10:16 PM · IABot-Priority-2022, InternetArchiveBot
Harej created IABot-Priority-2022.
Mon, Jun 27, 10:13 PM
Harej removed a watcher for MediaWiki-extensions-CollaborationKit: Harej.
Mon, Jun 27, 9:35 PM
Harej created T311459: Runpage email does not respect user's language settings.
Mon, Jun 27, 8:34 PM · InternetArchiveBot

Mon, Jun 20

Klein awarded T307612: Clean up redundant template parameters on Albanian Wikipedia sqwiki a Love token.
Mon, Jun 20, 6:07 PM · InternetArchiveBot

Wed, Jun 15

Harej triaged T310735: Bot is misclassifying subscription sites as High priority.
Wed, Jun 15, 6:43 PM · InternetArchiveBot
Harej moved T310735: Bot is misclassifying subscription sites from Inbox to Next on the InternetArchiveBot board.
Wed, Jun 15, 6:42 PM · InternetArchiveBot
Harej created T310735: Bot is misclassifying subscription sites.
Wed, Jun 15, 6:42 PM · InternetArchiveBot

Wed, Jun 8

Harej created T310212: Polish Wikipedia: Adding archive to templates with archive.is/org links in url does not look correct .
Wed, Jun 8, 6:15 PM · IABot-Priority-Eastern-Europe-Former-USSR, IABot-Priority-2022, InternetArchiveBot
Harej moved T299576: Choosing a closed wiki in IABot management interface causes endless auth failure loop from Current Work to Next on the InternetArchiveBot board.
Wed, Jun 8, 6:14 PM · InternetArchiveBot
Harej moved T286738: Changes made in "Configure bot behavior" are not saving from Current Work to Next on the InternetArchiveBot board.
Wed, Jun 8, 6:14 PM · InternetArchiveBot
Harej moved T309728: Parameter error on Chinese Wikipedia from Current Work to Next on the InternetArchiveBot board.
Wed, Jun 8, 6:14 PM · Chinese-Sites, InternetArchiveBot
Harej moved T309730: Bot is marking some links as dead, even though they are considered alive internally from Current Work to Next on the InternetArchiveBot board.
Wed, Jun 8, 6:14 PM · InternetArchiveBot
Harej moved T302917: IABot is not properly handling duplicated named references from Current Work to Next on the InternetArchiveBot board.
Wed, Jun 8, 6:14 PM · InternetArchiveBot

Wed, Jun 1

Harej moved T295459: IABot does not recognize Статья template on Russian Wikipedia [ruwiki] from Blocked to Current Work on the InternetArchiveBot board.
Wed, Jun 1, 6:47 PM · Russian-Sites, InternetArchiveBot
Harej changed the status of T295459: IABot does not recognize Статья template on Russian Wikipedia [ruwiki] from Stalled to Open.
Wed, Jun 1, 6:46 PM · Russian-Sites, InternetArchiveBot
Harej created T309728: Parameter error on Chinese Wikipedia.
Wed, Jun 1, 6:21 PM · Chinese-Sites, InternetArchiveBot

May 28 2022

Harej added a comment to T298424: Cannot open IABot Management Interface(504).

I cannot open it now.

May 28 2022, 4:42 PM · InternetArchiveBot

May 25 2022

Harej closed T304164: Turkish parameter update as Resolved.
May 25 2022, 6:47 PM · Turkish-Sites, InternetArchiveBot
Harej added a comment to T304164: Turkish parameter update.

I would like to note that https://tr.wikipedia.org/wiki/%C5%9Eablon:Kaynak does include those parameters, which is why the bot thought of using them. We have however updated the template maps to remove those fields (except in the template where it is being used). We recommend updating the TemplateData on that template to remove those parameters.

May 25 2022, 6:47 PM · Turkish-Sites, InternetArchiveBot

May 24 2022

Harej added a comment to T308193: [Small wiki toolkits] InternetArchiveBot.

I advise printing the slides to PDF and uploading the PDF.

May 24 2022, 7:18 PM · InternetArchiveBot, Wikimedia-Hackathon-2022, Small-Wiki-Toolkits

May 12 2022

Harej moved T299438: archive.today #select feature from Current Work to Backlog: URLs on the InternetArchiveBot board.
May 12 2022, 4:33 PM · InternetArchiveBot
Harej moved T299438: archive.today #select feature from Backlog: URLs to Current Work on the InternetArchiveBot board.
May 12 2022, 4:33 PM · InternetArchiveBot
Harej moved T308193: [Small wiki toolkits] InternetArchiveBot from Inbox to Current Work on the InternetArchiveBot board.
May 12 2022, 4:33 PM · InternetArchiveBot, Wikimedia-Hackathon-2022, Small-Wiki-Toolkits
Harej renamed T308193: [Small wiki toolkits] InternetArchiveBot from InternetArchiveBot to InternetArchiveBot [Wikimedia Hackathon 2022].
May 12 2022, 4:33 PM · InternetArchiveBot, Wikimedia-Hackathon-2022, Small-Wiki-Toolkits

May 9 2022

Harej moved T305416: Internetarchive bot new fieldnames with aditional "-" duplicate parameters already present in references (url-arquivo should be urlarquivo, url-morta should be urlmorta etc.) from Backlog: Configuration and Deployment to Current Work on the InternetArchiveBot board.
May 9 2022, 7:00 PM · InternetArchiveBot
Harej triaged T305416: Internetarchive bot new fieldnames with aditional "-" duplicate parameters already present in references (url-arquivo should be urlarquivo, url-morta should be urlmorta etc.) as Medium priority.
May 9 2022, 7:00 PM · InternetArchiveBot
Harej moved T305879: Wrong parameter used by InternetArchive bot in glwiki (extra code " == DeadURL or "non" in urlmorta) from Backlog: Configuration and Deployment to Current Work on the InternetArchiveBot board.
May 9 2022, 6:51 PM · InternetArchiveBot
Harej triaged T305879: Wrong parameter used by InternetArchive bot in glwiki (extra code " == DeadURL or "non" in urlmorta) as Medium priority.
May 9 2022, 6:51 PM · InternetArchiveBot

May 4 2022

Harej created T307612: Clean up redundant template parameters on Albanian Wikipedia sqwiki.
May 4 2022, 6:23 PM · InternetArchiveBot

May 2 2022

Harej triaged T306960: IABot marks links to example sites as unreachable as Medium priority.
May 2 2022, 9:39 PM · InternetArchiveBot
Harej moved T305879: Wrong parameter used by InternetArchive bot in glwiki (extra code " == DeadURL or "non" in urlmorta) from Inbox to Backlog: Configuration and Deployment on the InternetArchiveBot board.
May 2 2022, 9:38 PM · InternetArchiveBot
Harej moved T306960: IABot marks links to example sites as unreachable from Inbox to Backlog: URLs on the InternetArchiveBot board.
May 2 2022, 9:38 PM · InternetArchiveBot
Harej moved T306964: Cannot add "dead-url"parameter to Cite Template automatically from Inbox to Backlog: Syntax on the InternetArchiveBot board.
May 2 2022, 9:38 PM · Chinese-Sites, InternetArchiveBot
Harej closed T277698: InternetArchiveBot adds links to archives that have been excluded from the Wayback Machine as Declined.

There is no feasible solution to this problem. Our caches are built in such a way that we assume that, so long as we are aware of an archive link, we assume that archive link will always exist (this is generally how the Internet Archive operates). And, indeed, even when an archive is hidden from public view, the archive still technically exists. To feasibly solve this problem at scale we would need to regularly seek updates on the visibility status of each archive, since it can change according to policy as highlighted above. Even if we had a hot cache of known "invisible" archives, this would create an additional operational strain: the bot would have to go through and remove the archives that are no longer visible, and then re-add those archives if a subsequent policy decision re-enables their access.

May 2 2022, 7:28 PM · Internet-Archive, InternetArchiveBot
Harej closed T294880: IABot incorrectly marks social media dead links as live as Resolved.

The instagram domain was improperly marked as "permalive" and this has been fixed. The listed YouTube URL returns a 200 status code for dead videos, which to the bot means the link is alive.

May 2 2022, 7:06 PM · User-Harej, InternetArchiveBot
Harej added a comment to T294244: britishnewspaperarchive.co.uk archives are all worthless login urls.

The domain (and its several variants) have been marked as subscription sites and we deleted the archive links to this URL.

May 2 2022, 6:52 PM · InternetArchiveBot
Harej closed T294244: britishnewspaperarchive.co.uk archives are all worthless login urls as Resolved.
May 2 2022, 6:51 PM · InternetArchiveBot
Harej closed T293827: Continually reports "Bad title: The page you entered is invalid or doesn't exist. Please check your spelling and try again." as Invalid.
May 2 2022, 6:46 PM · InternetArchiveBot
Harej closed T293324: Bot no longer expands "archive.today" URLs from short format to long format as Resolved.
May 2 2022, 6:46 PM · InternetArchiveBot
Harej triaged T292102: The bot is tagging links inside files as dead as Medium priority.
May 2 2022, 6:42 PM · InternetArchiveBot
Harej triaged T291239: IABot broke archive-date by encoding it as Medium priority.
May 2 2022, 6:41 PM · InternetArchiveBot
Harej triaged T290896: URL is cut off when parsing, resulting in links being treated as dead when they are not as High priority.
May 2 2022, 6:37 PM · IABot-Priority-Eastern-Europe-Former-USSR, InternetArchiveBot
Harej added a comment to T288516: hrwiki - Wrong date format on single page analysis.

@Ivi104 , can you provide us a list of correctly spelled month names, as well as the incorrectly spelled month names the bot uses?

May 2 2022, 6:36 PM · Croatian-Sites, InternetArchiveBot
Harej triaged T288516: hrwiki - Wrong date format on single page analysis as Medium priority.
May 2 2022, 6:35 PM · Croatian-Sites, InternetArchiveBot
Harej triaged T288494: bot creates nonsensical combination of |archive-url= and archive-date= [enwiki] as Medium priority.

bot added |url-status=live and yeah, there is something there, but it ain't what was intended which, one suspects, is why the original url was converted to an archive.org snapshot with this edit

May 2 2022, 6:28 PM · InternetArchiveBot
Harej triaged T288475: Citation engine not detecting native templates for skwiki as Medium priority.
May 2 2022, 6:15 PM · InternetArchiveBot
Harej triaged T287898: Severe template parsing glitch on Italian Wikipedia [itwiki] as Medium priority.
May 2 2022, 6:15 PM · InternetArchiveBot
Harej triaged T302917: IABot is not properly handling duplicated named references as High priority.
May 2 2022, 6:10 PM · InternetArchiveBot

Apr 28 2022

Harej created T307126: InternetArchiveBot on Turkish Wikipedia [trwiki] misspells Mayıs.
Apr 28 2022, 4:44 PM · InternetArchiveBot

Apr 5 2022

Harej moved T305243: Internet Archive Bot occasionally duplicates its own templates from Inbox to Backlog: Syntax on the InternetArchiveBot board.
Apr 5 2022, 11:48 PM · IABot-Priority-Eastern-Europe-Former-USSR, IABot-Priority-2022, InternetArchiveBot
Harej moved T305416: Internetarchive bot new fieldnames with aditional "-" duplicate parameters already present in references (url-arquivo should be urlarquivo, url-morta should be urlmorta etc.) from Inbox to Backlog: Configuration and Deployment on the InternetArchiveBot board.
Apr 5 2022, 11:48 PM · InternetArchiveBot
Harej removed a project from T305243: Internet Archive Bot occasionally duplicates its own templates: Internet-Archive.
Apr 5 2022, 11:48 PM · IABot-Priority-Eastern-Europe-Former-USSR, IABot-Priority-2022, InternetArchiveBot

Mar 30 2022

Harej added a comment to T304236: Template parameter update on huwiki.

For future reference, it is adequate to disable the bot by going to https://iabot.toolforge.org/index.php?page=runpages (make sure Hungarian Wikipedia is selected from the dropdown).

Mar 30 2022, 6:00 PM · Regression, Hungarian-Sites, InternetArchiveBot

Mar 29 2022

Harej changed the status of T274050: Users trying to analyze pages are being told they are blocked when they are not from Stalled to Open.
Mar 29 2022, 12:14 AM · Anti-Harassment, MediaWiki-Blocks, InternetArchiveBot
Harej renamed T277698: InternetArchiveBot adds links to archives that have been excluded from the Wayback Machine from add blocked pages as archives to InternetArchiveBot adds links to archives that have been excluded from the Wayback Machine.
Mar 29 2022, 12:13 AM · Internet-Archive, InternetArchiveBot

Mar 28 2022

Harej moved T274050: Users trying to analyze pages are being told they are blocked when they are not from Blocked to Current Work on the InternetArchiveBot board.
Mar 28 2022, 7:46 PM · Anti-Harassment, MediaWiki-Blocks, InternetArchiveBot
Harej merged task T286831: Avoid adding archive URLs for excluded websites into T277698: InternetArchiveBot adds links to archives that have been excluded from the Wayback Machine.
Mar 28 2022, 7:44 PM · Internet-Archive, InternetArchiveBot
Harej merged T286831: Avoid adding archive URLs for excluded websites into T277698: InternetArchiveBot adds links to archives that have been excluded from the Wayback Machine.
Mar 28 2022, 7:44 PM · Internet-Archive, InternetArchiveBot
Harej updated the task description for T277698: InternetArchiveBot adds links to archives that have been excluded from the Wayback Machine.
Mar 28 2022, 7:44 PM · Internet-Archive, InternetArchiveBot
Harej closed T286143: links archived shown but not appearing in diffss as Invalid.
Mar 28 2022, 7:43 PM · InternetArchiveBot
Harej added a comment to T274050: Users trying to analyze pages are being told they are blocked when they are not.

I received the same error running the tool on a different Wikipedia article, so the specific article being run doesn't seem to affect reproduction of the bug.

Mar 28 2022, 7:39 PM · Anti-Harassment, MediaWiki-Blocks, InternetArchiveBot
Harej added a comment to T274050: Users trying to analyze pages are being told they are blocked when they are not.

Bug reproduction successful. (Note that these steps may not successfully reproduce the bug in the future.)

Mar 28 2022, 7:29 PM · Anti-Harassment, MediaWiki-Blocks, InternetArchiveBot
Harej closed T285191: bot adds |archive-url= for |lay-url= [enwiki] as Resolved.
Mar 28 2022, 7:25 PM · InternetArchiveBot
Harej closed T285181: Archive links not working as Invalid.
Mar 28 2022, 7:25 PM · InternetArchiveBot
Harej closed T284189: Bot doubles links in certain cases as Resolved.
Mar 28 2022, 7:24 PM · InternetArchiveBot
Harej closed T283963: archive-url parameter ignored [bgwiki] as Resolved.
Mar 28 2022, 7:14 PM · InternetArchiveBot
Harej closed T281592: InternetArchiveBot adds non-functional archival links to Google Trends as Resolved.
Mar 28 2022, 6:56 PM · InternetArchiveBot, Internet-Archive
Harej closed T280752: Wrong parameter "dodelink" instead of "dodeurl" on Dutch Wikipedia as Resolved.
Mar 28 2022, 6:47 PM · InternetArchiveBot
Harej merged T278009: creates broken cs1|2 template and bad wiki markup into T270812: converting ext wikilink to cs1|2 template issues [enwiki].
Mar 28 2022, 6:40 PM · Internet-Archive, InternetArchiveBot
Harej merged task T278009: creates broken cs1|2 template and bad wiki markup into T270812: converting ext wikilink to cs1|2 template issues [enwiki].
Mar 28 2022, 6:40 PM · InternetArchiveBot
Harej updated the task description for T270812: converting ext wikilink to cs1|2 template issues [enwiki].
Mar 28 2022, 6:40 PM · Internet-Archive, InternetArchiveBot
Harej closed T277765: IABot goes into infinite loop when encountering a template inside a square link as Resolved.
Mar 28 2022, 6:37 PM · Chinese-Sites, InternetArchiveBot
Harej closed T277378: InternetArchiveBot is adding dead link template to link which already has a dead link template as Resolved.
Mar 28 2022, 6:16 PM · Hungarian-Sites, InternetArchiveBot
Harej added a comment to T277048: Failure to fix improperly-formatted archive URL .

The original URL is not a valid URL so the bot is unable to interpret it.

Mar 28 2022, 6:14 PM · InternetArchiveBot
Harej closed T277048: Failure to fix improperly-formatted archive URL as Declined.
Mar 28 2022, 6:13 PM · InternetArchiveBot
Harej closed T275205: InternetArchiveBot interface not showing all pages submitted as Invalid.
Mar 28 2022, 6:10 PM · InternetArchiveBot
Harej closed T274865: Multiple entries about a non-working link are added. [ltwiki] as Resolved.
Mar 28 2022, 6:09 PM · InternetArchiveBot

Mar 23 2022

Harej triaged T298180: IAbot is archiving archives as Medium priority.
Mar 23 2022, 6:44 PM · InternetArchiveBot
Harej triaged T299438: archive.today #select feature as Medium priority.
Mar 23 2022, 6:35 PM · InternetArchiveBot

Mar 21 2022

Harej closed T274863: InternetArchiveBot job finished without doing anything as Resolved.
Mar 21 2022, 7:01 PM · InternetArchiveBot
Harej triaged T273388: Bot incorrectly changes "wayback" templates to "cite web" despite of them following other cite templates. [huwiki] as Medium priority.
Mar 21 2022, 6:57 PM · Hungarian-Sites, InternetArchiveBot
Harej merged T273977: bot creates broken cs1|2 citation templates into T270812: converting ext wikilink to cs1|2 template issues [enwiki].
Mar 21 2022, 6:54 PM · Internet-Archive, InternetArchiveBot
Harej merged task T273977: bot creates broken cs1|2 citation templates into T270812: converting ext wikilink to cs1|2 template issues [enwiki].
Mar 21 2022, 6:54 PM · InternetArchiveBot
Harej updated the task description for T270812: converting ext wikilink to cs1|2 template issues [enwiki].
Mar 21 2022, 6:53 PM · Internet-Archive, InternetArchiveBot
Harej closed T269525: Bot uses "dead-url" instead of "deadlink" [ruwiki] as Resolved.
Mar 21 2022, 6:48 PM · Russian-Sites, InternetArchiveBot
Harej assigned T265049: bot used |url-access=registration when it should use |chapter-url-access=registration to Green_Cardamom.
Mar 21 2022, 6:45 PM · InternetArchiveBot
Harej closed T263370: Bot adds space before closing brackets in citation template, but not consistently [enwiki] as Declined.
Mar 21 2022, 6:42 PM · InternetArchiveBot
Harej closed T261402: IABot date formatting problem in Cantonese as Resolved.
Mar 21 2022, 6:40 PM · InternetArchiveBot
Harej moved T234546: IAB incorrecly uses older version of archived page when there is a newer version available that should be used from Backlog: URLs to v3.0 on the InternetArchiveBot board.
Mar 21 2022, 6:36 PM · InternetArchiveBot (v3.0)