Harej (James Hare)
Developer

User Details

User Since
Oct 16 2014, 12:59 AM (581 w, 4 d)
Availability
Available
IRC Nick
hare
LDAP User
Harej
MediaWiki User
Harej

Software developer for hire

Recent Activity

Nov 2 2025

Harej created T409006: Update dump mirror rsync allowlist to reflect new IP address for Scatter.
Nov 2 2025, 5:36 PM · Datasets-General-or-Unknown

Jun 17 2025

Harej added a comment to T355341: mwcli installation warning: image's platform does not match the detected host platform.

While I got warnings, you are actually right that they are just warnings and it still works.

Jun 17 2025, 8:47 PM · a Wikimedia CLI

Jun 7 2025

Harej created T396289: Add option to change container port.
Jun 7 2025, 11:49 PM · a Wikimedia CLI
Harej created T396288: Add support for arm64 architecture.
Jun 7 2025, 11:24 PM · a Wikimedia CLI

May 10 2025

Harej closed T393245: [Unconference] Demo: Citation Watchlist user script as Resolved.
May 10 2025, 3:16 PM · User-Harej, Citation-Watchlist, Wikimedia-Hackathon-2025
Harej closed T393238: [Unconference] The Wikipedia Citations Database! as Resolved.
May 10 2025, 3:16 PM · User-Harej, Wikimedia-Hackathon-2025
Harej added a comment to T155846: Reference recommender system.

The Librarybase project is ongoing at https://librarybase.org. There is also a prototype database of citations appearing on Wikipedia at https://wikipediacitations.scatter.red. Otherwise I am not familiar with any progress on this kind of work, though it's something I am still interested in.

May 10 2025, 3:15 PM · WikiCite, artificial-intelligence

May 4 2025

Harej added a comment to T393238: [Unconference] The Wikipedia Citations Database!.

Demo is done; this task can be closed at any time.

May 4 2025, 10:42 AM · User-Harej, Wikimedia-Hackathon-2025
Harej added a comment to T393245: [Unconference] Demo: Citation Watchlist user script.

The demo has been done! This task can be closed any time.

May 4 2025, 10:42 AM · User-Harej, Citation-Watchlist, Wikimedia-Hackathon-2025

May 3 2025

Harej created T393245: [Unconference] Demo: Citation Watchlist user script.
May 3 2025, 9:42 AM · User-Harej, Citation-Watchlist, Wikimedia-Hackathon-2025
Harej created T393238: [Unconference] The Wikipedia Citations Database!.
May 3 2025, 8:30 AM · User-Harej, Wikimedia-Hackathon-2025

Apr 24 2025

Harej moved T363400: Incorporate questionable academic journals list into Citation Watchlist from Inbox to Next on the Citation-Watchlist board.
Apr 24 2025, 8:29 PM · Citation-Watchlist
Harej moved T363403: Incorporate JzG predatory journals list into Citation Watchlist from Inbox to Next on the Citation-Watchlist board.
Apr 24 2025, 8:29 PM · Citation-Watchlist

Apr 22 2025

Harej moved T363751: Concept of domain aliases in Citation Watchlist from Inbox to Current on the Citation-Watchlist board.
Apr 22 2025, 5:14 PM · Citation-Watchlist
Harej closed T388164: Add data caching for Citation Watchlist as Resolved.
Apr 22 2025, 5:12 PM · Citation-Watchlist

Apr 21 2025

Harej added a comment to T388164: Add data caching for Citation Watchlist.

I also added a function to purge expired cache at the beginning of the script call.

Apr 21 2025, 9:21 PM · Citation-Watchlist
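
The purge step mentioned in the comment above can be illustrated with a minimal sketch. This is not the Citation Watchlist code (which is a browser user script); the cache layout, the "expires" field, and the function name purge_expired are all assumptions made for illustration.

```python
import time


def purge_expired(cache, now=None):
    """Delete entries whose 'expires' timestamp has passed; return how many were removed.

    Hypothetical layout: cache maps a key to {"value": ..., "expires": <epoch seconds>}.
    """
    now = time.time() if now is None else now
    # Collect keys first so the dict is not mutated while iterating over it.
    expired = [key for key, entry in cache.items() if entry["expires"] <= now]
    for key in expired:
        del cache[key]
    return len(expired)


# Illustrative data only: one stale entry and one fresh entry.
cache = {
    "psl": {"value": ["co.uk", "com"], "expires": 100.0},
    "list:example": {"value": ["example.org"], "expires": 300.0},
}
removed = purge_expired(cache, now=200.0)
```

Running the purge once at the start of each script call, as the comment describes, keeps stale entries from accumulating without needing a background timer.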
Harej added a comment to T388164: Add data caching for Citation Watchlist.

Added caching for:

  • the Public Suffix List
  • Individual lists of domains
  • Added URLs for a given combination of old revision ID and new revision ID
    • Whether a given URL matches is not cached, since that can change as the contents of the domain list change. But whether a URL appeared in a given revision should not change (unless there was a bug in the URL extraction mechanism).
Apr 21 2025, 9:13 PM · Citation-Watchlist
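
The caching split described in the comment above (cache the URLs added between two revisions, but re-run list matching every time) might look roughly like the following sketch. It is written in Python for illustration and is not the script's actual code; the names added_urls_cache, get_added_urls, flagged_urls, and the fetch callback are hypothetical, and the simple host-suffix check stands in for whatever Public Suffix List based matching the real script performs.

```python
from urllib.parse import urlsplit

# Hypothetical in-memory cache keyed by (old revision ID, new revision ID).
added_urls_cache = {}


def get_added_urls(old_rev, new_rev, fetch_added_urls):
    """Return URLs added between two revisions, fetching only on a cache miss."""
    key = (old_rev, new_rev)
    if key not in added_urls_cache:
        added_urls_cache[key] = fetch_added_urls(old_rev, new_rev)
    return added_urls_cache[key]


def flagged_urls(urls, domain_list):
    """Screen URLs against the currently loaded domain list (never cached)."""
    flagged = []
    for url in urls:
        host = urlsplit(url).hostname or ""
        # Simplified matching: exact host or any subdomain of a listed domain.
        if any(host == d or host.endswith("." + d) for d in domain_list):
            flagged.append(url)
    return flagged


# Illustrative usage with a fake fetcher that records how often it is called.
calls = []


def fake_fetch(old_rev, new_rev):
    calls.append((old_rev, new_rev))
    return ["https://www.example.com/story", "https://example.org/paper"]


urls = get_added_urls(100, 101, fake_fetch)
urls_again = get_added_urls(100, 101, fake_fetch)  # served from cache
first = flagged_urls(urls, {"example.com"})
second = flagged_urls(urls_again, {"example.com", "example.org"})  # list changed
```

Because only the revision comparison is cached, an updated domain list takes effect immediately on the next screening pass without refetching any diffs.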

Apr 15 2025

Harej moved T363388: Deploy Citation Watchlist as (global) gadget from Inbox to Future on the Citation-Watchlist board.
Apr 15 2025, 10:45 PM · Citation-Watchlist
Harej created T392002: Optional activate-on-button-press for Citation Watchlist.
Apr 15 2025, 5:32 PM · Citation-Watchlist
Harej moved T388164: Add data caching for Citation Watchlist from Inbox to Current on the Citation-Watchlist board.
Apr 15 2025, 5:23 PM · Citation-Watchlist
Harej closed T388161: Develop testing plan for Citation Watchlist as Resolved.
Apr 15 2025, 5:20 PM · User-Harej, Citation-Watchlist
Harej closed T388162: Propose Citation Watchlist as gadget on English Wikipedia as Resolved.
Apr 15 2025, 4:57 PM · Citation-Watchlist
Harej added a comment to T388162: Propose Citation Watchlist as gadget on English Wikipedia.

Discussion outcome: https://en.wikipedia.org/wiki/Wikipedia:Village_pump_(technical)/Archive_219#h-Gadget_proposal:_Citation_Watchlist-20250317185000

Apr 15 2025, 4:57 PM · Citation-Watchlist

Mar 12 2025

Harej created T388714: Make hovering over emoji work on mobile devices.
Mar 12 2025, 8:13 PM · Citation-Watchlist

Mar 7 2025

Harej moved T388161: Develop testing plan for Citation Watchlist from Inbox to Current on the Citation-Watchlist board.
Mar 7 2025, 8:17 PM · User-Harej, Citation-Watchlist

Mar 6 2025

Harej created T388168: Explore AI model for detecting citations in Citation Watchlist.
Mar 6 2025, 8:56 PM · Citation-Watchlist
Harej created T388167: Citation Watchlist interface feature for personal domain list.
Mar 6 2025, 8:46 PM · Citation-Watchlist
Harej created T388166: Citation Watchlist should detect the addition of sources other than URL-based ones.
Mar 6 2025, 8:38 PM · Citation-Watchlist
Harej created T388164: Add data caching for Citation Watchlist.
Mar 6 2025, 8:35 PM · Citation-Watchlist
Harej created T388163: Add performance benchmarking to Citation Watchlist.
Mar 6 2025, 8:28 PM · Citation-Watchlist
Harej created T388162: Propose Citation Watchlist as gadget on English Wikipedia.
Mar 6 2025, 8:24 PM · Citation-Watchlist
Harej created T388161: Develop testing plan for Citation Watchlist.
Mar 6 2025, 8:19 PM · User-Harej, Citation-Watchlist

Feb 10 2025

Harej closed T363404: Incorporate self publishing list into Citation Watchlist as Resolved.
Feb 10 2025, 6:11 PM · Citation-Watchlist
Harej added a comment to T363404: Incorporate self publishing list into Citation Watchlist.

https://en.wikipedia.org/wiki/Wikipedia:Citation_Watchlist/Lists/Self-Publishing

Feb 10 2025, 6:11 PM · Citation-Watchlist

Dec 3 2024

Harej added a comment to T336937: Attempt to create OAuth2 consumer ends in HTTP 500.

My input and the resulting error

Dec 3 2024, 8:07 PM · Wikibase Cloud (Kanban Board Q2 2025), affects-Miraheze

Nov 14 2024

Harej reopened T336937: Attempt to create OAuth2 consumer ends in HTTP 500 as "Open".
Nov 14 2024, 7:22 PM · Wikibase Cloud (Kanban Board Q2 2025), affects-Miraheze
Harej added a comment to T336937: Attempt to create OAuth2 consumer ends in HTTP 500.

I now get this error:

[6aab8836cf02dc6764ce8a29] 2024-11-14 19:19:54: Fatal exception of type "Lcobucci\JWT\Signer\Key\FileCouldNotBeRead"
Nov 14 2024, 7:22 PM · Wikibase Cloud (Kanban Board Q2 2025), affects-Miraheze

Oct 25 2024

Harej removed a project from T319887: Bring back `mdwiki` tool from the demise of GridEngine (transition to Toolforge Kubernetes): User-Harej.
Oct 25 2024, 8:58 PM · Tools, Wikimedia-Medicine
Harej removed a project from T355725: Populate language links on MDWiki based on Wikidata item: User-Harej.
Oct 25 2024, 8:57 PM · MediaWiki-extensions-UnlinkedWikibase, Wikimedia-Medicine

Oct 24 2024

Harej closed T363921: Async watchlist entry processing as Resolved.
Oct 24 2024, 9:03 PM · Citation-Watchlist
Harej closed T364918: Adapt Citation Watchlist to work on Page Histories as well as Resolved.
Oct 24 2024, 9:03 PM · Citation-Watchlist
Harej closed T366396: Replace home-grown differ with REST API call as Resolved.
Oct 24 2024, 9:03 PM · Citation-Watchlist
Harej closed T363750: Incorporate list of Wikipedia subdomains as a list within Citation Watchlist as Resolved.
Oct 24 2024, 9:03 PM · Citation-Watchlist
Harej closed T363745: Incorporate "Deprecated sources" into Citation Watchlist as Resolved.
Oct 24 2024, 9:03 PM · Citation-Watchlist
Harej closed T364336: Citation Watchlist not triggering on www.thesun.co.uk as Resolved.
Oct 24 2024, 9:03 PM · Citation-Watchlist
Harej closed T364333: Citation Watchlist not triggering match on www.dailymail.co.uk as Resolved.
Oct 24 2024, 9:02 PM · Citation-Watchlist
Harej closed T363875: Add release log to WP:WATCHCITE as Resolved.
Oct 24 2024, 9:02 PM · Citation-Watchlist
Harej closed T364926: Incorporate Template:Predatory open access source list into Citation Watchlist as Resolved.
Oct 24 2024, 9:02 PM · Citation-Watchlist
Harej closed T364937: Fill out Toolhub tool description as Resolved.
Oct 24 2024, 9:01 PM · Citation-Watchlist
Harej moved T363379: Feature: Turn source lists on/off from Inbox to Future on the Citation-Watchlist board.
Oct 24 2024, 9:00 PM · Citation-Watchlist
Harej moved T363381: Enhanced tooltip when hovering over indicators from Inbox to Future on the Citation-Watchlist board.
Oct 24 2024, 9:00 PM · Citation-Watchlist
Harej moved T363382: Citation Watchlist indicator should include provenance from Inbox to Future on the Citation-Watchlist board.
Oct 24 2024, 9:00 PM · Citation-Watchlist
Harej moved T363388: Deploy Citation Watchlist as (global) gadget from Inbox to Future on the Citation-Watchlist board.
Oct 24 2024, 8:59 PM · Citation-Watchlist
Harej moved T363393: Server-side processing of diffs, list screening from Inbox to Future on the Citation-Watchlist board.
Oct 24 2024, 8:59 PM · Citation-Watchlist
Harej moved T363746: Differentiated classification within the same domain from Inbox to Future on the Citation-Watchlist board.
Oct 24 2024, 8:59 PM · Citation-Watchlist
Harej moved T363751: Concept of domain aliases in Citation Watchlist from Inbox to Future on the Citation-Watchlist board.
Oct 24 2024, 8:59 PM · Citation-Watchlist
Harej moved T363871: Classify specific paths within a domain as unreliable from Inbox to Future on the Citation-Watchlist board.
Oct 24 2024, 8:58 PM · Citation-Watchlist
Harej moved T365657: Feature: Promote "inspect" indicators to "caution" or "warn" for individual lists, as a user preference from Inbox to Future on the Citation-Watchlist board.
Oct 24 2024, 8:58 PM · Citation-Watchlist
Harej added a comment to T363380: Resilience against watchlist javascript.

As part of solving this, we can store revision comparison data in browser cache. Given a combination of revisions, the URLs added are stored, and then screened against whatever copy of the list is loaded.

Oct 24 2024, 8:57 PM · Citation-Watchlist
Harej renamed T363388: Deploy Citation Watchlist as (global) gadget from "Deploy Citation Watchlist as global gadget" to "Deploy Citation Watchlist as (global) gadget".
Oct 24 2024, 8:55 PM · Citation-Watchlist
Harej closed T363413: Domain on caution list triggers in some circumstances but not others as Resolved.
Oct 24 2024, 8:54 PM · Citation-Watchlist
Harej closed T363869: "Walking on Broken Glass" revision does not trigger en.wikipedia.org filter as Resolved.
Oct 24 2024, 8:50 PM · Citation-Watchlist
Harej closed T363873: Rosabel Watson false positive as Resolved.
Oct 24 2024, 8:47 PM · Citation-Watchlist
Harej closed T363874: List of Pixar films false positive as Resolved.
Oct 24 2024, 8:45 PM · Citation-Watchlist
Harej closed T364335: Citation Watchlist not triggering on addition of www.crunchbase.com as Resolved.
Oct 24 2024, 8:43 PM · Citation-Watchlist
Harej closed T364337: Citation Watchlist not triggering in Tables as Resolved.
Oct 24 2024, 8:43 PM · Citation-Watchlist
Harej closed T364914: Citation Watchlist does not pick up addition of flagged URL when URL is changed in place as Resolved.
Oct 24 2024, 8:41 PM · Citation-Watchlist
Harej closed T364916: Citation Watchlist not triggering on very large diffs as Invalid.
Oct 24 2024, 8:37 PM · Citation-Watchlist
Harej added a comment to T365186: Miscellaneous Edge Case Diffs for Triage.

Still an issue as of 1.11

Oct 24 2024, 8:35 PM · Citation-Watchlist
Harej closed T372215: Citation watchlist has intermittent false positives on history tabs as Resolved.
Oct 24 2024, 8:32 PM · Citation-Watchlist
Harej added a comment to T372215: Citation watchlist has intermittent false positives on history tabs.

This should be resolved with the post-Wikimania rewrite.

Oct 24 2024, 8:32 PM · Citation-Watchlist
Harej closed T365652: Double-indication: "Warn and inspect" for dailymail.co.uk as Resolved.
Oct 24 2024, 8:30 PM · Citation-Watchlist
Harej added a comment to T377655: Request creation of wikiqlever VPS project.

I have two servers with 384 GB of RAM each, plus a workstation with 256 GB of RAM that can be used for testing. (I have tested QLever successfully on it before.) One of the servers has a bit over 2 TB free; the other has over 7 TB free. (Capacity can be added with funding.) I think either or both could be used to run QLever. What I would want to know is:

Oct 24 2024, 4:49 AM · Data-Platform-SRE (2024.11.09 - 2024.11.29), Wikidata, Wikidata-Query-Service, Cloud-VPS (Project-requests)

Oct 11 2024

Harej closed T371729: Overview of outstanding tasks for Citation Watchlist as Resolved.
Oct 11 2024, 4:16 PM · Citation-Watchlist
Harej placed T317580: Re-deploy InternetArchiveBot to Portuguese Wikipedia [ptwiki] up for grabs.
Oct 11 2024, 3:35 PM · InternetArchiveBot, User-Harej
Harej placed T317246: Re-deploy InternetArchiveBot to Japanese Wikipedia [jawiki] up for grabs.
Oct 11 2024, 3:35 PM · IABot-Priority-2022, InternetArchiveBot, User-Harej
Harej placed T317262: Deploy InternetArchiveBot to Asturian Wikipedia [astwiki] up for grabs.
Oct 11 2024, 3:35 PM · User-Harej, InternetArchiveBot
Harej placed T317261: Deploy InternetArchiveBot to Alemannisch Wikipedia [alswiki] up for grabs.
Oct 11 2024, 3:35 PM · User-Harej, InternetArchiveBot
Harej placed T319887: Bring back `mdwiki` tool from the demise of GridEngine (transition to Toolforge Kubernetes) up for grabs.
Oct 11 2024, 3:32 PM · Tools, Wikimedia-Medicine
Harej added a comment to T363239: Two-diff recent changes entry has warning for nypost.com even though that URL does not appear in article.

No longer occurs in latest version of Citation Watchlist.

Oct 11 2024, 3:30 PM · User-Harej, Citation-Watchlist
Harej closed T363239: Two-diff recent changes entry has warning for nypost.com even though that URL does not appear in article as Resolved.
Oct 11 2024, 3:30 PM · User-Harej, Citation-Watchlist
Harej added a comment to T369392: Batch requests together.

The alternative strategy of *not* batching requests actually gets better results.

Oct 11 2024, 3:27 PM · User-Harej, Citation-Watchlist
Harej closed T369392: Batch requests together as Declined.
Oct 11 2024, 3:27 PM · User-Harej, Citation-Watchlist

Sep 19 2024

Harej added a comment to T372920: API endpoint for URLs added within a diff.

What I have learned, much to my delight, is that this API endpoint exists... for current revisions.

Sep 19 2024, 9:02 PM · MediaWiki-REST-API, MediaWiki-Action-API

Aug 20 2024

Harej created T372920: API endpoint for URLs added within a diff.
Aug 20 2024, 5:07 PM · MediaWiki-REST-API, MediaWiki-Action-API

Aug 13 2024

Harej added a comment to T50940: Punctuation like ".", "?" and "!" at the end of page title in links not interpreted as part of the URL by various applications.

This task and T40265 should be merged since they seem to describe the same underlying problem, that if a character is treated as punctuation, it is not treated as part of the URL.

Aug 13 2024, 5:58 PM · Patch-For-Review, MediaWiki-Email

Aug 3 2024

Harej added a comment to T371561: bug: cannot save python implementation with regex.

Page seems to save fine after that, though now I can't test my function because doing so requires multiline input (that's a different problem).

Aug 3 2024, 8:51 PM · Abstract Wikipedia team (25Q1 (Jul–Sep))

Aug 2 2024

Harej created T371729: Overview of outstanding tasks for Citation Watchlist.
Aug 2 2024, 8:45 PM · Citation-Watchlist

Aug 1 2024

Harej added a comment to T371561: bug: cannot save python implementation with regex.

You can also reproduce it with this code:

import re
Aug 1 2024, 4:25 AM · Abstract Wikipedia team (25Q1 (Jul–Sep))

Jul 24 2024

Harej added a comment to T369392: Batch requests together.

Also: doing the PSL and the List of Lists in one combined request

Jul 24 2024, 3:40 AM · User-Harej, Citation-Watchlist

Jul 7 2024

Harej added a comment to T369392: Batch requests together.

The last frontier here, I think, is batching requests for first revision ID, which is done in the event that a diff link is not available but a hist link is.

Jul 7 2024, 3:25 AM · User-Harej, Citation-Watchlist

Jul 5 2024

Harej claimed T369392: Batch requests together.
Jul 5 2024, 7:55 PM · User-Harej, Citation-Watchlist
Harej created T369392: Batch requests together.
Jul 5 2024, 7:42 PM · User-Harej, Citation-Watchlist
Harej moved T363921: Async watchlist entry processing from Current to Review on the Citation-Watchlist board.
Jul 5 2024, 7:39 PM · Citation-Watchlist
Harej moved T366396: Replace home-grown differ with REST API call from Current to Review on the Citation-Watchlist board.
Jul 5 2024, 7:18 PM · Citation-Watchlist
Harej moved T364918: Adapt Citation Watchlist to work on Page Histories as well from Current to Review on the Citation-Watchlist board.
Jul 5 2024, 7:18 PM · Citation-Watchlist
Harej moved T363921: Async watchlist entry processing from Pending Deployment to Current on the Citation-Watchlist board.
Jul 5 2024, 7:18 PM · Citation-Watchlist

Jun 25 2024

Harej added a comment to T368232: cloud-vps Trove instance 'wikicitations' shows host 'none'.

I'm not sure the database was ever in active use. In any case, I would want to rebuild it anyway, so I don't think there's any harm in deleting this particular one.

Jun 25 2024, 3:43 PM · cloud-services-team, Cloud-VPS, WikiCite

Jun 20 2024

Harej added a comment to T366815: Collaboration hub pages display JSON instead of the collaboration hub.

But yes, this probably should be archived, as the fact that nobody noticed it was broken for years is telling.

Jun 20 2024, 5:22 PM · MediaWiki-extensions-CollaborationKit

Jun 16 2024

Harej created T367664: Cbignore ignored on tl wikipedia.
Jun 16 2024, 5:37 PM · InternetArchiveBot