Page MenuHomePhabricator

TWL citation data for Wall Street Journal and other Dow Jones Publications
Closed, ResolvedPublic5 Estimated Story Points

Description

I need the list of citation/reference links added by TWL users to the following URLs from the following collections.

https://wikipedialibrary.wmflabs.org/partners/158/

  • wsj.com
  • investors.com
  • marketwatch.com
  • barrons.com

https://wikipedialibrary.wmflabs.org/partners/159/

  • jp.wsj.com

https://wikipedialibrary.wmflabs.org/partners/160/

  • cn.wsj.com

https://wikipedialibrary.wmflabs.org/partners/161/

  • cn.wsj.com

Please share this data privately.

Event Timeline

jsn.sherman changed the task status from Open to Stalled.Sep 17 2025, 3:28 PM

Stalled on T404879

Kgraessle changed the task status from Stalled to Open.Oct 3 2025, 4:47 PM
Kgraessle subscribed.

@sjvipin we're unstalled now, but it's been a minute since we talked about this. Could you clarify exactly what is being asked for here (eg. a list of: link event records, links to edits where the citations were added, urls to cited articles)?

@jsn.sherman - We need the Link URL, page link and the Wikimedia project from 1st May 2025.

Kgraessle changed the task status from Open to Stalled.Oct 6 2025, 6:11 PM

@sjvipin
The data in wikilink should now be correct for The Wall Street Journal per T404381: Wikilink The Wall Street Journal collection not showing correct data.

The link event data from 1st May 2025 that you're requesting is no longer trivial to retrieve as we're archiving link events daily.

@Samwalton9-WMF
Let me know if we want to prioritize this request over dashboard work as this will not be a quick win.

Kgraessle changed the point value for this task from 2 to 5.
Dillon changed the task status from Stalled to In Progress.Oct 30 2025, 5:09 PM

@sjvipin Please let us know if the data provided by the new archive tool met your needs!

@jsn.sherman - No, I haven't been able to export the data. The tool seems not responsive for some reason.

I'll adjust the throttling mechanism; it sounds like I tightened things up too much.

jsn.sherman lowered the priority of this task from High to Medium.Nov 20 2025, 7:21 PM

Setting to medium priority since the data export was previously delivered to you out of band; this task is now reflecting work on the tool.

This ask has been completed, @sjvipin @jsn.sherman it's not clear to me what additional work is required here but could you create any relevant tickets?