Page MenuHomePhabricator

nshahquinn-wmf (Neil Shah-Quinn)
senior data scientist, Movement Insights, Wikimedia Foundation

Today

  • No visible events.

Tomorrow

  • No visible events.

Thursday

  • No visible events.

User Details

User Since
Apr 16 2015, 4:17 PM (573 w, 4 d)
Availability
Available
LDAP User
Neil Shah-Quinn (WMF)
MediaWiki User
Neil Shah-Quinn (WMF) [ Global Accounts ]

Recent Activity

Yesterday

nshahquinn-wmf updated the task description for T423067: Conda-Analytics environments are prone to dependency conflicts and installation errors.
Mon, Apr 13, 2:39 AM · Data-Platform-SRE, Data-Engineering
nshahquinn-wmf edited projects for T423052: Conda-Analytics environments stuck with very outdated packages, added: Data-Engineering, Data-Platform-SRE; removed Movement-Insights.
Mon, Apr 13, 2:30 AM · Data-Platform-SRE, Data-Engineering
nshahquinn-wmf renamed T423052: Conda-Analytics environments stuck with very outdated packages from Conda-Analytics environments are brittle and stuck with very outdated packages to Conda-Analytics environments stuck with very outdated packages.
Mon, Apr 13, 2:30 AM · Data-Platform-SRE, Data-Engineering
nshahquinn-wmf updated the task description for T423052: Conda-Analytics environments stuck with very outdated packages.
Mon, Apr 13, 2:29 AM · Data-Platform-SRE, Data-Engineering
nshahquinn-wmf created T423067: Conda-Analytics environments are prone to dependency conflicts and installation errors.
Mon, Apr 13, 2:29 AM · Data-Platform-SRE, Data-Engineering

Sun, Apr 12

nshahquinn-wmf renamed T423052: Conda-Analytics environments stuck with very outdated packages from Conda-Analytics environments are brittle and stuck with outdated packages to Conda-Analytics environments are brittle and stuck with very outdated packages.
Sun, Apr 12, 6:21 PM · Data-Platform-SRE, Data-Engineering
nshahquinn-wmf created T423052: Conda-Analytics environments stuck with very outdated packages.
Sun, Apr 12, 6:20 PM · Data-Platform-SRE, Data-Engineering
nshahquinn-wmf added a project to T422732: Add user_central_id to wmf_contributors.editor_month table: Movement-Metrics.

This is definitely a good idea, but although it's not that much work, it's big enough that we should wait for proper prioritization before doing it.

Sun, Apr 12, 4:57 PM · Movement-Metrics, Movement-Insights

Sat, Apr 11

nshahquinn-wmf closed T324025: Improve docs around JupyterLab and conda-analytics as Resolved.

These suggestions are moot (because anaconda-wmf environments are long gone), and any case I think ended up implementing most of them myself.

Sat, Apr 11, 2:37 AM · Data-Engineering-Icebox, Data-Engineering, Data Pipelines

Mon, Apr 6

nshahquinn-wmf added a comment to T414996: [MI 3] Monitor and investigate movement trends.

Last week, I:

Mon, Apr 6, 9:19 PM · Movement-Insights (FY25-26 H2), Epic

Tue, Mar 31

nshahquinn-wmf added a comment to T412655: Sudden traffic increase on 1 November 2025.

Sharing what I've heard from @GGoncalves-WMF and others working on this: we have devised a new bot detection rule that captures most or all of this spike. In addition to applying it to new data from the start of April, we have decided to retroactively apply it December–March data: T421735 (we can't apply it to November as we no longer have the source data for that period).

Tue, Mar 31, 6:26 AM · Data-Engineering, Data-Engineering-Wikistats, Pageviews-Anomaly

Mon, Mar 23

nshahquinn-wmf added a comment to T420996: Load Google Search Console data into the Data Lake.

While this would be very useful for Movement-Insights, from our perspective it's not top priority (unlike, for example, T418032).

Mon, Mar 23, 8:03 PM · Data-Engineering
nshahquinn-wmf created T420996: Load Google Search Console data into the Data Lake.
Mon, Mar 23, 7:56 PM · Data-Engineering
nshahquinn-wmf updated the task description for T418574: Migrate Content Translation metrics from public Superset (wmcloud) to internal instance.
Mon, Mar 23, 6:46 PM · LPL Analytics
nshahquinn-wmf updated the task description for T414996: [MI 3] Monitor and investigate movement trends.
Mon, Mar 23, 6:43 PM · Movement-Insights (FY25-26 H2), Epic
nshahquinn-wmf added a comment to T414996: [MI 3] Monitor and investigate movement trends.

Last week, I:

  • Drafted the trends brief
  • Improved Google Search monitoring notebook so that it automatically fetches latest data from BigQuery
  • Came up with proposed priorities in preparation for Monday prioritization meeting
  • Started learning about Dbt
Mon, Mar 23, 5:12 AM · Movement-Insights (FY25-26 H2), Epic

Sun, Mar 22

nshahquinn-wmf merged T413560: Invalid language name for nrm in LocalNamesEn.php into T25216: Move the Nourmande Wikipedia from nrm to nrf.
Sun, Mar 22, 11:28 PM · Wiki-Setup (Rename), Wikimedia-Language-setup
nshahquinn-wmf merged task T413560: Invalid language name for nrm in LocalNamesEn.php into T25216: Move the Nourmande Wikipedia from nrm to nrf.
Sun, Mar 22, 11:28 PM · MediaWiki-extensions-CLDR, Language codes
nshahquinn-wmf created T420864: Decide and set appropriate languages for multilingual wikis.
Sun, Mar 22, 11:24 PM · Analytics-Canonical-Data

Mar 14 2026

nshahquinn-wmf triaged T420126: Update canonical wiki dataset generation based on changes in language name data as High priority.

High priority because, once the refactor finishes, we will be blocked from adding new wikis to the dataset because the generation script will fail.

Mar 14 2026, 9:59 PM · Movement-Insights (FY25-26 H2), Analytics-Canonical-Data
nshahquinn-wmf lowered the priority of T336999: Create a structured list of Wikimedia projects' creation and closure dates from Medium to Low.
Mar 14 2026, 9:57 PM · Analytics-Canonical-Data, Product-Analytics, Movement-Insights
nshahquinn-wmf created T420126: Update canonical wiki dataset generation based on changes in language name data.
Mar 14 2026, 9:55 PM · Movement-Insights (FY25-26 H2), Analytics-Canonical-Data

Mar 10 2026

nshahquinn-wmf added a project to T419459: log id missing from mediawiki_private_cu_log?: CheckUser.
Mar 10 2026, 5:20 PM · OKR-Work, Movement-Insights, Product Safety and Integrity, CheckUser

Mar 9 2026

nshahquinn-wmf added a comment to T419459: log id missing from mediawiki_private_cu_log?.

Moving over from the conversation on Slack:

Mar 9 2026, 11:28 PM · OKR-Work, Movement-Insights, Product Safety and Integrity, CheckUser

Mar 8 2026

nshahquinn-wmf added a comment to T414996: [MI 3] Monitor and investigate movement trends.

This week, I:

  • Did the weekly monitoring
  • Explored and discussed infrastructure needs for this work (e.g. incremental mediawiki_history, general availability of Dbt)
  • Worked on getting service account access to exported search console data in Google Cloud (via round after round of getting permissions and then finding that they weren’t correctly applied or were insufficient)
Mar 8 2026, 2:49 AM · Movement-Insights (FY25-26 H2), Epic

Mar 6 2026

nshahquinn-wmf closed T415817: Add CI checking that the data protection information in the canonical country dataset matches the source as Declined.

Folding this into T419304.

Mar 6 2026, 11:20 PM · Analytics-Canonical-Data, Movement-Insights

Feb 28 2026

nshahquinn-wmf updated the task description for T417496: Conda-Analytics pinned file does not constrain Pip installations.
Feb 28 2026, 10:36 PM · Data-Platform-SRE (2026-03-27 - 2026-04-17)

Feb 27 2026

nshahquinn-wmf closed T369202: Wiki Comparison | update, maintain, improve as Declined.

I don't see a reason to have a permanent tracking task for the wiki comparison tool; if there was enough work that we needed to tie it together, a tag would be a better choice. Free free to reopen if you disagree!

Feb 27 2026, 6:32 PM · Movement-Insights (FY25-26 H2), Epic

Feb 23 2026

nshahquinn-wmf added a comment to T414996: [MI 3] Monitor and investigate movement trends.

This week, I:

  • Paired with Maya to do the weekly monitoring
  • Clarified leadership requirements around distribution of the fortnightly trends brief and the continuation of the monthly metrics report
  • Made visual improvements to the Google Search Console monitoring
  • Set up a basic monitoring notebook for daily editor counts
  • Made a plan for fetching Google Search Console data through an API rather than through tedious manual data exports
Feb 23 2026, 7:06 AM · Movement-Insights (FY25-26 H2), Epic

Feb 21 2026

nshahquinn-wmf created T418035: Add useful wiki information from the MediaWiki project-namespace map to the canonical wiki dataset.
Feb 21 2026, 3:44 AM · Analytics-Canonical-Data
nshahquinn-wmf added a comment to T405960: Fetch base wiki data from SiteMatrix or similar.

The script that updates the MediaWiki project-namespace map uses the SiteMatrix API.

Feb 21 2026, 3:27 AM · Analytics-Canonical-Data
nshahquinn-wmf updated the task description for T405960: Fetch base wiki data from SiteMatrix or similar.
Feb 21 2026, 3:25 AM · Analytics-Canonical-Data

Feb 19 2026

nshahquinn-wmf added a comment to T21044: Document LanguageConverter.

@Bewfip the syntax for unidirectional and bidirectional conversion are documented on the advanced syntax sub-page. Is that what you're referring to?

Feb 19 2026, 1:27 AM · Chinese-Sites, Documentation, MediaWiki-Language-converter

Feb 16 2026

nshahquinn-wmf added a comment to T414996: [MI 3] Monitor and investigate movement trends.

Last week, I:

  • Drafted the trends brief
  • Discussed needs and concerns around monthly reporting with Maya and Sam
  • Improved the Google Search Console data workflow and monitoring notebook
  • Documented monitoring notebooks and walked Maya through them
  • Helped Maya get dependencies for the monitoring notebooks installed, dealing with a bunch of stubborn environment issues in the process
Feb 16 2026, 11:06 PM · Movement-Insights (FY25-26 H2), Epic
nshahquinn-wmf updated the task description for T417496: Conda-Analytics pinned file does not constrain Pip installations.
Feb 16 2026, 10:35 PM · Data-Platform-SRE (2026-03-27 - 2026-04-17)

Feb 14 2026

nshahquinn-wmf updated the task description for T417496: Conda-Analytics pinned file does not constrain Pip installations.
Feb 14 2026, 11:21 PM · Data-Platform-SRE (2026-03-27 - 2026-04-17)
nshahquinn-wmf created T417496: Conda-Analytics pinned file does not constrain Pip installations.
Feb 14 2026, 11:14 PM · Data-Platform-SRE (2026-03-27 - 2026-04-17)

Feb 13 2026

nshahquinn-wmf closed T21044: Document LanguageConverter, a subtask of T43716: [EPIC] Support language variant conversion in Parsoid, as Resolved.
Feb 13 2026, 6:11 PM · Community-Wishlist, Page Content Service, Wikipedia-Android-App-Backlog, Wikipedia-iOS-App-Backlog, Parsoid-Read-Views (Language Converter Support), Parsoid-Rendering, affects-Kiwix-and-openZIM, Parsoid, Epic, MediaWiki-Language-converter, Chinese-Sites, I18n
nshahquinn-wmf closed T21044: Document LanguageConverter as Resolved.

Great work, @Diskdance! I think you've completed this 17-year-old task 🎉

Feb 13 2026, 6:10 PM · Chinese-Sites, Documentation, MediaWiki-Language-converter
nshahquinn-wmf added a comment to T416963: GitLab Private Repository Request for: Movement Insights trend monitoring.
Feb 13 2026, 6:02 PM · Essential-Work, Release-Engineering-Team, User-brennen, Movement-Insights, GitLab

Feb 10 2026

nshahquinn-wmf added a comment to T414996: [MI 3] Monitor and investigate movement trends.

Last week, I:

Feb 10 2026, 2:12 AM · Movement-Insights (FY25-26 H2), Epic
nshahquinn-wmf created T416963: GitLab Private Repository Request for: Movement Insights trend monitoring.
Feb 10 2026, 1:50 AM · Essential-Work, Release-Engineering-Team, User-brennen, Movement-Insights, GitLab
nshahquinn-wmf closed T365387: Issues in the dumps → mediawiki wikitext history → content gap metrics pipeline can significantly delay the movement metrics report as Resolved.

mediawiki_wikitext_history doesn't exist anymore. Should we close this?

Feb 10 2026, 12:38 AM · Movement-Insights (FY25-26 H2), Epic, Movement-Metrics

Feb 9 2026

nshahquinn-wmf added a comment to T354733: Create Conda Analytics environment including spark version 3.5.3.

Upgrading to Spark 3.5 should allow us to remove the version specs and pins for:

  • Pandas (T370705, T370707)
  • Numpy (T370710)
  • PyArrow (since I believe that its pin is for the current Pandas version)
Feb 9 2026, 7:17 PM · Data-Platform-SRE

Feb 6 2026

nshahquinn-wmf reopened T391730: Pilot page view forecasts with Fundraising teams, a subtask of T369325: [MI 3] Investigate trends in movement metrics, as Open.
Feb 6 2026, 7:19 PM · Movement-Insights (FY25-26 H1), Epic
nshahquinn-wmf reopened T391730: Pilot page view forecasts with Fundraising teams as "Open".

It seems pretty clear that we're not going to continue this, but I do want to close the loop properly with Fundraising, so I'm going to reopen to track that work.

Feb 6 2026, 7:19 PM · Movement-Insights (FY25-26 H2)

Feb 3 2026

nshahquinn-wmf added a comment to T414996: [MI 3] Monitor and investigate movement trends.

Last week, I:

  • Drafted and shared first “official” trends brief
  • Responded to follow-up questions on brief
  • Discussed plans with Sam NeSmith
Feb 3 2026, 2:38 AM · Movement-Insights (FY25-26 H2), Epic

Feb 2 2026

nshahquinn-wmf added a comment to T21044: Document LanguageConverter.

For everyone's information, I've created https://www.mediawiki.org/wiki/User:Diskdance/Overview_of_Language_Converter as a general overview document of LC.

Feb 2 2026, 11:10 PM · Chinese-Sites, Documentation, MediaWiki-Language-converter

Jan 29 2026

nshahquinn-wmf added a comment to T415817: Add CI checking that the data protection information in the canonical country dataset matches the source.

We discussed this at a team meeting today and decided there might be better ways to approach this, so we'll hold on this until we've had a chance to discuss with the full set of stakeholders in the process.

Jan 29 2026, 12:09 AM · Analytics-Canonical-Data, Movement-Insights
nshahquinn-wmf raised the priority of T415817: Add CI checking that the data protection information in the canonical country dataset matches the source from High to Needs Triage.
Jan 29 2026, 12:01 AM · Analytics-Canonical-Data, Movement-Insights

Jan 28 2026

nshahquinn-wmf created T415817: Add CI checking that the data protection information in the canonical country dataset matches the source.
Jan 28 2026, 6:37 PM · Analytics-Canonical-Data, Movement-Insights

Jan 27 2026

nshahquinn-wmf added a comment to T414996: [MI 3] Monitor and investigate movement trends.

Last week, I:

  • Continued discussions about reporting venue, audience, and frequency
  • Worked through several different Superset issues
  • Dug into new Comscore data and compared it with our unique device data
  • Answered lots of data questions from Suman
Jan 27 2026, 1:05 AM · Movement-Insights (FY25-26 H2), Epic
nshahquinn-wmf moved T414996: [MI 3] Monitor and investigate movement trends from Backlog to Epics on the Movement-Insights (FY25-26 H2) board.
Jan 27 2026, 1:04 AM · Movement-Insights (FY25-26 H2), Epic
nshahquinn-wmf edited projects for T414996: [MI 3] Monitor and investigate movement trends, added: Movement-Insights (FY25-26 H2); removed Movement-Insights.
Jan 27 2026, 1:04 AM · Movement-Insights (FY25-26 H2), Epic
nshahquinn-wmf created Movement-Insights (FY25-26 H2).
Jan 27 2026, 1:03 AM

Jan 19 2026

nshahquinn-wmf added a comment to T414996: [MI 3] Monitor and investigate movement trends.

Last week, I:

  • Worked on the monitoring runbook and dashboard
  • Discussed the trends catalog with Maya
  • Dug back into data on the fundraising issues
  • Produced updated charts of traffic from Google
Jan 19 2026, 11:51 PM · Movement-Insights (FY25-26 H2), Epic
nshahquinn-wmf created T414996: [MI 3] Monitor and investigate movement trends.
Jan 19 2026, 11:23 PM · Movement-Insights (FY25-26 H2), Epic
nshahquinn-wmf closed T369325: [MI 3] Investigate trends in movement metrics as Resolved.
Jan 19 2026, 11:21 PM · Movement-Insights (FY25-26 H1), Epic
nshahquinn-wmf closed T403352: Create a backlog of at least 3 well-defined investigation ideas, a subtask of T369325: [MI 3] Investigate trends in movement metrics, as Declined.
Jan 19 2026, 11:21 PM · Movement-Insights (FY25-26 H1), Epic
nshahquinn-wmf closed T403352: Create a backlog of at least 3 well-defined investigation ideas as Declined.
Jan 19 2026, 11:21 PM · Movement-Insights (FY25-26 H1)

Jan 15 2026

nshahquinn-wmf updated subscribers of T336999: Create a structured list of Wikimedia projects' creation and closure dates.

In the WMF Slack, @Michael just asked about getting wiki creation dates for some newer wikis to understand why they're missing a particular table (T414600).

Jan 15 2026, 1:05 AM · Analytics-Canonical-Data, Product-Analytics, Movement-Insights

Jan 13 2026

nshahquinn-wmf added a comment to T336999: Create a structured list of Wikimedia projects' creation and closure dates.

There's actually a wikiBirthday maintenance script that uses the "timestamp of minimum rev ID" method which I found to be the best option despite it being mostly wrong for UseModWiki-first wikis.

Jan 13 2026, 7:25 PM · Analytics-Canonical-Data, Product-Analytics, Movement-Insights

Jan 10 2026

nshahquinn-wmf closed T411813: November 2025 Wikimedia Movement Metrics as Resolved.
Jan 10 2026, 7:08 PM · Movement-Insights (FY25-26 H1)

Dec 23 2025

nshahquinn-wmf closed T403437: [SDS 1.4.1] Synthesize what we know about trends in our traffic, a subtask of T369325: [MI 3] Investigate trends in movement metrics, as Resolved.
Dec 23 2025, 3:49 PM · Movement-Insights (FY25-26 H1), Epic
nshahquinn-wmf closed T403437: [SDS 1.4.1] Synthesize what we know about trends in our traffic as Resolved.
Dec 23 2025, 3:49 PM · Epic, Movement-Insights (FY25-26 H1)

Dec 16 2025

nshahquinn-wmf updated the task description for T412775: Fundraising access request for Neil Shah-Quinn.
Dec 16 2025, 3:44 AM · fundraising-tech-ops
nshahquinn-wmf created T412775: Fundraising access request for Neil Shah-Quinn.
Dec 16 2025, 3:42 AM · fundraising-tech-ops

Dec 10 2025

nshahquinn-wmf updated the task description for T370710: Upgrade to Numpy ≥ 1.24 in Conda-Analytics.
Dec 10 2025, 6:36 PM · Data-Platform-SRE

Dec 8 2025

nshahquinn-wmf claimed T411813: November 2025 Wikimedia Movement Metrics.
Dec 8 2025, 6:07 PM · Movement-Insights (FY25-26 H1)

Dec 4 2025

nshahquinn-wmf updated the task description for T401692: EPIC: Migrate Data Platform SRE-owned hosts to Bookworm or later.
Dec 4 2025, 12:59 AM · Data-Platform-SRE (2026-03-27 - 2026-04-17), Epic
nshahquinn-wmf added a comment to T378253: Support creating a Spark session with a GitLab-built Conda environment.

@fkaelin has prepared an MR implementing this, so I should review it relatively soon.

Dec 4 2025, 12:50 AM · Movement-Insights (FY25-26 H2), Data-Engineering-Radar, Data-Engineering, Wmfdata-Python
nshahquinn-wmf moved T378253: Support creating a Spark session with a GitLab-built Conda environment from Backlog to Waiting on others on the Movement-Insights (FY25-26 H1) board.
Dec 4 2025, 12:27 AM · Movement-Insights (FY25-26 H2), Data-Engineering-Radar, Data-Engineering, Wmfdata-Python

Dec 2 2025

nshahquinn-wmf closed T402990: Update Wmfdata developer documentation to correctly cover working with UV as Resolved.
Dec 2 2025, 7:31 PM · Data-Engineering, Movement-Insights (FY25-26 H1), Wmfdata-Python
nshahquinn-wmf moved T402990: Update Wmfdata developer documentation to correctly cover working with UV from Backlog to Doing on the Movement-Insights (FY25-26 H1) board.
Dec 2 2025, 1:10 AM · Data-Engineering, Movement-Insights (FY25-26 H1), Wmfdata-Python

Dec 1 2025

nshahquinn-wmf added a comment to T369325: [MI 3] Investigate trends in movement metrics.

Last week, I:

  • Investigated the correlation between referral traffic from Google and from other external referrers
  • Finished slides for the board meeting presentation

Barring any last minute requests for changes to the slides, this work is done and the hypothesis will be closed shortly.

Dec 1 2025, 9:13 PM · Movement-Insights (FY25-26 H1), Epic
nshahquinn-wmf added a comment to T403437: [SDS 1.4.1] Synthesize what we know about trends in our traffic.

Last week, I:

  • Investigated the correlation between referral traffic from Google and from other external referrers
  • Finished slides for the board meeting presentation
Dec 1 2025, 9:05 PM · Epic, Movement-Insights (FY25-26 H1)

Nov 24 2025

nshahquinn-wmf added a comment to T369325: [MI 3] Investigate trends in movement metrics.

Last week, I:

  • Wrapped up small wiki investigation and chose proposed example wikis
  • Started work on slides for the board meeting presentation
Nov 24 2025, 5:53 AM · Movement-Insights (FY25-26 H1), Epic
nshahquinn-wmf added a comment to T403437: [SDS 1.4.1] Synthesize what we know about trends in our traffic.

Last week, I:

  • Wrapped up small wiki investigation and chose proposed example wikis
  • Started work on slides for the board meeting presentation
Nov 24 2025, 5:53 AM · Epic, Movement-Insights (FY25-26 H1)

Nov 22 2025

nshahquinn-wmf updated the task description for T388291: Add Iceberg version of canonical data tables.
Nov 22 2025, 7:56 PM · Analytics-Canonical-Data, Movement-Insights

Nov 19 2025

nshahquinn-wmf added a comment to T221482: Identify imported revisions in mediawiki_history.

One additional point I've thought of: if you look in MediaWiki history and find that a group of revisions have the same exact author, timestamp, and page title, this is (almost?) certainly a group with one original and the remainder imported duplicates of that original.

Nov 19 2025, 2:47 AM · Data-Engineering-Icebox, Data-Engineering, Product-Analytics
nshahquinn-wmf added a comment to T336999: Create a structured list of Wikimedia projects' creation and closure dates.

I just worked on a Wikipedia 25-related request from the WMF Communications department for:

  • The monthly article count for each Wikipedia during its history
  • The creation data for each Wikipedia
  • The first article created at each Wikipedia
Nov 19 2025, 2:34 AM · Analytics-Canonical-Data, Product-Analytics, Movement-Insights
nshahquinn-wmf closed T278467: Use Hive/Spark timestamps in Refined event data as Declined.

Agreed that the Hive tables can stay as they are, and the new Iceberg tables can do proper DATEs and TIMESTAMPs. When inserting into Iceberg, we can cast accordingly. See T335305 for an example conversion from year, month, day INTs to a DATE.

Nov 19 2025, 12:45 AM · Data-Engineering-Icebox, Data-Engineering, User-Iflorez, Product-Analytics
nshahquinn-wmf moved T403437: [SDS 1.4.1] Synthesize what we know about trends in our traffic from Doing to Needs sign-off on the Movement-Insights (FY25-26 H1) board.
Nov 19 2025, 12:04 AM · Epic, Movement-Insights (FY25-26 H1)

Nov 7 2025

nshahquinn-wmf added a comment to T339291: Should temp users be counted as registered & active users on Special:Statistics?.

@Niharika what kind of input are you looking for?

Nov 7 2025, 10:46 PM · Temporary accounts, Product Safety and Integrity, OKR-Work, MediaWiki-User-management, MediaWiki-Special-pages

Nov 4 2025

nshahquinn-wmf added a comment to T406531: NEWFEATURE REQUEST: Add new referral sources to pageview data.

@JAllemandou absolutely, I think the rule improvements are in great shape! By specifying TLDs for each search engine, you have already gone well above and beyond the requirements 😊

Nov 4 2025, 9:27 PM · Data-Engineering (Q2 FY25/26 October 1st - December 31th), Patch-For-Review, Essential-Work, Movement-Insights (FY25-26 H1), Data-Platform

Nov 3 2025

nshahquinn-wmf added a comment to T406531: NEWFEATURE REQUEST: Add new referral sources to pageview data.

I have copied some values where referer has changed grouped by referer-domain and ordered by number of hists desc here (note: this list doesn't include IP-referer that have now been categorized as unknown instead of external).

Nov 3 2025, 6:10 PM · Data-Engineering (Q2 FY25/26 October 1st - December 31th), Patch-For-Review, Essential-Work, Movement-Insights (FY25-26 H1), Data-Platform

Oct 25 2025

nshahquinn-wmf added a comment to T369325: [MI 3] Investigate trends in movement metrics.

Since my last update, I:

  • Produced lots more visuals and analysis
  • I'm on track to have this analysis and visualization work largely completed by the end of the day Tue, 28 Oct (since I'll be on vacation Wed-Fri)
Oct 25 2025, 3:30 AM · Movement-Insights (FY25-26 H1), Epic
nshahquinn-wmf added a comment to T403437: [SDS 1.4.1] Synthesize what we know about trends in our traffic.

Since my last update, I:

  • Produced lots more visuals and analysis
  • I'm on track to have this analysis and visualization work largely completed by the end of the day Tue, 28 Oct (since I'll be on vacation Wed-Fri)
Oct 25 2025, 3:30 AM · Epic, Movement-Insights (FY25-26 H1)

Oct 19 2025

nshahquinn-wmf added a comment to T369325: [MI 3] Investigate trends in movement metrics.

Since my last update, I:

  • Did a ton of analysis and data visualization in preparation for public communications about recent declines in pageviews
  • Dug into trends in referrers
  • Tested and found support for the the hypothesis that iOS traffic declined less than Android traffic (suggesting that pageviews from people with high socio-economic status declined less)
Oct 19 2025, 10:46 PM · Movement-Insights (FY25-26 H1), Epic
nshahquinn-wmf added a comment to T403437: [SDS 1.4.1] Synthesize what we know about trends in our traffic.

Since my last update, I:

  • Did a ton of analysis and data visualization in preparation for public communications about recent declines in pageviews
  • Dug into trends in referrers
  • Tested and found support for the the hypothesis that iOS traffic declined less than Android traffic (suggesting that pageviews from people with high socio-economic status declined less)
Oct 19 2025, 10:46 PM · Epic, Movement-Insights (FY25-26 H1)

Oct 18 2025

nshahquinn-wmf added a comment to T406531: NEWFEATURE REQUEST: Add new referral sources to pageview data.

Random suggestions:

  • Consider including Kagi as a search engine
  • Consider addressing T383088 by dropping the requirement that the referrer start with "http:// or "https://"
Oct 18 2025, 12:33 AM · Data-Engineering (Q2 FY25/26 October 1st - December 31th), Patch-For-Review, Essential-Work, Movement-Insights (FY25-26 H1), Data-Platform

Oct 15 2025

nshahquinn-wmf moved T406417: Revisit the default configuration of Wmfdata Spark sessions from Incoming to FY25-26 H1 on the Movement-Insights board.
Oct 15 2025, 6:43 PM · Data-Engineering, Movement-Insights, Wmfdata-Python

Oct 4 2025

nshahquinn-wmf updated subscribers of T406417: Revisit the default configuration of Wmfdata Spark sessions.
Oct 4 2025, 10:25 PM · Data-Engineering, Movement-Insights, Wmfdata-Python
nshahquinn-wmf updated the task description for T406417: Revisit the default configuration of Wmfdata Spark sessions.
Oct 4 2025, 10:24 PM · Data-Engineering, Movement-Insights, Wmfdata-Python
nshahquinn-wmf created T406417: Revisit the default configuration of Wmfdata Spark sessions.
Oct 4 2025, 10:22 PM · Data-Engineering, Movement-Insights, Wmfdata-Python
nshahquinn-wmf added a comment to T369325: [MI 3] Investigate trends in movement metrics.

This week, I:

  • Analyzed Comscore and unique device data
    • Lots of contradictory signals, plus noise due to the ongoing traffic data backfill, which should finish by Tue, Oct 7.
  • Analyzed Google-reported clickthrough and Google-referred pageview data
Oct 4 2025, 6:47 AM · Movement-Insights (FY25-26 H1), Epic
nshahquinn-wmf renamed T403437: [SDS 1.4.1] Synthesize what we know about trends in our traffic from Synthesize what we know about trends in our traffic to [SDS 1.4.1] Synthesize what we know about trends in our traffic.
Oct 4 2025, 6:46 AM · Epic, Movement-Insights (FY25-26 H1)
nshahquinn-wmf added a comment to T403437: [SDS 1.4.1] Synthesize what we know about trends in our traffic.

This week, I:

  • Analyzed Comscore and unique device data
    • Lots of contradictory signals, plus noise due to the ongoing traffic data backfill, which should finish by Tue, Oct 7.
  • Analyzed Google-reported clickthrough and Google-referred pageview data
Oct 4 2025, 6:46 AM · Epic, Movement-Insights (FY25-26 H1)

Oct 2 2025

nshahquinn-wmf added a comment to T405533: Unique devices data uses non-standard domains for Wikidata, Wikifunctions, and MediaWiki.org.

I'm very sorry for the misunderstanding here. I made a vocabulary mistake with the French false-friend "demander". I was really not meaning that you were forcing for the change, I'll be careful when using the word "demand" in future.

Oct 2 2025, 8:27 PM · Data-Engineering (Q2 FY25/26 October 1st - December 31th), Analytics-Data-Problem
nshahquinn-wmf added a comment to T402324: Mitigate consequences of Gobblin hiccups generating late events and alerts.

speaking for our needs, I think that's totally fine! :)

Oct 2 2025, 6:17 PM · Data-Engineering (Q2 FY25/26 October 1st - December 31th)