User Details
- User Since
- Apr 16 2015, 4:17 PM (573 w, 4 d)
- Availability
- Available
- LDAP User
- Neil Shah-Quinn (WMF)
- MediaWiki User
- Neil Shah-Quinn (WMF) [ Global Accounts ]
Yesterday
Sun, Apr 12
This is definitely a good idea, but although it's not that much work, it's big enough that we should wait for proper prioritization before doing it.
Sat, Apr 11
These suggestions are moot (because anaconda-wmf environments are long gone), and any case I think ended up implementing most of them myself.
Mon, Apr 6
Last week, I:
- Prepared a second draft of the metrics funnel: conceptual diagram, detailed inventory
- Monthly metrics report
- Prepared and released the February report
- Developed and implemented a new publication workflow that prevents permissions errors and sync conflicts
- Completely overhauled and updated the instructions
Tue, Mar 31
Sharing what I've heard from @GGoncalves-WMF and others working on this: we have devised a new bot detection rule that captures most or all of this spike. In addition to applying it to new data from the start of April, we have decided to retroactively apply it December–March data: T421735 (we can't apply it to November as we no longer have the source data for that period).
Mon, Mar 23
While this would be very useful for Movement-Insights, from our perspective it's not top priority (unlike, for example, T418032).
Last week, I:
- Drafted the trends brief
- Improved Google Search monitoring notebook so that it automatically fetches latest data from BigQuery
- Came up with proposed priorities in preparation for Monday prioritization meeting
- Started learning about Dbt
Sun, Mar 22
Mar 14 2026
High priority because, once the refactor finishes, we will be blocked from adding new wikis to the dataset because the generation script will fail.
Mar 10 2026
Mar 9 2026
Moving over from the conversation on Slack:
Mar 8 2026
This week, I:
- Did the weekly monitoring
- Explored and discussed infrastructure needs for this work (e.g. incremental mediawiki_history, general availability of Dbt)
- Worked on getting service account access to exported search console data in Google Cloud (via round after round of getting permissions and then finding that they weren’t correctly applied or were insufficient)
Mar 6 2026
Folding this into T419304.
Feb 28 2026
Feb 27 2026
I don't see a reason to have a permanent tracking task for the wiki comparison tool; if there was enough work that we needed to tie it together, a tag would be a better choice. Free free to reopen if you disagree!
Feb 23 2026
This week, I:
- Paired with Maya to do the weekly monitoring
- Clarified leadership requirements around distribution of the fortnightly trends brief and the continuation of the monthly metrics report
- Made visual improvements to the Google Search Console monitoring
- Set up a basic monitoring notebook for daily editor counts
- Made a plan for fetching Google Search Console data through an API rather than through tedious manual data exports
Feb 21 2026
The script that updates the MediaWiki project-namespace map uses the SiteMatrix API.
Feb 19 2026
@Bewfip the syntax for unidirectional and bidirectional conversion are documented on the advanced syntax sub-page. Is that what you're referring to?
Feb 16 2026
Last week, I:
- Drafted the trends brief
- Discussed needs and concerns around monthly reporting with Maya and Sam
- Improved the Google Search Console data workflow and monitoring notebook
- Documented monitoring notebooks and walked Maya through them
- Helped Maya get dependencies for the monitoring notebooks installed, dealing with a bunch of stubborn environment issues in the process
Feb 14 2026
Feb 13 2026
Great work, @Diskdance! I think you've completed this 17-year-old task 🎉
Feb 10 2026
Last week, I:
- Updated and refined the traffic statistics to be shared with Google to illustrate the Big English fundraising problems
- Did weekly monitoring and noted points for the next brief
- Ran monthly metrics calculations and generated charts
- Backfilled a few emerging wiki contributor metrics that weren’t done when Wikidata and Commons were removed from the category
Feb 9 2026
Upgrading to Spark 3.5 should allow us to remove the version specs and pins for:
Feb 6 2026
It seems pretty clear that we're not going to continue this, but I do want to close the loop properly with Fundraising, so I'm going to reopen to track that work.
Feb 3 2026
Last week, I:
- Drafted and shared first “official” trends brief
- Responded to follow-up questions on brief
- Discussed plans with Sam NeSmith
Feb 2 2026
Jan 29 2026
We discussed this at a team meeting today and decided there might be better ways to approach this, so we'll hold on this until we've had a chance to discuss with the full set of stakeholders in the process.
Jan 28 2026
Jan 27 2026
Last week, I:
- Continued discussions about reporting venue, audience, and frequency
- Worked through several different Superset issues
- Dug into new Comscore data and compared it with our unique device data
- Answered lots of data questions from Suman
Jan 19 2026
Last week, I:
- Worked on the monitoring runbook and dashboard
- Discussed the trends catalog with Maya
- Dug back into data on the fundraising issues
- Produced updated charts of traffic from Google
Jan 15 2026
Jan 13 2026
There's actually a wikiBirthday maintenance script that uses the "timestamp of minimum rev ID" method which I found to be the best option despite it being mostly wrong for UseModWiki-first wikis.
Jan 10 2026
Dec 23 2025
Dec 16 2025
Dec 10 2025
Dec 8 2025
Dec 4 2025
Dec 2 2025
Dec 1 2025
Last week, I:
- Investigated the correlation between referral traffic from Google and from other external referrers
- Finished slides for the board meeting presentation
Nov 24 2025
Last week, I:
- Wrapped up small wiki investigation and chose proposed example wikis
- Started work on slides for the board meeting presentation
Nov 22 2025
Nov 19 2025
One additional point I've thought of: if you look in MediaWiki history and find that a group of revisions have the same exact author, timestamp, and page title, this is (almost?) certainly a group with one original and the remainder imported duplicates of that original.
I just worked on a Wikipedia 25-related request from the WMF Communications department for:
- The monthly article count for each Wikipedia during its history
- The creation data for each Wikipedia
- The first article created at each Wikipedia
Nov 7 2025
@Niharika what kind of input are you looking for?
Nov 4 2025
@JAllemandou absolutely, I think the rule improvements are in great shape! By specifying TLDs for each search engine, you have already gone well above and beyond the requirements 😊
Nov 3 2025
Oct 25 2025
Since my last update, I:
- Produced lots more visuals and analysis
- I'm on track to have this analysis and visualization work largely completed by the end of the day Tue, 28 Oct (since I'll be on vacation Wed-Fri)
Oct 19 2025
Since my last update, I:
- Did a ton of analysis and data visualization in preparation for public communications about recent declines in pageviews
- Dug into trends in referrers
- Tested and found support for the the hypothesis that iOS traffic declined less than Android traffic (suggesting that pageviews from people with high socio-economic status declined less)
Oct 18 2025
Random suggestions:
Oct 15 2025
Oct 4 2025
This week, I:
- Analyzed Comscore and unique device data
- Lots of contradictory signals, plus noise due to the ongoing traffic data backfill, which should finish by Tue, Oct 7.
- Analyzed Google-reported clickthrough and Google-referred pageview data