User Details
- User Since
- Apr 16 2015, 4:17 PM (556 w, 4 d)
- Availability
- Available
- LDAP User
- Neil Shah-Quinn (WMF)
- MediaWiki User
- Neil Shah-Quinn (WMF) [ Global Accounts ]
Wed, Dec 10
Mon, Dec 8
Thu, Dec 4
Tue, Dec 2
Mon, Dec 1
Last week, I:
- Investigated the correlation between referral traffic from Google and from other external referrers
- Finished slides for the board meeting presentation
Mon, Nov 24
Last week, I:
- Wrapped up small wiki investigation and chose proposed example wikis
- Started work on slides for the board meeting presentation
Sat, Nov 22
Wed, Nov 19
One additional point I've thought of: if you look in MediaWiki history and find that a group of revisions have the same exact author, timestamp, and page title, this is (almost?) certainly a group with one original and the remainder imported duplicates of that original.
I just worked on a Wikipedia 25-related request from the WMF Communications department for:
- The monthly article count for each Wikipedia during its history
- The creation data for each Wikipedia
- The first article created at each Wikipedia
Nov 7 2025
@Niharika what kind of input are you looking for?
Nov 4 2025
@JAllemandou absolutely, I think the rule improvements are in great shape! By specifying TLDs for each search engine, you have already gone well above and beyond the requirements 😊
Nov 3 2025
Oct 25 2025
Since my last update, I:
- Produced lots more visuals and analysis
- I'm on track to have this analysis and visualization work largely completed by the end of the day Tue, 28 Oct (since I'll be on vacation Wed-Fri)
Oct 19 2025
Since my last update, I:
- Did a ton of analysis and data visualization in preparation for public communications about recent declines in pageviews
- Dug into trends in referrers
- Tested and found support for the the hypothesis that iOS traffic declined less than Android traffic (suggesting that pageviews from people with high socio-economic status declined less)
Oct 18 2025
Random suggestions:
Oct 15 2025
Oct 4 2025
This week, I:
- Analyzed Comscore and unique device data
- Lots of contradictory signals, plus noise due to the ongoing traffic data backfill, which should finish by Tue, Oct 7.
- Analyzed Google-reported clickthrough and Google-referred pageview data
Oct 2 2025
Sep 29 2025
Since my last update, I:
- Came up with a shortlist of instrumentation data streams I can use
- Started analysis template for Google Search Console data
- Evaluated a sample of SimilarWeb traffic data shared by Nino in Comms
- Got access to Google Search Console data in BigQuery and figured out it doesn’t go back far enough to be useful
- Picked 8 wikis to focus on and got access to them all in Google Search Console
@JAllemandou I know you've already merged the task, but FWIW, I just took a look and it makes sense to me! Thank you very much for smoothing out this little bump 😁
Sep 26 2025
FYI, assuming that the May backfill finished at the end of yesterday and using @mforns's estimate that a month takes about 3.5 days to backfill, I calculate that the hourly backfill will probably finish by the end of Monday, October 6.
Sep 24 2025
@JAllemandou I filed T405533 to continue the conversation.
@JAllemandou there's actually a bug in the new data: the domains names for Wikidata, Wikifunctions, and MediaWiki.org are missing the the leading "www." (e.g. the data has "wikidata.org" rather than "www.wikidata.org"). The versions with the "www." are canonical: they are what appear in the address bar, in the SiteMatrix, in canonical data, and so on.
Sep 23 2025
Sep 18 2025
I was actually coincidentally working on search console documentation, so I've gone ahead and made the necessary updates. The internal documentation is now at a new "search consoles" Office-Wiki page, and I've updated the Wikitech page to point there.
Based on the age of the task, I'm going to guess this is no longer an issue. If it still is and ITS still recommends an escalation to SRE, this should be reopened and tagged with SRE.
@NBaca-WMF I was actually coincidentally working on search console documentation, so I've gone ahead and made the necessary updates. The internal documentation is now at a new "search consoles" Office-Wiki page, and I've updated the Wikitech page to point there.
Sep 15 2025
Last week, I:
- Continued pursuing access to SimilarWeb data and Google Search Console data in BigQuery
- Started analyzing Comscore data
- Dug through instrumentation data streams in search of one that can proxy for mobile web pageviews
- Got a private GitLab repo to store the analysis (T404533)
