In T194961#4221017, @LGoto wrote:Hi @mpopov Is this for you?
- Queries
- All Stories
- Search
- Advanced Search
- Transactions
- Transaction Logs
Feed Advanced Search
Advanced Search
Advanced Search
May 21 2018
May 21 2018
mpopov closed T194287: Requesting access to stat1006 for Go Fish Digital, a subtask of T193052: Provide web crawler data logs to Go Fish Digital, as Declined.
May 18 2018
May 18 2018
@EBjune @RobH: BTW @JKatzWMF and I are going to be changing which properties are tracked in GSC (namely getting rid of all but 1-2 HTTP variants in favor of HTTPS) as well as which project classes and languages are tracked. We expect to be done with this "sprint cleaning" endeavor sometime in the next two weeks so I'll post an update here when it's ready.
May 11 2018
May 11 2018
mpopov updated the task description for T184838: Requesting access to stat1004, stat1005, stat1006 for mneisler.
mpopov moved T190092: Selected languages stats on Wikipedia Android app from Next Up to Doing on the Product-Analytics board.
mpopov moved T190092: Selected languages stats on Wikipedia Android app from Backlog to Done on the Discovery-Analysis (Current work) board.
- Part 2
May 10 2018
May 10 2018
mpopov closed T173049: Investigate mobile map gadget for eventlogging, a subtask of T173283: Extra 100KB of JS loaded by mobilemaps gadget on pageview, as Declined.
Declining for now but open to re-opening in the future if the parent task is re-opened and this work is needed.
mpopov closed T173049: Investigate mobile map gadget for eventlogging, a subtask of T174538: Relaunch Geohack mobile maps gadget experiment on English Wikipedia, as Declined.
@debt: closing for now but if you think this is work that needs to be done feel free to re-open
mpopov closed T183025: Analysis: query-frequency-stratified A/B test, a subtask of T182824: [epic] Show query-frequency-stratified results in A/B test results, as Declined.
@debt: closing for now but if you think this is work that needs to be done feel free to re-open
mpopov closed T183026: Analysis: most popular abandoned queries, a subtask of T176997: Extract a set of a few hundred most popular abandoned queries, as Declined.
mpopov moved T184087: Gather basic stats on iOS and Android app sessions from Triage to Backlog on the Product-Analytics board.
mpopov moved T194287: Requesting access to stat1006 for Go Fish Digital from Triage to Tracking on the Product-Analytics board.
mpopov moved T184091: Multi-lingual use of Android app from Backlog to Next Up on the Product-Analytics board.
mpopov moved T184098: [EPIC] Analytics baseline for Android app from Triage to Backlog on the Product-Analytics board.
mpopov moved T184088: Understand the usage of individual Android app features from Triage to Backlog on the Product-Analytics board.
mpopov moved T184089: Understand Android app usage by market from Triage to Backlog on the Product-Analytics board.
mpopov moved T184091: Multi-lingual use of Android app from Triage to Backlog on the Product-Analytics board.
mpopov moved T184092: Usage of colour modes in Android app from Triage to Backlog on the Product-Analytics board.
mpopov added a project to T194287: Requesting access to stat1006 for Go Fish Digital: SRE-Access-Requests.
May 9 2018
May 9 2018
Pinging @RStallman-legalteam & @JbuattiWMF to confirm that Go Fish Digital have signed the NDAs so that Ops can proceed with adding their public SSH key to the list of allowed keys.
May 4 2018
May 4 2018
mpopov added a comment to T193694: Pull map stats to create a baseline BEFORE rapid growth of usage on Wikipedias.
In T193694#4180609, @jmatazzoni wrote:Thanks @mpopov! As you note, not all current mapframe wikipedias are accounted for in your first stats page (missing are Arabic, Bulgarian, Czech, Spanish, Kannada, Latvian, Portuguese, English).
But here is the bigger issue: we are about to release mapframe to 277 more Wikipedias—essentially all wikipedias except nine flagged revision wikis. We need to be able to track usage on these as well. What do you suggest? How should they be added in?
Will your stats page be able to scale up to measure hundreds more? Is the general "Wikipedia" figure already accounting for all wikipedias programmatically, or does it just add up the 11 you list on the page? What about the spreadsheet: what will happen if we start loading hundreds of wikis? Should we pick some representative wikis we want to measure?
@Catrope: I updated the repository with instructions: https://github.com/wikimedia-research/Discovery-Interactive-Adhoc-Usage#re-run-instructions
mpopov moved T193810: Document how to rerun map stats from Next Up to Doing on the Product-Analytics board.
mpopov moved T193810: Document how to rerun map stats from Backlog to Next Up on the Product-Analytics board.
mpopov moved T193810: Document how to rerun map stats from Triage to Backlog on the Product-Analytics board.
May 3 2018
May 3 2018
mpopov added a comment to T193694: Pull map stats to create a baseline BEFORE rapid growth of usage on Wikipedias.
Sooooo…most of this has actually already been done. We have per-wiki daily stats beginning on 2017-09-14 over at:
Restricted Application edited projects for T151929: Implement third phase of map event logging, added: Product-Analytics; removed MW-1.29-release (WMF-deploy-2016-12-13_(1.29.0-wmf.6)).
Apr 27 2018
Apr 27 2018
@Deskana: Progress update: I have 4 days of data (~12GB gzipped) and right now I have a script that's verifying ~20K IP addresses to determine which ones are legit and which ones spoofed the UA and pretended to be one of those crawlers. As you might expect, that part is taking some time.
mpopov added a parent task for T192819: Event Logging schemas for Wikipedia iOS app: T184097: iOS data wishlist items.
mpopov added a subtask for T184097: iOS data wishlist items: T192819: Event Logging schemas for Wikipedia iOS app.
mpopov removed a parent task for T184097: iOS data wishlist items: T192819: Event Logging schemas for Wikipedia iOS app.
mpopov removed a subtask for T192819: Event Logging schemas for Wikipedia iOS app: T184097: iOS data wishlist items.
mpopov edited parent tasks for T184097: iOS data wishlist items, added: T192819: Event Logging schemas for Wikipedia iOS app; removed: T184098: [EPIC] Analytics baseline for Android app.
mpopov removed a subtask for T184098: [EPIC] Analytics baseline for Android app: T184097: iOS data wishlist items.
mpopov added a subtask for T192819: Event Logging schemas for Wikipedia iOS app: T184097: iOS data wishlist items.
Apr 26 2018
Apr 26 2018
In T184092#3998152, @mpopov wrote:@Charlotte: since the user can switch between modes multiple times in any time period, are we interested in (1) % of users who have tried out the two modes† or (2) at a particular snapshot in time, what's the breakdown of people using each theme?
@Charlotte: did you mean to make this iOS ticket part of the Android baseline analytics epic? Also, iOS team is long way away from having data that can be presented; should we replace the parent task with T192819?
Apr 25 2018
Apr 25 2018
In T193052#4159509, @chelsyx wrote:FAQs of Baiduspider (in English, include UA): http://help.baidu.com/question?prod_id=99&class=0&id=3001
How to identify Baiduspider (in Chinese, let me know if you can't understand it with google translate): https://ziyuan.baidu.com/college/articleinfo?id=1002
@chelsyx Can you please help me? I’ve been able to find documentation (UserAgent strings and instructions for verifying) on Google’s, Bing’s, and Yandex’s crawlers but the best I’ve been able to find for Baidu (in English) is this blog post by a third party from 2011: https://chineseseoshifu.com/blog/new-baidu-user-agent-baiduspider.html so I suspect any official documentation about what Baiduspider’s UA looks like these days (or how to verify) would be on Baidu's website and in Chinese.
@Deskana: Googlebot Mobile does not seem to be a thing anymore according to its lack of presence on https://support.google.com/webmasters/answer/1061943
In T191859#4156112, @Tbayer wrote:Method 1 has the disadvantage that we would be able to find out username given crossDeviceID, which is not the case for Method 2.
How is that not the case for Method 2?
Apr 24 2018
Apr 24 2018
In T191859#4155885, @Nuria wrote:@mpopov FYI that adding things to X-Analytics does not work automagically, we strongly recommend team to track interactions using events.
Want to clarify With @chelsyx and @Fjalapeno that events do not have to be send from client side, the server can also send them. EL has a mediawiki server side client too.
@APalmer_WMF @Fjalapeno @Jhernandez: can y'all please take a look at the updated description and let us know if you have any questions or concerns.
mpopov renamed T191859: [EPIC] Reading List Sync service analytics from Enable Reading List Syncing usage stats to [EPIC] Reading List Sync service analytics.
mpopov moved T191859: [EPIC] Reading List Sync service analytics from Next Up to Tracking on the Product-Analytics board.
mpopov moved T190601: Update Audiences page and Key Product Metrics with April 2018 Readers data from Epics to Next Up on the Product-Analytics board.
mpopov closed T190600: Update Audiences page and Key Product Metrics with March 2018 Readers data as Resolved.
Done
mpopov updated the task description for T190600: Update Audiences page and Key Product Metrics with March 2018 Readers data.
In T191859#4152718, @Tgr wrote:All these new proposals sound a bit overcomplicated. Why not just use X-Analytics? There is already a purge mechanism for raw webrequest data, right?
Apr 23 2018
Apr 23 2018
mpopov moved T190600: Update Audiences page and Key Product Metrics with March 2018 Readers data from Triage to Next Up on the Product-Analytics board.
mpopov moved T186044: [EPIC] Modernize Search Platform dashboarding from Triage to Doing on the Product-Analytics board.
mpopov moved T172009: Add referer to WebrequestData from Triage to Tracking on the Product-Analytics board.
0.7.0 on CRAN as of 2018-03-21. Re-installed the package on prod & beta
mpopov moved T168967: Upload shiny-server .deb to our Stretch apt repository from Triage to Tracking on the Product-Analytics board.
mpopov moved T139487: Get 'sparklyr' working on stats1005 from Triage to Tracking on the Product-Analytics board.
Majority of this is done. Deb can ping Chelsy or me if she has any more questions or issues accessing the data.
mpopov moved T170995: Setup a mirror for R language dependencies (CRAN) from Triage to Tracking on the Product-Analytics board.
mpopov moved T174110: Private data access for non-person user that calculates metrics from Triage to Tracking on the Product-Analytics board.
mpopov moved T182352: UDF for language detection from Triage to Backlog on the Product-Analytics board.
mpopov moved T172581: [EPIC] Set up mechanism for archiving Google Search Console data from Triage to Doing on the Product-Analytics board.
mpopov moved T174512: [Blog Post] Applying epidemiology techniques to browser tabs from Triage to Backlog on the Product-Analytics board.
mpopov moved T190092: Selected languages stats on Wikipedia Android app from Doing to Next Up on the Product-Analytics board.
mpopov moved T184094: What are the most productive referrers/channels for Android? from In progress to Backlog on the Discovery-Analysis (Current work) board.
mpopov moved T184095: Understand Android app monthly active users and daily active users from Triage to Next Up on the Product-Analytics board.
mpopov closed T175048: Search Relevance Survey test #3: analysis of test, a subtask of T174106: Search Relevance Survey test #3: action items, as Resolved.
mpopov closed T177358: Metrics for SDoC: translations, a subtask of T174519: [epic] SDoC: Determine baseline for metrics, as Resolved.
mpopov closed T179528: Investigate full-text searches in event logging vs SRP pageviews as Resolved.
mpopov closed T179528: Investigate full-text searches in event logging vs SRP pageviews, a subtask of T178958: Metrics check-in for Q1 2017/18, as Resolved.
mpopov closed T185365: File contributions by bots vs users on Wikimedia Commons (Redux) as Resolved.
mpopov closed T185365: File contributions by bots vs users on Wikimedia Commons (Redux), a subtask of T185363: [EPIC] Learn about our databases and tools, as Resolved.
mpopov closed T186575: File type and deletion metrics on Wikimedia Commons (Redux), a subtask of T185363: [EPIC] Learn about our databases and tools, as Resolved.
Thanks, everyone!
mpopov closed T186682: Bug in user sampling for MobileWikiAppSessions, a subtask of T184089: Understand Android app usage by market, as Resolved.
mpopov moved T186828: Productionize per-country daily & monthly active app user stats from Triage to Backlog on the Product-Analytics board.
Good job, @MNeisler!
mpopov closed T187827: Search metrics on Wikimedia Commons (Redux), a subtask of T185363: [EPIC] Learn about our databases and tools, as Resolved.
mpopov moved T188421: Investigate the full-text desktop search CTR decline on Wikimedia Commons from Triage to Doing on the Product-Analytics board.
mpopov moved T189054: Learn and document data and analysis about Wikipedia iOS app from Triage to Doing on the Product-Analytics board.
mpopov moved T190092: Selected languages stats on Wikipedia Android app from Triage to Doing on the Product-Analytics board.
Content licensed under Creative Commons Attribution-ShareAlike (CC BY-SA) 4.0 unless otherwise noted; code licensed under GNU General Public License (GPL) 2.0 or later and other open source licenses. By using this site, you agree to the Terms of Use, Privacy Policy, and Code of Conduct. · Wikimedia Foundation · Privacy Policy · Code of Conduct · Terms of Use · Disclaimer · CC-BY-SA · GPL