Page MenuHomePhabricator
Feed Advanced Search

Mar 7 2018

mpopov added a comment to T186682: Bug in user sampling for MobileWikiAppSessions.

@Dbrant: which release of the app will have the fix?

Mar 7 2018, 11:53 PM · Product-Analytics, Patch-For-Review, Wikipedia-Android-App-Backlog, Discovery-Analysis
mpopov added a comment to T187239: Go over analytics events and make sure they're doing what we think they're doing.

Great work! Are we going to document findings about each schema on that schema's documentation page (or associated talk page) too?

Mar 7 2018, 10:12 PM · Product-Analytics, Discovery-Analysis (Current work), Wikipedia-Android-App-Backlog
mpopov added a comment to T187239: Go over analytics events and make sure they're doing what we think they're doing.
/* Funnel.SAMPLE_LOG_ALL sampling by default, right? Also channel info is missing :( Phab task incoming... */
public InstallReferrerFunnel(WikipediaApp app) {
    super(app, SCHEMA_NAME, REV_ID);
}

Correct; all of these events are sent for all users. (Will reply in separate task regarding channel info.)

Mar 7 2018, 9:15 PM · Product-Analytics, Discovery-Analysis (Current work), Wikipedia-Android-App-Backlog

Mar 6 2018

mpopov moved T189035: Change how font size switching is logged from Needs triage to Tracking on the Discovery-Analysis board.
Mar 6 2018, 7:13 PM · Patch-For-Review, Discovery-Analysis, Wikipedia-Android-App-Backlog (Android-app-release-v2.7.24x-H-Hot-pepper)
mpopov updated the task description for T189035: Change how font size switching is logged.
Mar 6 2018, 7:13 PM · Patch-For-Review, Discovery-Analysis, Wikipedia-Android-App-Backlog (Android-app-release-v2.7.24x-H-Hot-pepper)
mpopov triaged T189035: Change how font size switching is logged as Lowest priority.
Mar 6 2018, 5:46 PM · Patch-For-Review, Discovery-Analysis, Wikipedia-Android-App-Backlog (Android-app-release-v2.7.24x-H-Hot-pepper)
mpopov renamed T186768: Extend all EL schemas to include client-side timestamp from Add client timestamp to MobileWikiAppSessions to Extent all EL schemas to include client-side timestamp.
Mar 6 2018, 5:30 PM · Patch-For-Review, Wikipedia-Android-App-Backlog (Android-app-release-v2.7.24x-H-Hot-pepper)
mpopov added a comment to T186768: Extend all EL schemas to include client-side timestamp.

Update: we decided to change this task to be about extending all schemas to include a client timestamp.

Mar 6 2018, 5:19 PM · Patch-For-Review, Wikipedia-Android-App-Backlog (Android-app-release-v2.7.24x-H-Hot-pepper)
mpopov added a comment to T185964: Update Audiences page and Key Product Metrics with February 2018 Readers data.

Forgot to comment here but Android monthly metrics for Feb 2018 have been updated as well.

Mar 6 2018, 5:03 PM · Reading-analysis
mpopov updated the task description for T187239: Go over analytics events and make sure they're doing what we think they're doing.
Mar 6 2018, 2:24 AM · Product-Analytics, Discovery-Analysis (Current work), Wikipedia-Android-App-Backlog
mpopov added a comment to T187239: Go over analytics events and make sure they're doing what we think they're doing.

@Charlotte: I'm going to share the spreadsheet with you and Dmitry. Once you review, can you please comment on this ticket so we know whether to move it into our Done column? ta!

Mar 6 2018, 2:20 AM · Product-Analytics, Discovery-Analysis (Current work), Wikipedia-Android-App-Backlog
mpopov added a comment to T186575: File type and deletion metrics on Wikimedia Commons (Redux).

Looks good! I don't think CI stuff is that important here, so shall we move it into Done?

Mar 6 2018, 2:17 AM · Product-Analytics, Discovery-Analysis (Current work)
mpopov changed the status of T170494: [EPIC] Reconfigure Discovery-Stats on Analytics Cluster from Open to Stalled.

Waiting for systems users with private data access to become available.

Mar 6 2018, 2:15 AM · Patch-For-Review, Product-Analytics, Discovery-Analysis (Current work), Discovery-ARCHIVED
mpopov changed the status of T170494: [EPIC] Reconfigure Discovery-Stats on Analytics Cluster, a subtask of T170471: Move statistics::discovery jobs from stat1002 -> stat1005, from Open to Stalled.
Mar 6 2018, 2:15 AM · Analytics-Kanban, Patch-For-Review, Analytics-Clusters
mpopov changed the status of T171985: SERP tagging from Open to Stalled.
Mar 6 2018, 2:14 AM · Product-Analytics, Discovery-Analysis, Discovery-ARCHIVED
mpopov added a comment to T183024: Analysis of hewiki's A/B test (> 1% of search traffic with a new model).

Shall we move this into the "Done" column as the report looks to have been reviewed?

Mar 6 2018, 2:13 AM · Discovery-Analysis (Current work)
mpopov added a comment to T184093: Usage of feed customisation in Android app.

I think I need to redo some/most/all of this once the sampling bug (T186682) fix is deployed and we've collected enough data from users of the new version.

Mar 6 2018, 2:12 AM · Product-Analytics, Discovery-Analysis (Current work)
mpopov moved T184641: Metrics for Android quarterly update from Needs review to Done on the Discovery-Analysis (Current work) board.
Mar 6 2018, 2:10 AM · Product-Analytics, Discovery-Analysis (Current work)
mpopov moved T187239: Go over analytics events and make sure they're doing what we think they're doing from Backlog to Needs review on the Discovery-Analysis (Current work) board.
Mar 6 2018, 2:09 AM · Product-Analytics, Discovery-Analysis (Current work), Wikipedia-Android-App-Backlog
mpopov updated the task description for T187239: Go over analytics events and make sure they're doing what we think they're doing.
Mar 6 2018, 2:09 AM · Product-Analytics, Discovery-Analysis (Current work), Wikipedia-Android-App-Backlog
mpopov claimed T187239: Go over analytics events and make sure they're doing what we think they're doing.
Mar 6 2018, 2:08 AM · Product-Analytics, Discovery-Analysis (Current work), Wikipedia-Android-App-Backlog
mpopov added a comment to T187239: Go over analytics events and make sure they're doing what we think they're doing.

To clarify, the sampling logic in the app works like this:

  • When the app is installed, a unique appInstallId is generated and saved, which is a random UUID. (this will identify this "user" across different schemas)
  • To decide whether a certain funnel's events are sent or not, we take the last digits of the appInstallId, and if those digits equal zero, modulo the sample rate (e.g. SAMPLE_LOG_100), then the events from that funnel will be sent. Otherwise the funnel will be silent.

That's basically it; there's no other random selection at work. This has the following implications:

  • If the appInstallId = 0 mod 100, then all funnels with SAMPLE_LOG_100 will be enabled, as well as all funnels with SAMPLE_LOG_10.
  • If the appInstallId = 0 mod 10, then all funnels with SAMPLE_LOG_10 will be enabled, but not necessarily funnels with SAMPLE_LOG_100.
  • Funnels with SAMPLE_LOG_ALL are always enabled, since appInstallId mod 1 is always 0.
Mar 6 2018, 2:04 AM · Product-Analytics, Discovery-Analysis (Current work), Wikipedia-Android-App-Backlog

Mar 3 2018

mpopov updated subscribers of T188453: Google Search Console access for Search Platform team.
Mar 3 2018, 1:15 AM · Search-Console-access-request, Discovery-Search, SRE

Mar 2 2018

mpopov awarded T184264: Generate/delete app install ID with opt-in/opt-out actions instead of install/uninstall a Like token.
Mar 2 2018, 8:33 PM · Wikipedia-Android-App-Backlog (Android-app-release-v2.7.23x-G-Grape), Patch-For-Review
mpopov closed T187104: Pageviews/Stats on dataviz-literacy.wmflabs.org as Resolved.

And there's data! Thank you!

Mar 2 2018, 6:55 PM · Analytics-Kanban, Reading-analysis, Discovery-Analysis

Mar 1 2018

mpopov triaged T188683: Highlighted text sometimes includes adjacent extras like punctuation as Lowest priority.
Mar 1 2018, 11:52 PM · Wikipedia-Android-App-Backlog
mpopov added a comment to T187104: Pageviews/Stats on dataviz-literacy.wmflabs.org.

Hi @mpopov!

Is there a special login I'll need to use?

I enter piwik.wikimedia.org with my Wikitech credentials.

Mar 1 2018, 5:29 PM · Analytics-Kanban, Reading-analysis, Discovery-Analysis
mpopov added a comment to T187104: Pageviews/Stats on dataviz-literacy.wmflabs.org.

You just have to add tracking code to your site

Mar 1 2018, 12:28 AM · Analytics-Kanban, Reading-analysis, Discovery-Analysis

Feb 28 2018

mpopov closed T188557: Malformed wiki field in mobile app event logs as Resolved.

Yes indeed, there were some old versions of the app (from around Dec 2016) that were erroneously populating the wiki field with the app version, but is this actually happening with any recent versions of the app?

Feb 28 2018, 10:56 PM · Discovery-Analysis, Wikipedia-Android-App-Backlog, Analytics, Analytics-Data-Quality
mpopov triaged T188557: Malformed wiki field in mobile app event logs as Unbreak Now! priority.
Feb 28 2018, 10:22 PM · Discovery-Analysis, Wikipedia-Android-App-Backlog, Analytics, Analytics-Data-Quality

Feb 27 2018

mpopov created T188453: Google Search Console access for Search Platform team.
Feb 27 2018, 10:02 PM · Search-Console-access-request, Discovery-Search, SRE

Feb 26 2018

mpopov added a comment to T172410: Replace the current multisource analytics-store setup.

I never do cross-wiki joins, so no objections from me either. BUT we do have Maps prevalence metrics calculated on a per-wiki basis so we will need to know which wikis will be on which hosts.

Feb 26 2018, 7:03 PM · Analytics-Radar, Product-Analytics, WMDE-Analytics-Engineering, User-Addshore, User-Elukey, Research

Feb 24 2018

mpopov added a comment to T180825: Investigate increase in pageviews with Android app v190.

@Tbayer: I'm trying to figure out the -100 in ROUND(100*SUM(IF(month = 3 AND day <= 28, view_count, null)) / SUM(IF(month = 2, view_count, null)) -100,1) and no success 😕

Feb 24 2018, 12:51 AM · Product-Analytics, Reading-analysis, Android-app-Bugs, Wikipedia-Android-App-Backlog

Feb 23 2018

mpopov added a comment to T184092: Usage of colour modes in Android app.

@Charlotte: since the user can switch between modes multiple times in any time period, are we interested in (1) % of users who have tried out the two modes† or (2) at a particular snapshot in time, what's the breakdown of people using each theme?

Feb 23 2018, 11:26 PM · Product-Analytics, Reading-analysis
mpopov created T188146: Android app acquisition channel is not logged in MobileWikiAppInstallReferrer.
Feb 23 2018, 9:57 PM · Wikipedia-Android-App-Backlog (Android-app-release-v2.7.24x-H-Hot-pepper), Patch-For-Review, Android-app-Bugs
mpopov updated subscribers of T187239: Go over analytics events and make sure they're doing what we think they're doing.

@Dbrant @Sharvaniharan @cooltey: I'm trying to figure out the different sampling configurations as part of this audit. Can one of you please review my guesses? Thanks!

Feb 23 2018, 9:25 PM · Product-Analytics, Discovery-Analysis (Current work), Wikipedia-Android-App-Backlog
mpopov claimed T184095: Understand Android app monthly active users and daily active users.
Feb 23 2018, 7:13 PM · Product-Analytics

Feb 22 2018

mpopov updated the task description for T187239: Go over analytics events and make sure they're doing what we think they're doing.
Feb 22 2018, 10:42 PM · Product-Analytics, Discovery-Analysis (Current work), Wikipedia-Android-App-Backlog
mpopov updated subscribers of T180190: Make event logging more efficient.

Make battery usage more efficient

Feb 22 2018, 10:40 PM · Wikipedia-Android-App-Backlog
mpopov added a comment to T180193: Create summary event for event logging.

@Sharvaniharan: hi! What's the intention behind this and what data did you have in mind?

Feb 22 2018, 10:29 PM · Wikipedia-Android-App-Backlog

Feb 21 2018

mpopov updated subscribers of T184095: Understand Android app monthly active users and daily active users.
Feb 21 2018, 2:11 AM · Product-Analytics

Feb 20 2018

mpopov added a comment to T184095: Understand Android app monthly active users and daily active users.

Some ideas of improvement:

  • Use parquet file format instead of default hive format
  • Store data daily instead of hourly (~10Mb per hour in hive format, meaning ~250Mb per day, plus parquet compaction --> Should be good)
  • Add request_count instead of keeping distincts
  • Prevent errors in happending data using OVERWRITE
  • Enforce number of files hive output (default is way too many, therefore small, therefore inefficient)

Updated version of the code below:

Feb 20 2018, 8:29 PM · Product-Analytics
mpopov updated subscribers of T184095: Understand Android app monthly active users and daily active users.

@Nuria @JAllemandou: I ran a query over the weekend because I need a processed subset of webrequests from Dec & Jan to work with. Everything was fine and last one I remember working right was 2017-12-05, but I just checked in and at some point I just kept getting:

Feb 20 2018, 6:33 PM · Product-Analytics

Feb 16 2018

mpopov claimed T184094: What are the most productive referrers/channels for Android?.
Feb 16 2018, 5:52 PM · Product-Analytics, Reading-analysis
mpopov added a comment to T184095: Understand Android app monthly active users and daily active users.

Note to future self: will need to join MobileWikiAppReadingLists events with wmf.webrequest data in a way that makes it easier to do T184094 also.

Feb 16 2018, 5:20 PM · Product-Analytics
mpopov added a comment to T184094: What are the most productive referrers/channels for Android?.

What sources/referrers are bringing people to the app?

Feb 16 2018, 12:13 AM · Product-Analytics, Reading-analysis

Feb 15 2018

mpopov added a comment to T186575: File type and deletion metrics on Wikimedia Commons (Redux).

@MNeisler good job! I like how you wrote your findings. Some initial feedback about the visualizations:

Feb 15 2018, 1:07 AM · Product-Analytics, Discovery-Analysis (Current work)

Feb 14 2018

mpopov moved T184089: Understand Android app usage by market from In progress to Stalled/Waiting on the Discovery-Analysis (Current work) board.

Done with DAU/MAU/stickiness part: https://github.com/wikimedia-research/App-Android-Baseline_Metrics/tree/master/T184089#app-stickiness

Feb 14 2018, 9:08 PM · Reading-analysis, Product-Analytics

Feb 13 2018

mpopov updated subscribers of T187104: Pageviews/Stats on dataviz-literacy.wmflabs.org.

@Nuria: my blog post is ready to be published but I'm holding off until I add the piwik stuff :) if there's a chance you can help with this sometime this week that would be awesome and appreciated!

Feb 13 2018, 8:11 PM · Analytics-Kanban, Reading-analysis, Discovery-Analysis

Feb 12 2018

mpopov added subtasks for T184089: Understand Android app usage by market: T186682: Bug in user sampling for MobileWikiAppSessions, T186768: Extend all EL schemas to include client-side timestamp.
Feb 12 2018, 7:47 PM · Reading-analysis, Product-Analytics
mpopov added a parent task for T186682: Bug in user sampling for MobileWikiAppSessions: T184089: Understand Android app usage by market.
Feb 12 2018, 7:47 PM · Product-Analytics, Patch-For-Review, Wikipedia-Android-App-Backlog, Discovery-Analysis
mpopov moved T186828: Productionize per-country daily & monthly active app user stats from Needs triage to Tracking on the Discovery-Analysis board.
Feb 12 2018, 7:45 PM · Patch-For-Review, Product-Analytics, Analytics, Discovery-Analysis, Reading-analysis
mpopov added a comment to T186682: Bug in user sampling for MobileWikiAppSessions.

Going forward let's please make a practice for developers to do basic vetting of metrics. Example: notice that in this case to see the sampling oddities it was enough to add the uniques in both sources (a simple addition, no stats). Let's not fire and forget metrics but rather follow through a bit to make sure things add up.

Feb 12 2018, 7:31 PM · Product-Analytics, Patch-For-Review, Wikipedia-Android-App-Backlog, Discovery-Analysis
mpopov created T187104: Pageviews/Stats on dataviz-literacy.wmflabs.org.
Feb 12 2018, 6:44 PM · Analytics-Kanban, Reading-analysis, Discovery-Analysis
mpopov awarded T186682: Bug in user sampling for MobileWikiAppSessions a Evil Spooky Haunted Tree token.
Feb 12 2018, 5:51 PM · Product-Analytics, Patch-For-Review, Wikipedia-Android-App-Backlog, Discovery-Analysis
mpopov awarded T186180: Move non-critical monthly jobs to the nice queue a Like token.
Feb 12 2018, 5:47 PM · Analytics-Kanban, Patch-For-Review, Analytics-Clusters

Feb 9 2018

chelsyx awarded T186828: Productionize per-country daily & monthly active app user stats a Like token.
Feb 9 2018, 10:05 PM · Patch-For-Review, Product-Analytics, Analytics, Discovery-Analysis, Reading-analysis

Feb 8 2018

mpopov updated the task description for T186828: Productionize per-country daily & monthly active app user stats.
Feb 8 2018, 9:12 PM · Patch-For-Review, Product-Analytics, Analytics, Discovery-Analysis, Reading-analysis
mpopov created T186828: Productionize per-country daily & monthly active app user stats.
Feb 8 2018, 9:00 PM · Patch-For-Review, Product-Analytics, Analytics, Discovery-Analysis, Reading-analysis
mpopov added a comment to T184027: Figure out sample size calculation for AB test under Bayesian framework.

For future self: https://github.com/slc-rug/2018-02-power-simulation/blob/master/PowerSimulation-handout.pdf & https://www.youtube.com/watch?v=Oj7dGg2sraQ

Feb 8 2018, 8:25 PM · Product-Analytics
mpopov awarded T180651: Calculate Android app daily active users from Nigeria a 100 token.
Feb 8 2018, 12:38 AM · New-Readers, Reading-analysis
mpopov triaged T186768: Extend all EL schemas to include client-side timestamp as High priority.
Feb 8 2018, 12:04 AM · Patch-For-Review, Wikipedia-Android-App-Backlog (Android-app-release-v2.7.24x-H-Hot-pepper)

Feb 2 2018

mpopov added a comment to T184089: Understand Android app usage by market.

@Tbayer Done! Thank you so much for the suggestions! And yep, the benchmarks are calculated from popular free Books & Reference apps, not the whole Play Store like I previously thought.

Feb 2 2018, 11:45 PM · Reading-analysis, Product-Analytics
mpopov updated subscribers of T184089: Understand Android app usage by market.

@Charlotte & @JKatzWMF: okay, here's my progress so far: https://github.com/wikimedia-research/App-Android-Baseline_Metrics/tree/master/T184089 Am I on the right track or nah?

Feb 2 2018, 12:46 AM · Reading-analysis, Product-Analytics

Jan 30 2018

mpopov moved T183984: [WMF All Hands 2018] Dataviz Literacy Workshop from Needs review to Done on the Discovery-Analysis (Current work) board.
Jan 30 2018, 6:54 PM · Product-Analytics, Patch-For-Review, Discovery-ARCHIVED, Discovery-Analysis (Current work)
mpopov added a comment to T185131: Ubuntu Trusty VMs in discovery-stats for the All Hands workshop.

This is ok, but keep in mind that Trusty is deprecated throughout WMF infrastructure (including on WMCS VMs). Don't get too attached to these VMs -- I'd advise moving all of your systems to Stretch as soon as you're able.

Jan 30 2018, 5:51 PM · Product-Analytics, Patch-For-Review, cloud-services-team (Kanban), Cloud-VPS (Quota-requests), Reading-analysis, Discovery-Analysis

Jan 29 2018

mpopov closed T185131: Ubuntu Trusty VMs in discovery-stats for the All Hands workshop as Resolved.

Changed the name back because I want to keep these up as a learning tool for anyone who missed the workshop and future WMF employees.

Jan 29 2018, 1:25 AM · Product-Analytics, Patch-For-Review, cloud-services-team (Kanban), Cloud-VPS (Quota-requests), Reading-analysis, Discovery-Analysis
mpopov closed T185131: Ubuntu Trusty VMs in discovery-stats for the All Hands workshop, a subtask of T183984: [WMF All Hands 2018] Dataviz Literacy Workshop, as Resolved.
Jan 29 2018, 1:25 AM · Product-Analytics, Patch-For-Review, Discovery-ARCHIVED, Discovery-Analysis (Current work)
mpopov renamed T185131: Ubuntu Trusty VMs in discovery-stats for the All Hands workshop from Remove Ubuntu Trusty VMs in discovery-stats after the All Hands workshop to Ubuntu Trusty VMs in discovery-stats for the All Hands workshop.
Jan 29 2018, 1:24 AM · Product-Analytics, Patch-For-Review, cloud-services-team (Kanban), Cloud-VPS (Quota-requests), Reading-analysis, Discovery-Analysis

Jan 22 2018

mpopov awarded T174465: Puppet admin module should support adding system users to managed groups a Love token.
Jan 22 2018, 8:10 PM · Analytics-Kanban, Analytics, Patch-For-Review, SRE
mpopov updated subscribers of T185526: Make Wikipedia clickstream dataset available as API.
Jan 22 2018, 8:02 PM · Analytics
mpopov updated subscribers of T185526: Make Wikipedia clickstream dataset available as API.
Jan 22 2018, 8:01 PM · Analytics
mpopov created T185526: Make Wikipedia clickstream dataset available as API.
Jan 22 2018, 7:58 PM · Analytics
mpopov claimed T184089: Understand Android app usage by market.
Jan 22 2018, 5:36 PM · Reading-analysis, Product-Analytics
mpopov moved T179528: Investigate full-text searches in event logging vs SRP pageviews from Needs review to Done on the Discovery-Analysis (Current work) board.
Jan 22 2018, 5:29 PM · Product-Analytics, Discovery-Analysis (Current work), Discovery-ARCHIVED

Jan 20 2018

mpopov updated subscribers of T174465: Puppet admin module should support adding system users to managed groups.

This is solely for T174110 or are we anticipating other use cases?

Jan 20 2018, 12:19 AM · Analytics-Kanban, Analytics, Patch-For-Review, SRE

Jan 19 2018

mpopov updated subscribers of T183984: [WMF All Hands 2018] Dataviz Literacy Workshop.
Jan 19 2018, 10:55 PM · Product-Analytics, Patch-For-Review, Discovery-ARCHIVED, Discovery-Analysis (Current work)

Jan 17 2018

mpopov moved T183984: [WMF All Hands 2018] Dataviz Literacy Workshop from In progress to Needs review on the Discovery-Analysis (Current work) board.

Static version: https://bearloga.github.io/wmf-allhands18/ and here's the interactive app (which has a quiz): http://dataviz-literacy.wmflabs.org/

Jan 17 2018, 11:33 PM · Product-Analytics, Patch-For-Review, Discovery-ARCHIVED, Discovery-Analysis (Current work)
mpopov added a parent task for T185131: Ubuntu Trusty VMs in discovery-stats for the All Hands workshop: T183984: [WMF All Hands 2018] Dataviz Literacy Workshop.
Jan 17 2018, 11:26 PM · Product-Analytics, Patch-For-Review, cloud-services-team (Kanban), Cloud-VPS (Quota-requests), Reading-analysis, Discovery-Analysis
mpopov added a subtask for T183984: [WMF All Hands 2018] Dataviz Literacy Workshop: T185131: Ubuntu Trusty VMs in discovery-stats for the All Hands workshop.
Jan 17 2018, 11:26 PM · Product-Analytics, Patch-For-Review, Discovery-ARCHIVED, Discovery-Analysis (Current work)
mpopov added a comment to T185131: Ubuntu Trusty VMs in discovery-stats for the All Hands workshop.

Thank you very much, @Andrew!

Jan 17 2018, 11:01 PM · Product-Analytics, Patch-For-Review, cloud-services-team (Kanban), Cloud-VPS (Quota-requests), Reading-analysis, Discovery-Analysis
mpopov added a comment to T185131: Ubuntu Trusty VMs in discovery-stats for the All Hands workshop.

I will create these VMs for you. What shall I call them?

Jan 17 2018, 9:10 PM · Product-Analytics, Patch-For-Review, cloud-services-team (Kanban), Cloud-VPS (Quota-requests), Reading-analysis, Discovery-Analysis
mpopov triaged T185131: Ubuntu Trusty VMs in discovery-stats for the All Hands workshop as High priority.
Jan 17 2018, 7:53 PM · Product-Analytics, Patch-For-Review, cloud-services-team (Kanban), Cloud-VPS (Quota-requests), Reading-analysis, Discovery-Analysis

Jan 16 2018

mpopov added a comment to T184767: Make it possible to stop a survey after receiving a certain number of responses.

I don't think limiting a survey to a predetermined maximum of n responses is the way to go for the reasons @EBernhardson mentioned. The limiting factor should only be the start and end times/dates, ideally so that all timezones are equally represented.

Jan 16 2018, 6:46 PM · Web-Team-Backlog, QuickSurveys

Jan 12 2018

mpopov awarded T184768: Bug behavior of QTree[Long] for quantileBounds a Mountain of Wealth token.
Jan 12 2018, 10:44 PM · Analytics-Kanban, Product-Analytics, Patch-For-Review, Discovery-Analysis, Wikipedia-Android-App-Backlog

Jan 9 2018

mpopov updated subscribers of T172581: [EPIC] Set up mechanism for archiving Google Search Console data.

Call me crazy but i bet if we ask google for this data they will be happy to give it to us w/o having to setup web scraping/downloads
Again, call me crazy but i bet this data could be made public by google 100% such you do not need authentication to query it , we woudl be able to do it and so will be any interested party. seems that it would require a few conversations but little actual hands-on work

Jan 9 2018, 11:53 PM · Epic, Product-Analytics, SEO

Jan 3 2018

mpopov added a comment to T184019: Run search relevance survey on enwiki and frwiki.

@mpopov I wasn't quite sure from https://wikimedia-research.github.io/Discovery-Search-Adhoc-RelevanceSurveys/#responses_required , is 40 to 70 responses the number of impressions (yes+no+dismiss+timeout), the number of clicks (yes+no+dismiss), or the number of yes+no? I think it was yes+no+dismiss, but it might have been yes+no+dismiss+timeout?

Closer reading of the report:

the model is very accurate with at least 40 yes/no/unsure/dismiss responses and the most accurate with at least 70 responses

I think is saying that we are not considering timeouts here, which means with an ~30% response rate to get 70 responses we need 210 impressions?

Jan 3 2018, 6:56 PM · Discovery-ARCHIVED

Jan 2 2018

chelsyx awarded T179528: Investigate full-text searches in event logging vs SRP pageviews a Like token.
Jan 2 2018, 11:49 PM · Product-Analytics, Discovery-Analysis (Current work), Discovery-ARCHIVED
mpopov added a comment to T142795: Offer interwiki search with language detection functionality over the API.

Just tried the second link (API results for "sistema parlamentario con sede de gobierno" on enwiki (including results from eswiki)) and got the following:

Jan 2 2018, 8:28 PM · MW-1.29-release (WMF-deploy-2017-01-03_(1.29.0-wmf.7)), MW-1.29-release-notes, Discovery-Search (Current work), Patch-For-Review, good first task, Wikipedia-Android-App-Backlog, Wikipedia-iOS-App-Backlog, Discovery-ARCHIVED
mpopov updated the task description for T183984: [WMF All Hands 2018] Dataviz Literacy Workshop.
Jan 2 2018, 8:04 PM · Product-Analytics, Patch-For-Review, Discovery-ARCHIVED, Discovery-Analysis (Current work)
mpopov moved T183984: [WMF All Hands 2018] Dataviz Literacy Workshop from Backlog to In progress on the Discovery-Analysis (Current work) board.
Jan 2 2018, 8:04 PM · Product-Analytics, Patch-For-Review, Discovery-ARCHIVED, Discovery-Analysis (Current work)
mpopov triaged T183984: [WMF All Hands 2018] Dataviz Literacy Workshop as Medium priority.
Jan 2 2018, 8:03 PM · Product-Analytics, Patch-For-Review, Discovery-ARCHIVED, Discovery-Analysis (Current work)

Dec 20 2017

debt awarded T175048: Search Relevance Survey test #3: analysis of test a Like token.
Dec 20 2017, 10:24 PM · Product-Analytics, Discovery-Analysis (Current work), Discovery-ARCHIVED
mpopov added a comment to T175048: Search Relevance Survey test #3: analysis of test.

Added Python version into the production instructions for @EBernhardson's convenience :) https://github.com/wikimedia-research/Discovery-Search-Adhoc-RelevanceSurveys/tree/master/production#predicting-rank

Dec 20 2017, 9:49 PM · Product-Analytics, Discovery-Analysis (Current work), Discovery-ARCHIVED
mpopov added a comment to T175048: Search Relevance Survey test #3: analysis of test.

Thanks! I'm not sure what I was expecting, but it is interesting to see. It seems to like giving scores of 0.5, but a lot of models end up with a sort of "default" score they like best. I am surprised that it doesn't show any scores above 0.75. Should we map scores from a 0-0.75 range, rather then 0-1? Or, based on the low end of the trend line, maybe even 0.25-075?

Dec 20 2017, 7:25 PM · Product-Analytics, Discovery-Analysis (Current work), Discovery-ARCHIVED
TJones awarded F11851691: plot.png a Pterodactyl token.
Dec 20 2017, 4:37 PM

Dec 19 2017

mpopov moved T175048: Search Relevance Survey test #3: analysis of test from In progress to Done on the Discovery-Analysis (Current work) board.

Alrighty, here ya go! It's not as pretty as you were probably expecting!

Dec 19 2017, 10:52 PM · Product-Analytics, Discovery-Analysis (Current work), Discovery-ARCHIVED

Dec 18 2017

dcausse awarded F11851691: plot.png a Love token.
Dec 18 2017, 3:55 PM

Dec 15 2017

mpopov moved T179528: Investigate full-text searches in event logging vs SRP pageviews from In progress to Needs review on the Discovery-Analysis (Current work) board.

@EBernhardson @chelsyx do you see any errors?

Dec 15 2017, 10:06 PM · Product-Analytics, Discovery-Analysis (Current work), Discovery-ARCHIVED
mpopov moved T175048: Search Relevance Survey test #3: analysis of test from Needs review to In progress on the Discovery-Analysis (Current work) board.
Dec 15 2017, 5:58 PM · Product-Analytics, Discovery-Analysis (Current work), Discovery-ARCHIVED