mpopov (Mikhail Popov)
Data Analyst

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Thursday

  • Clear sailing ahead.

User Details

User Since
Jul 27 2015, 4:15 PM (117 w, 8 h)
Availability
Available
IRC Nick
bearloga
LDAP User
Bearloga
MediaWiki User
MPopov (WMF)

Data Analyst in Reading (formerly of Discovery) | User:MPopov (WMF) | Highlighted Works

Recent Activity

Yesterday

mpopov lowered the priority of T168967: Upload shiny-server .deb to our Jessie apt repository from Normal to Lowest.

Ish? Until this is done, we're limited to using Ubuntu for the VMs that host our dashboards. Since the WM Cloud team (formerly WM Labs) is deprecating Ubuntu Trusty in favor of only offering Debian for VMs, we'll have to file a Phab ticket requesting a Trusty instance if we have to shut one down and launch a replacement. I don't think this task should be declined, but I am gonna adjust the priority to reflect where we are on this.

Mon, Oct 23, 8:28 PM · Discovery-Analysis, Discovery, Operations, Discovery-Search (Current work)

Tue, Oct 17

debt awarded T177356: Metrics for SDoC: look at querying databases a Party Time token.
Tue, Oct 17, 9:11 PM · Discovery-Analysis (Current work), Structured-Data-Commons, Discovery, Wikidata
mpopov added a comment to T176493: Analysis of testing on 18 wikis with > 1% of search traffic.

@chelsyx I dont think the ltr-i-1024 bucket should be included in this first look, it's an interleaved result set that can't really be interpreted with our standard metrics.

Tue, Oct 17, 12:20 PM · Discovery-Analysis (Current work), Discovery-Search (Current work), Discovery

Fri, Oct 13

mpopov added a comment to T177354: Metrics for SDoC: look at contributions.

@chelsyx do you wanna add your stuff to https://github.com/wikimedia-research/SDoC-Initial-Metrics ?

Fri, Oct 13, 7:45 PM · Discovery-Analysis (Current work), Structured-Data-Commons, Discovery, Wikidata
mpopov moved T177356: Metrics for SDoC: look at querying databases from In progress to Done on the Discovery-Analysis (Current work) board.

Queries & data uploaded to https://github.com/wikimedia-research/SDoC-Initial-Metrics

Fri, Oct 13, 7:44 PM · Discovery-Analysis (Current work), Structured-Data-Commons, Discovery, Wikidata
mpopov updated the task description for T177356: Metrics for SDoC: look at querying databases.
Fri, Oct 13, 7:42 PM · Discovery-Analysis (Current work), Structured-Data-Commons, Discovery, Wikidata
mpopov added a comment to T177356: Metrics for SDoC: look at querying databases.

Growth of number of deleters over time:

Fri, Oct 13, 7:31 PM · Discovery-Analysis (Current work), Structured-Data-Commons, Discovery, Wikidata
mpopov updated the task description for T177356: Metrics for SDoC: look at querying databases.
Fri, Oct 13, 6:18 PM · Discovery-Analysis (Current work), Structured-Data-Commons, Discovery, Wikidata
mpopov added a comment to T177356: Metrics for SDoC: look at querying databases.

Total files uploaded to Commons (as of right now) by extension:

Fri, Oct 13, 6:18 PM · Discovery-Analysis (Current work), Structured-Data-Commons, Discovery, Wikidata

Thu, Oct 12

mpopov moved T178096: Make a Puppet profile/role for doing R-based heavy stats/ML on Wikimedia Cloud from In progress to Needs review on the Discovery-Analysis (Current work) board.
Thu, Oct 12, 8:49 PM · Patch-For-Review, Discovery-Analysis (Current work), Discovery
mpopov moved T178096: Make a Puppet profile/role for doing R-based heavy stats/ML on Wikimedia Cloud from Backlog to In progress on the Discovery-Analysis (Current work) board.
Thu, Oct 12, 5:30 PM · Patch-For-Review, Discovery-Analysis (Current work), Discovery
mpopov created T178096: Make a Puppet profile/role for doing R-based heavy stats/ML on Wikimedia Cloud.
Thu, Oct 12, 5:29 PM · Patch-For-Review, Discovery-Analysis (Current work), Discovery

Wed, Oct 11

mpopov updated the task description for T177356: Metrics for SDoC: look at querying databases.
Wed, Oct 11, 11:28 PM · Discovery-Analysis (Current work), Structured-Data-Commons, Discovery, Wikidata
mpopov added a comment to T177356: Metrics for SDoC: look at querying databases.
  • Most copyright-related deletions happen within 1 day of upload across almost all media types, with the exception of 'drawing' (SVGs)
  • A lot of audio files are deleted within 1 minute or 1 week of upload
  • Half of all images and PDFs deleted were deleted within 1 month of upload for non-copyright reasons
Wed, Oct 11, 11:27 PM · Discovery-Analysis (Current work), Structured-Data-Commons, Discovery, Wikidata
mpopov updated the task description for T177356: Metrics for SDoC: look at querying databases.
Wed, Oct 11, 9:57 PM · Discovery-Analysis (Current work), Structured-Data-Commons, Discovery, Wikidata
mpopov added a comment to T177356: Metrics for SDoC: look at querying databases.

Reasons for files deleted in 2017:

Wed, Oct 11, 9:00 PM · Discovery-Analysis (Current work), Structured-Data-Commons, Discovery, Wikidata
mpopov added a comment to T177354: Metrics for SDoC: look at contributions.

Unfortunately, the mediawiki snapshot doesn't has the image table which describes images and other uploaded files.

Wed, Oct 11, 6:19 PM · Discovery-Analysis (Current work), Structured-Data-Commons, Discovery, Wikidata
mpopov added a comment to T177354: Metrics for SDoC: look at contributions.

Hey @chelsyx - what time frame does this cover?

Wed, Oct 11, 5:05 PM · Discovery-Analysis (Current work), Structured-Data-Commons, Discovery, Wikidata
mpopov claimed T177356: Metrics for SDoC: look at querying databases.
Wed, Oct 11, 4:00 PM · Discovery-Analysis (Current work), Structured-Data-Commons, Discovery, Wikidata
mpopov moved T177356: Metrics for SDoC: look at querying databases from Needs triage to Current work on the Discovery-Analysis board.
Wed, Oct 11, 3:59 PM · Discovery-Analysis (Current work), Structured-Data-Commons, Discovery, Wikidata

Tue, Oct 10

mpopov updated subscribers of T171652: Language Analysis Morphological Library Research Spike.

Perhaps worth noting that I'm pretty sure http://discovery.wmflabs.org/metrics/#langproj_breakdown isn't a true breakdown of search volume, although i should double check with @mpopov . I think that's a proportion of events in the TestSeachSatisfaction schema. The sampling on low volume wikis is all the same, but the top 20 or so have custom sampling rates which means we can't directly compare the numbers.

Tue, Oct 10, 11:42 PM · Discovery-Search (Current work), Tamil-Sites, Malayalam-Sites, Bengali-Sites, Discovery

Fri, Oct 6

mpopov moved T176811: [Dashboard] Count the number of user session tokens by volume for mobile web search from Needs review to Done on the Discovery-Analysis (Current work) board.

@chelsyx: thanks and good job!

Fri, Oct 6, 11:22 PM · Discovery-Analysis (Current work)
mpopov moved T171215: Interleaved results A/B test: analysis of data from Needs review to Done on the Discovery-Analysis (Current work) board.

Final draft up at https://wikimedia-research.github.io/Discovery-Search-Test-InterleavedLTR/

Fri, Oct 6, 11:14 PM · Discovery-Search (Current work), Discovery-Analysis (Current work), Discovery, CirrusSearch

Thu, Oct 5

mpopov committed R1821:2fced58f4463: [WIP] Add functions for working with interleaved experiments (authored by mpopov).
[WIP] Add functions for working with interleaved experiments
Thu, Oct 5, 5:43 PM
mpopov committed R1821:1b8b395c0df9: Fix variable name (authored by mpopov).
Fix variable name
Thu, Oct 5, 5:38 PM
mpopov committed R1821:0449576e628d: [WIP] Add functions for working with interleaved experiments (authored by mpopov).
[WIP] Add functions for working with interleaved experiments
Thu, Oct 5, 5:34 PM
mpopov committed R1821:adcc85c94664: Switch to fetching EL data from db1047 (authored by mpopov).
Switch to fetching EL data from db1047
Thu, Oct 5, 5:17 PM

Tue, Oct 3

mpopov added a comment to T162369: Evaluate rescore windows for learning to rank.

It would depend on how often things below the top 20 move into the top 20 in practice, not just in theory. We can use the search logs to find this out, no?

Tue, Oct 3, 5:57 PM · Discovery-Search (Current work), Discovery

Mon, Oct 2

mpopov added a comment to T176997: Extract a set of a few hundred most popular abandoned queries.

A few things that come to mind:

  • A nice large list would give us a better idea of the distribution of queries. Are there some really common things that people bail on, or is it all low frequency? One day isn't enough to tell, though it looks like the long tail is very long since @mpopov dropped the unique items, leaving a pretty short list.
Mon, Oct 2, 7:19 PM · Discovery-Search (Current work), Discovery, CirrusSearch

Thu, Sep 28

mpopov added a comment to T176997: Extract a set of a few hundred most popular abandoned queries.

Using just the event logging data from 2017-08-01 to today (2017-09-28), here's a glimpse at queries from abandoned searches:

Thu, Sep 28, 6:36 PM · Discovery-Search (Current work), Discovery, CirrusSearch
mpopov claimed T175048: Search Relevance Survey test #3: analysis of test.
Thu, Sep 28, 4:43 PM · Discovery-Analysis (Current work), Discovery
mpopov moved T171215: Interleaved results A/B test: analysis of data from In progress to Needs review on the Discovery-Analysis (Current work) board.

Bootstrapping finally finished -_- second draft up at https://people.wikimedia.org/~bearloga/reports/ltr-test.html

Thu, Sep 28, 4:39 PM · Discovery-Search (Current work), Discovery-Analysis (Current work), Discovery, CirrusSearch
mpopov removed a project from T170022: Map analytics : Patch-For-Review.

In beta:

Thu, Sep 28, 1:21 AM · Discovery-Analysis (Current work), Discovery

Wed, Sep 27

Quiddity awarded T150215: [Dashboard][Search] Sparklines for KPIs a Love token.
Wed, Sep 27, 10:26 PM · Patch-For-Review, Discovery-Analysis (Current work), Discovery
mpopov added a comment to T176639: Replace references to dbstore1002 by db1047 in reportupdater jobs.

@mforns: we specify the analytics-store hostname in our R package (the function that makes sql queries: https://github.com/wikimedia/wikimedia-discovery-wmf/blob/master/R/mysql.R#L39--L76) which is used for querying both wiki content dbs as well as the log db. If we add a type argument that sets the hostname ("db1047" in case of type == "events", for example), what hostname should we use for non-eventlogging queries?

Wed, Sep 27, 10:06 PM · Patch-For-Review, Analytics-Kanban
mpopov added a comment to T112170: Model user behavior and detect when reality heavily deviated from expectation.

Deployed at https://discovery.wmflabs.org/forecasts/

Wed, Sep 27, 4:14 PM · Discovery-Analysis (Current work), Discovery
mpopov committed rWDDE4c990fcea85d: Tab doc path fix (authored by mpopov).
Tab doc path fix
Wed, Sep 27, 4:08 PM

Tue, Sep 26

mpopov committed rWDDE9c6181d3cbcd: Make develop the default branch (authored by mpopov).
Make develop the default branch
Tue, Sep 26, 4:32 PM
mpopov committed rWDDE825202e79278: Edit Project Config (authored by mpopov).
Edit Project Config
Tue, Sep 26, 4:23 PM
mpopov committed rWDDEe7d35d4e2195: Edit Project Config (authored by mpopov).
Edit Project Config
Tue, Sep 26, 4:21 PM
mpopov committed rWDDEb68c155cd39d: Update link to code repo (authored by mpopov).
Update link to code repo
Tue, Sep 26, 4:17 PM

Mon, Sep 25

mpopov added a comment to T176639: Replace references to dbstore1002 by db1047 in reportupdater jobs.

Is there a valid reason why analytics-slave shouldn't be used? Are we talking about eventlogging-queries?

Mon, Sep 25, 5:48 PM · Patch-For-Review, Analytics-Kanban
mpopov added a comment to T176194: Have CI run lintr for analytics/wmde/WDCM R files.

I did find 1 other gerrit repo referencing lintr, wikimedia-discovery-polloi.
@mpopov might be able to help me understand lintr more as that repo has a .lintr file https://github.com/wikimedia/wikimedia-discovery-polloi/blob/master/.lintr and a testthat test for syntax https://github.com/wikimedia/wikimedia-discovery-polloi/blob/master/tests/testthat/test-syntax.R

Mon, Sep 25, 5:16 PM · WMDE-QWERTY-Team-Board, WMDE-QWERTY-Sprint-2017-09-19, Continuous-Integration-Infrastructure (shipyard), Patch-For-Review, WMDE-Analytics-Engineering, User-Addshore

Sep 22 2017

mpopov moved T171215: Interleaved results A/B test: analysis of data from Needs review to In progress on the Discovery-Analysis (Current work) board.
Sep 22 2017, 5:33 PM · Discovery-Search (Current work), Discovery-Analysis (Current work), Discovery, CirrusSearch
mpopov moved T170022: Map analytics from In progress to Needs review on the Discovery-Analysis (Current work) board.
Sep 22 2017, 5:32 PM · Discovery-Analysis (Current work), Discovery
mpopov moved T172453: Keep more data longer (dashboard or otherwise) from Needs review to Done on the Discovery-Analysis (Current work) board.

Deployed to prod. Good job, @chelsyx!

Sep 22 2017, 4:23 PM · Discovery-Analysis (Current work), Discovery
mpopov awarded T172452: API usage: break out internal vs external a Like token.
Sep 22 2017, 4:23 PM · Discovery-Analysis (Current work), Discovery
mpopov moved T172452: API usage: break out internal vs external from Needs review to Done on the Discovery-Analysis (Current work) board.

Deployed to prod. Good job, @chelsyx!

Sep 22 2017, 4:23 PM · Discovery-Analysis (Current work), Discovery

Sep 21 2017

mpopov moved T168967: Upload shiny-server .deb to our Jessie apt repository from Needs triage to Tracking on the Discovery-Analysis board.
Sep 21 2017, 8:42 PM · Discovery-Analysis, Discovery, Operations, Discovery-Search (Current work)
mpopov added a comment to T170022: Map analytics .

Result of running T170022#3611637:

Sep 21 2017, 8:01 PM · Discovery-Analysis (Current work), Discovery

Sep 20 2017

mpopov added a comment to T131795: Create a parameterized report template for search team's A/B tests.

I think the final step is to transfer the repo from @chelsyx's personal GitHub account over to Gerrit (see Gerrit/New repositories for instructions) and add licensing info.

Sep 20 2017, 10:06 PM · Discovery-Analysis (Current work), Discovery
mpopov added a comment to T170022: Map analytics .

Still need to add the logic that auto-selects "(None)" in the languages list if the user selects "Commons" in the projects list, but here's what I have so far:

Sep 20 2017, 1:19 AM · Discovery-Analysis (Current work), Discovery

Sep 18 2017

debt awarded T176165: Minor UI update: add marker for map tile usage on dashboard a Like token.
Sep 18 2017, 8:21 PM · Patch-For-Review, Discovery-Analysis (Current work), Discovery
mpopov moved T170022: Map analytics from Backlog to In progress on the Discovery-Analysis (Current work) board.
Sep 18 2017, 8:16 PM · Discovery-Analysis (Current work), Discovery
mpopov moved T176165: Minor UI update: add marker for map tile usage on dashboard from In progress to Done on the Discovery-Analysis (Current work) board.

Deployed

Sep 18 2017, 8:15 PM · Patch-For-Review, Discovery-Analysis (Current work), Discovery
mpopov claimed T176165: Minor UI update: add marker for map tile usage on dashboard.
Sep 18 2017, 7:44 PM · Patch-For-Review, Discovery-Analysis (Current work), Discovery

Sep 16 2017

mpopov added a comment to T170022: Map analytics .

R script & Hive query that finds static map thumbnail requests and then uses those to find the pages that have a mapframe and how many pageviews those pages have and the total pageviews the respective project has:

Sep 16 2017, 1:05 AM · Discovery-Analysis (Current work), Discovery

Sep 15 2017

mpopov added a comment to T175648: Turn on test for language links.

Haven't seen esclicks yet but the hover-on/offs appear to be working along with the rest of the test:

Sep 15 2017, 8:56 PM · Discovery-Search (Current work), CirrusSearch, Discovery

Sep 14 2017

mpopov updated subscribers of T112170: Model user behavior and detect when reality heavily deviated from expectation.
Sep 14 2017, 11:55 PM · Discovery-Analysis (Current work), Discovery
mpopov moved T170468: Dashboard: Search results page - dwell time metric from Needs review to Done on the Discovery-Analysis (Current work) board.

I don't remember any other changes we wanted to make and since it's deployed to production (https://discovery.wmflabs.org/metrics/#spr_surv), I'm moving this ticket to Done.

Sep 14 2017, 10:09 PM · Discovery-Analysis (Current work), Discovery
mpopov moved T171215: Interleaved results A/B test: analysis of data from In progress to Needs review on the Discovery-Analysis (Current work) board.

First draft up at https://people.wikimedia.org/~bearloga/reports/ltr-test.html

Sep 14 2017, 8:26 PM · Discovery-Search (Current work), Discovery-Analysis (Current work), Discovery, CirrusSearch
mpopov moved T170022: Map analytics from In progress to Backlog on the Discovery-Analysis (Current work) board.

Update: I fixed the query for prevalence stats in https://people.wikimedia.org/~bearloga/reports/maps-usage.html -- specifically I am now counting only pages that are articles and that are not redirects. I also added an "% of sessions that activated mapframe" to https://people.wikimedia.org/~bearloga/reports/maps-interactions.html

Sep 14 2017, 6:20 PM · Discovery-Analysis (Current work), Discovery

Sep 7 2017

mpopov added a comment to T174106: Search Relevance Survey test #3: action items.

How about…

Sep 7 2017, 5:13 PM · MW-1.30-release-notes (WMF-deploy-2017-09-19 (1.30.0-wmf.19)), MW-1.31-release-notes (WMF-deploy-2017-09-26 (1.31.0-wmf.1)), Discovery-Search (Current work), Discovery
mpopov added a comment to T174106: Search Relevance Survey test #3: action items.

@mpopov Should i trim down the list of questions asked for this next test, or keep all 4 variations? Or maybe remove some variations and add some new ones?

Sep 7 2017, 4:48 PM · MW-1.30-release-notes (WMF-deploy-2017-09-19 (1.30.0-wmf.19)), MW-1.31-release-notes (WMF-deploy-2017-09-26 (1.31.0-wmf.1)), Discovery-Search (Current work), Discovery

Aug 31 2017

mpopov moved T171215: Interleaved results A/B test: analysis of data from Backlog to In progress on the Discovery-Analysis (Current work) board.
Aug 31 2017, 10:44 PM · Discovery-Search (Current work), Discovery-Analysis (Current work), Discovery, CirrusSearch
mpopov moved T170468: Dashboard: Search results page - dwell time metric from In progress to Needs review on the Discovery-Analysis (Current work) board.
Aug 31 2017, 10:44 PM · Discovery-Analysis (Current work), Discovery
mpopov committed R1821:7722e51030c0: [WIP] Add functions for working with interleaved experiments (authored by mpopov).
[WIP] Add functions for working with interleaved experiments
Aug 31 2017, 10:12 PM
mpopov committed R1821:e7bbef6e2ad4: [WIP] Add functions for working with interleaved experiments (authored by mpopov).
[WIP] Add functions for working with interleaved experiments
Aug 31 2017, 9:56 PM
mpopov moved T172452: API usage: break out internal vs external from Needs review to In progress on the Discovery-Analysis (Current work) board.
Aug 31 2017, 9:55 PM · Discovery-Analysis (Current work), Discovery
mpopov moved T170468: Dashboard: Search results page - dwell time metric from Needs review to In progress on the Discovery-Analysis (Current work) board.
Aug 31 2017, 9:55 PM · Discovery-Analysis (Current work), Discovery
mpopov removed a project from T172452: API usage: break out internal vs external: Patch-For-Review.

Up on beta: http://discovery-beta.wmflabs.org/metrics/#referer_breakdown

Aug 31 2017, 8:00 PM · Discovery-Analysis (Current work), Discovery
mpopov removed a project from T170468: Dashboard: Search results page - dwell time metric: Patch-For-Review.

Up on beta: http://discovery-beta.wmflabs.org/metrics/#spr_surv

Aug 31 2017, 7:57 PM · Discovery-Analysis (Current work), Discovery

Aug 30 2017

debt awarded T172425: Add Mikhail and Chelsy to WMF-NDA group a Like token.
Aug 30 2017, 9:15 PM · User-bd808, Discovery-Analysis, WMF-NDA-Requests
mpopov added a comment to T172425: Add Mikhail and Chelsy to WMF-NDA group.

Thanks!

Aug 30 2017, 5:39 PM · User-bd808, Discovery-Analysis, WMF-NDA-Requests
mpopov added a comment to T136017: Analyse results of the swap2and3 search test.

Good work, @chelsyx! Minor changes here and there: https://github.com/wikimedia-research/Discovery-Search-Test-Swap2and3/pull/1

Aug 30 2017, 12:50 AM · Discovery-Analysis (Current work), Discovery
mpopov added a comment to T164857: A/B Test: explore similar - analysis of results .
  • Reads really nice; super easy to follow along
  • "not comparing apple to apple" => "not comparing apples to apples"
  • "According to our eventlogging schema" => "According to our EL schema" (since you already introduced the term above)
Aug 30 2017, 12:18 AM · Discovery-Analysis (Current work), Discovery

Aug 29 2017

mpopov moved T174512: [Blog Post] Applying epidemiology techniques to browser tabs from Needs triage to Tracking on the Discovery-Analysis board.
Aug 29 2017, 9:48 PM · Discovery, Discovery-Analysis, Wikimedia-Blog-Content
mpopov created T174512: [Blog Post] Applying epidemiology techniques to browser tabs.
Aug 29 2017, 9:47 PM · Discovery, Discovery-Analysis, Wikimedia-Blog-Content
mpopov added a comment to T173049: Investigate mobile map gadget for eventlogging.

Ah, got it. Cool, that makes things easier! Yeah, adding some kind of a marker to the extra field somewhere in https://github.com/wikimedia/mediawiki-extensions-WikimediaEvents/blob/master/modules/ext.wikimediaEvents.kartographer.js#L141--L232 would be the way to go.

Aug 29 2017, 9:26 PM · Patch-For-Review, Maps-Sprint, Discovery-Analysis, Discovery
mpopov claimed T171215: Interleaved results A/B test: analysis of data.
Aug 29 2017, 9:03 PM · Discovery-Search (Current work), Discovery-Analysis (Current work), Discovery, CirrusSearch
mpopov removed a project from T170022: Map analytics : Patch-For-Review.
Aug 29 2017, 9:03 PM · Discovery-Analysis (Current work), Discovery
mpopov removed a project from T170494: [EPIC] Reconfigure Discovery-Stats on Analytics Cluster: Patch-For-Review.
Aug 29 2017, 9:03 PM · Discovery-Analysis (Current work), Discovery
mpopov removed a project from T172452: API usage: break out internal vs external: Patch-For-Review.
Aug 29 2017, 9:03 PM · Discovery-Analysis (Current work), Discovery
mpopov removed a project from T170468: Dashboard: Search results page - dwell time metric: Patch-For-Review.
Aug 29 2017, 9:02 PM · Discovery-Analysis (Current work), Discovery
mpopov added a comment to T170468: Dashboard: Search results page - dwell time metric.

Backfilling data from 2017-04-01 through 2017-08-28. Adding that data to the dashboard should be relatively straightforward.

Aug 29 2017, 8:40 PM · Discovery-Analysis (Current work), Discovery
mpopov added a comment to T173049: Investigate mobile map gadget for eventlogging.

@TheDJ Event logging would be quite an undertaking, but we can start with tracking tiles requested specifically by the gadget (as opposed to mobile web in general). When the gadget (or leaflet, maybe?) makes the API calls to Kartotherian for the tiles, do you use a custom user agent or are you able to specify one? Because if you can specify a custom user agent, on our side we can then look for that UA specifically when we count tiles served. A good UA would include name of gadget & URL or your name & contact info, for example.

Aug 29 2017, 8:34 PM · Patch-For-Review, Maps-Sprint, Discovery-Analysis, Discovery
mpopov added a comment to T172452: API usage: break out internal vs external.

@chelsyx there should also be a tab that shows the total usage (across all APIs) broken down by referrer with the option to switch between raw counts and %s

Aug 29 2017, 5:59 PM · Discovery-Analysis (Current work), Discovery
mpopov added a comment to T153856: Add lint/CI to all wikimedia/discovery analytics repositories.

And obviously /usr/local/lib/R/site-library is empty. If I change the repos from HTTPS to HTTP, it works fine. What I suspect is that install.packages depends on another package to be installed in order to support HTTPS.

Aug 29 2017, 5:08 PM · Patch-For-Review, Release-Engineering-Team (Watching / External), Discovery-Analysis (Current work), Discovery, Continuous-Integration-Config

Aug 28 2017

mpopov added a comment to T171740: [Epic] Search Relevance: graded by humans.

Final draft up at https://wikimedia-research.github.io/Discovery-Search-Adhoc-SurveyMVP/

Aug 28 2017, 10:45 PM · Epic, Discovery-Search (Current work), Discovery
mpopov added a comment to T172452: API usage: break out internal vs external.

Marking search_api_usage for a recount and then recounting using the new UDF so we have referrer breakdown for the past 60 days:

Aug 28 2017, 8:22 PM · Discovery-Analysis (Current work), Discovery
mpopov claimed T170468: Dashboard: Search results page - dwell time metric.
Aug 28 2017, 5:01 PM · Discovery-Analysis (Current work), Discovery
mpopov edited projects for T174110: Private data access for non-person user that calculates metrics, added: Discovery-Analysis; removed Discovery-Analysis (Current work), Patch-For-Review.
Aug 28 2017, 4:59 PM · Discovery-Analysis, Analytics-Kanban, Discovery
mpopov moved T170494: [EPIC] Reconfigure Discovery-Stats on Analytics Cluster from In progress to Stalled/Waiting on the Discovery-Analysis (Current work) board.
Aug 28 2017, 4:58 PM · Discovery-Analysis (Current work), Discovery

Aug 24 2017

mpopov added a comment to T172452: API usage: break out internal vs external.

I just checked the refinery commits log and the UDF is available now :) "Add refinery-source jars for v0.0.51 to artifacts" https://github.com/wikimedia/analytics-refinery/commit/712bf13a8689fda40530c072384d355b1dd694d5

Aug 24 2017, 11:37 PM · Discovery-Analysis (Current work), Discovery
mpopov added a comment to T174110: Private data access for non-person user that calculates metrics.

P.S. I should also add that we currently have several teams without performance metrics from the past 10 days (and counting), so getting this done is pretty important — hence the high priority. On 13 August 2017 I asked Guillaume to fix the permissions on the datasets so that I could run golden/main.sh as myself just to backfill metrics that we were missing since 23 July 2017 by that point. We can go back to that running-under-staff-account solution but that's just not sustainable (as discussed at length in T129260), so the switch to a non-person executing these scripts has to be done anyway.

Aug 24 2017, 10:36 PM · Discovery-Analysis, Analytics-Kanban, Discovery
mpopov closed T173421: Puppetization of Discovery Dashboards as Resolved.

Post up at https://blog.wikimedia.org/2017/08/21/discovery-dashboards-puppet/

Aug 24 2017, 10:07 PM · Discovery-Analysis, Wikimedia-Blog-Content
mpopov moved T174110: Private data access for non-person user that calculates metrics from Needs triage to Tracking on the Discovery-Analysis board.
Aug 24 2017, 10:06 PM · Discovery-Analysis, Analytics-Kanban, Discovery
mpopov edited projects for T174110: Private data access for non-person user that calculates metrics, added: Discovery-Analysis; removed Discovery-Analysis (Current work).
Aug 24 2017, 10:06 PM · Discovery-Analysis, Analytics-Kanban, Discovery
mpopov created T174110: Private data access for non-person user that calculates metrics.
Aug 24 2017, 10:05 PM · Discovery-Analysis, Analytics-Kanban, Discovery
mpopov renamed T170494: [EPIC] Reconfigure Discovery-Stats on Analytics Cluster from Reconfigure Discovery-Stats on Analytics Cluster to [EPIC] Reconfigure Discovery-Stats on Analytics Cluster.
Aug 24 2017, 9:18 PM · Discovery-Analysis (Current work), Discovery