mpopov (Mikhail Popov)
Data Analyst

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Sunday

  • Clear sailing ahead.

User Details

User Since
Jul 27 2015, 4:15 PM (107 w, 3 d)
Availability
Available
IRC Nick
bearloga
LDAP User
Bearloga
MediaWiki User
MPopov (WMF)

Data Analyst in Reading (formerly of Discovery) | User:MPopov (WMF) | Highlighted Works

Recent Activity

Yesterday

mpopov added a comment to T171740: [Epic] Search Relevance: graded by humans.

@mpopov—the graphs look good. As mentioned on IRC, percentages or some other normalization would be helpful in figuring out the best response rates among the question formats and comparing yes/no/etc. rates among answers.

By eye, it looks like "would they want to read this article" gets slightly more engagement, and "would this article be relevant" and "would you click on this page" get slightly less, but I wouldn't be surprised if they were all statistically indistinguishable. I wonder if the question format has any effect on yes/no ratios, too. There may not be enough data to tell, though.

Erik pointed out that people don't like Ian Bannen (actor in the 1970s version of Tinker Tailor Soldier Spy) very much, but if you go by a simple ratio of yes/no votes, he still comes in 3rd, which is reasonable. (Ha! I just got the survey while looking at his page. It seemed only fair to dismiss it, though I wanted to vote yes.)

I think the results are promising. In places where the wisdom of the crowd disagrees with me, I think the results are understandable. For example, yesterday beetles gets all horrible results. But the least horrible is a different John Lennon song. That is at least tangentially related—it's a bad result, but it is also the best result.

I also wonder if the timeout proportion is a useful signal, or even a lack of responses (that points to a lack of popularity for the results page, at least). Seems possible, but it's not immediately clear how to use them.

Thu, Aug 17, 9:29 PM · MW-1.30-release-notes (WMF-deploy-2017-08-01_(1.30.0-wmf.12)), Patch-For-Review, Epic, Discovery-Search (Current work), Discovery
mpopov moved T173421: Puppetization of Discovery Dashboards from Needs triage to Tracking on the Discovery-Analysis board.
Thu, Aug 17, 6:32 PM · Discovery-Analysis, Wikimedia-Blog-Content
mpopov moved T165492: Find out which namespace combinations are used for searching from In progress to Done on the Discovery-Analysis (Current work) board.
Thu, Aug 17, 6:29 PM · Discovery, Discovery-Analysis (Current work), Advanced-Search, TCB-Team, German-Community-Wishlist
debt awarded T165861: Use search log to find currently existing namespace combinations a Like token.
Thu, Aug 17, 6:06 PM · Advanced-Search, Discovery-Analysis (Current work), TCB-Team, German-Community-Wishlist

Wed, Aug 16

mpopov added a comment to T165492: Find out which namespace combinations are used for searching.

From T165861#3529293:

Wed, Aug 16, 9:48 PM · Discovery, Discovery-Analysis (Current work), Advanced-Search, TCB-Team, German-Community-Wishlist
mpopov added a comment to T165861: Use search log to find currently existing namespace combinations.

@mpopov Do you think you will be able to give us insights in the next days? Our principle investigator of namespace correlations is only available until the end of next week. So if you manage to get back to it before, it would make it much easier for us to evaluate :)

Wed, Aug 16, 9:47 PM · Advanced-Search, Discovery-Analysis (Current work), TCB-Team, German-Community-Wishlist

Mon, Aug 14

mpopov added a comment to T171740: [Epic] Search Relevance: graded by humans.

Results from the first test:

Mon, Aug 14, 11:55 PM · MW-1.30-release-notes (WMF-deploy-2017-08-01_(1.30.0-wmf.12)), Patch-For-Review, Epic, Discovery-Search (Current work), Discovery
mpopov added a comment to T170468: Dashboard: Search results page - dwell time metric.

Code for estimating and visualizing dwell-time on full-text search results pages arrived at from autocomplete search: https://github.com/wikimedia-research/Discovery-Search-Adhoc-SRPDwellTime

Mon, Aug 14, 8:03 PM · Discovery-Analysis (Current work), Discovery
mpopov added a comment to T172453: Keep more data longer (dashboard or otherwise).

Can we use golden to collect those data that are not on dashboard and keep them in /srv/published-datasets/discovery?
For those on the dashboard but with a max_data_points limit, can we just create extra reports and remove the max_data_points limit?
@mpopov Any other ideas? ;)

Mon, Aug 14, 7:33 PM · Discovery-Analysis (Current work), Discovery
mpopov added a comment to T172960: [Dashboard] Paulscore calculation sums duplicated clicks on the same position.

Thanks, good job!

Mon, Aug 14, 6:38 PM · Patch-For-Review, Discovery-Analysis (Current work)
mpopov closed T172740: discovery-stats user does not have access to mysql config and published datasets as Resolved.

Yep, it looks like the patch worked! :D

Mon, Aug 14, 5:35 PM · Patch-For-Review, Discovery-Search (Current work), Discovery-Analysis, Discovery
mpopov moved T173333: Reportupdater outputs files with restricted permissions from Needs triage to Tracking on the Discovery-Analysis board.
Mon, Aug 14, 5:04 PM · Analytics-Kanban, Discovery-Analysis, Patch-For-Review, Discovery
mpopov edited projects for T173333: Reportupdater outputs files with restricted permissions, added: Discovery-Analysis; removed Discovery-Analysis (Current work).
Mon, Aug 14, 5:04 PM · Analytics-Kanban, Discovery-Analysis, Patch-For-Review, Discovery
mpopov created T173333: Reportupdater outputs files with restricted permissions.
Mon, Aug 14, 4:46 PM · Analytics-Kanban, Discovery-Analysis, Patch-For-Review, Discovery

Sun, Aug 13

mpopov added a comment to T165861: Use search log to find currently existing namespace combinations.

They don't? I thought that profile = a bundle of namespaces? What is a profile then?

Sun, Aug 13, 11:21 PM · Advanced-Search, Discovery-Analysis (Current work), TCB-Team, German-Community-Wishlist

Wed, Aug 9

mpopov added a comment to T165861: Use search log to find currently existing namespace combinations.

@mpopov
In the current search interface, there are four option: Content articles, multimedia, everything and advanced. When you click on advanced, you get the table of all namespaces and can choose them individually. If we understand your query correctly, you only look at searches that have profile=advanced in the url. The first three options have other profiles though. For our need, we would need to have these searches included, too.

Wed, Aug 9, 9:42 PM · Advanced-Search, Discovery-Analysis (Current work), TCB-Team, German-Community-Wishlist
mpopov added a comment to T165861: Use search log to find currently existing namespace combinations.

@mpopov was it intentional to not include ns0, the article/main namespace in the results of the query?

Wed, Aug 9, 5:08 PM · Advanced-Search, Discovery-Analysis (Current work), TCB-Team, German-Community-Wishlist

Mon, Aug 7

mpopov added a comment to T172581: Set up mechanism for archiving Search Console data.

Status update: JK will look into giving Chelsy and/or me some kind of access so we can take a look into it. I have previous experience with Google APIs and building R bindings to web APIs, so that will be helpful here :D

Mon, Aug 7, 9:57 PM · SEO, Reading-analysis, Discovery-Analysis

Sat, Aug 5

mpopov committed R1821:48b8d10eebd4: [WIP] Add functions for working with interleaved experiments (authored by mpopov).
[WIP] Add functions for working with interleaved experiments
Sat, Aug 5, 3:10 AM
mpopov committed R1821:388fef5f1a1f: [WIP] Add functions for working with interleaved experiments (authored by mpopov).
[WIP] Add functions for working with interleaved experiments
Sat, Aug 5, 3:10 AM
mpopov committed R1821:fd52edb61748: [WIP] Add functions for working with interleaved experiments (authored by mpopov).
[WIP] Add functions for working with interleaved experiments
Sat, Aug 5, 3:10 AM
mpopov added a comment to T150032: Add support for interleaved results in 2-way A/B test.

@EBernhardson: I went through that large-scale article. Right now I'm writing tools for calculating sample size in interleaved experiments and analyzing results.

Sat, Aug 5, 12:18 AM · MW-1.30-release-notes (WMF-deploy-2017-08-01_(1.30.0-wmf.12)), Patch-For-Review, Discovery-Search (Current work), Discovery, CirrusSearch

Thu, Aug 3

mpopov updated subscribers of T172425: Add Mikhail and Chelsy to WMF-NDA group.
Thu, Aug 3, 7:54 PM · Discovery-Analysis, WMF-NDA-Requests
mpopov moved T172425: Add Mikhail and Chelsy to WMF-NDA group from Needs triage to Tracking on the Discovery-Analysis board.
Thu, Aug 3, 6:40 PM · Discovery-Analysis, WMF-NDA-Requests
mpopov created T172425: Add Mikhail and Chelsy to WMF-NDA group.
Thu, Aug 3, 6:40 PM · Discovery-Analysis, WMF-NDA-Requests

Tue, Aug 1

mpopov updated the task description for T171531: Quarterly metrics prep: Maps.
Tue, Aug 1, 9:35 PM · Discovery-Analysis (Current work), Discovery
mpopov updated the task description for T171529: Quarterly metrics prep: Portal.
Tue, Aug 1, 9:34 PM · Discovery-Analysis (Current work), Discovery

Mon, Jul 31

mpopov moved T171529: Quarterly metrics prep: Portal from Backlog to In progress on the Discovery-Analysis (Current work) board.
Mon, Jul 31, 8:55 PM · Discovery-Analysis (Current work), Discovery
mpopov moved T170494: Reconfigure Discovery-Stats on Analytics Cluster from Needs review to Done on the Discovery-Analysis (Current work) board.
Mon, Jul 31, 8:53 PM · Patch-For-Review, Discovery-Analysis (Current work), Discovery
mpopov added a comment to T171622: Add purge info for Kartographer schema.

I think all the fields in the schema can be white-listed and kept indefinitely (except from EventCapsule's userAgent).
I was assuming you wanted to keep the data for longer.
Otherwise, there's no action needed, because the default behavior for new schemas is "auto-purge after 90 days".

Mon, Jul 31, 8:00 PM · Patch-For-Review, Analytics-Kanban, Discovery-Analysis, Discovery
mpopov moved T171531: Quarterly metrics prep: Maps from Backlog to In progress on the Discovery-Analysis (Current work) board.
Mon, Jul 31, 6:49 PM · Discovery-Analysis (Current work), Discovery
mpopov moved T170022: Map analytics from In progress to Backlog on the Discovery-Analysis (Current work) board.
Mon, Jul 31, 6:49 PM · Patch-For-Review, Discovery-Analysis (Current work), Discovery
mpopov added a comment to T171622: Add purge info for Kartographer schema.

Based on WikimediaEvents/kartographer.js#L13:

Mon, Jul 31, 4:24 PM · Patch-For-Review, Analytics-Kanban, Discovery-Analysis, Discovery

Fri, Jul 28

mpopov moved T172009: Add referer to WebrequestData from Needs triage to Tracking on the Discovery-Analysis board.
Fri, Jul 28, 9:18 PM · Analytics, Discovery-Analysis, Discovery
mpopov created T172009: Add referer to WebrequestData.
Fri, Jul 28, 9:17 PM · Analytics, Discovery-Analysis, Discovery
mpopov moved T170022: Map analytics from Stalled/Waiting to In progress on the Discovery-Analysis (Current work) board.
Fri, Jul 28, 9:12 PM · Patch-For-Review, Discovery-Analysis (Current work), Discovery
mpopov moved T171985: SERP tagging from In progress to Stalled/Waiting on the Discovery-Analysis (Current work) board.
Fri, Jul 28, 9:12 PM · Discovery-Analysis (Current work), Discovery
mpopov renamed T171985: SERP tagging from UDF for detecting when a URI is a SERP to SERP tagging.
Fri, Jul 28, 9:12 PM · Discovery-Analysis (Current work), Discovery
mpopov removed a parent task for T171985: SERP tagging: T171530: Quarterly metrics prep: Search.
Fri, Jul 28, 9:10 PM · Discovery-Analysis (Current work), Discovery
mpopov removed a subtask for T171530: Quarterly metrics prep: Search: T171985: SERP tagging.
Fri, Jul 28, 9:10 PM · Discovery-Analysis (Current work), Discovery
mpopov raised the priority of T171985: SERP tagging from Normal to High.
Fri, Jul 28, 7:29 PM · Discovery-Analysis (Current work), Discovery
mpopov moved T170022: Map analytics from In progress to Stalled/Waiting on the Discovery-Analysis (Current work) board.
Fri, Jul 28, 7:20 PM · Patch-For-Review, Discovery-Analysis (Current work), Discovery
mpopov moved T171985: SERP tagging from Backlog to In progress on the Discovery-Analysis (Current work) board.
Fri, Jul 28, 7:19 PM · Discovery-Analysis (Current work), Discovery
mpopov created T171985: SERP tagging.
Fri, Jul 28, 7:19 PM · Discovery-Analysis (Current work), Discovery

Thu, Jul 27

mpopov created T171904: Running multiple Search A/B tests in parallel.
Thu, Jul 27, 8:53 PM · Discovery-Search, Discovery-Analysis, Discovery
mpopov added a comment to T171790: chooseCRANmirror() and install.packages problems in R on production.
Sys.setenv(
  http_proxy = "http://webproxy.eqiad.wmnet:8080",
  https_proxy = "http://webproxy.eqiad.wmnet:8080"
)
install.packages("dplyr", repos = c(CRAN = "https://www.stats.bris.ac.uk/R/"))
Thu, Jul 27, 1:29 AM · Analytics

Wed, Jul 26

mpopov moved T170022: Map analytics from Backlog to In progress on the Discovery-Analysis (Current work) board.
Wed, Jul 26, 6:36 PM · Patch-For-Review, Discovery-Analysis (Current work), Discovery
mpopov added a comment to T164857: A/B Test: explore similar - analysis of results .

Haven't started properly working on this but I did just upload the auto-generated report to stat1005:/srv/published-datasets/discovery/reports/

Wed, Jul 26, 6:19 PM · Discovery-Analysis (Current work), Discovery
mpopov moved T170494: Reconfigure Discovery-Stats on Analytics Cluster from In progress to Needs review on the Discovery-Analysis (Current work) board.
Wed, Jul 26, 6:09 PM · Patch-For-Review, Discovery-Analysis (Current work), Discovery
mpopov moved T170494: Reconfigure Discovery-Stats on Analytics Cluster from Backlog to In progress on the Discovery-Analysis (Current work) board.
Wed, Jul 26, 3:55 PM · Patch-For-Review, Discovery-Analysis (Current work), Discovery
mpopov moved T170022: Map analytics from In progress to Backlog on the Discovery-Analysis (Current work) board.
Wed, Jul 26, 3:55 PM · Patch-For-Review, Discovery-Analysis (Current work), Discovery
mpopov claimed T170494: Reconfigure Discovery-Stats on Analytics Cluster.
Wed, Jul 26, 3:54 PM · Patch-For-Review, Discovery-Analysis (Current work), Discovery

Tue, Jul 25

mpopov updated subscribers of T171020: Maps: figure out usage of layer data in Wikivoyage maps.

I am not familiar enough with the event object in the context of Maps and had to make a lot of assumptions based on the surrounding code, so I'm not 100% sure that I'm correct in trying to get the name of the layer via event.layer in https://gerrit.wikimedia.org/r/#/c/366183/2/modules/wikivoyage/WVMapLayers.js. I'm hoping @MaxSem can confirm that I did it correctly or provide some advice for how to do it properly.

Tue, Jul 25, 8:02 PM · MW-1.30-release-notes (WMF-deploy-2017-08-01_(1.30.0-wmf.12)), Patch-For-Review, Discovery-Analysis (Current work), Maps-Sprint, Discovery
mpopov added a comment to T171637: Re-install shinydashboard package.

CRAN submission policy recommends waiting like 6 months before submitting another version and the most recent version available on CRAN went up on 2017-06-14, so the version that we actually want will probably go up in like 5-6 months.

Tue, Jul 25, 7:57 PM · Discovery-Analysis, Discovery
mpopov added a comment to T170995: Setup a mirror for R language dependencies (CRAN).

From @Ottomata at https://gerrit.wikimedia.org/r/#/c/366170/:

Tue, Jul 25, 5:51 PM · Discovery-Analysis, Continuous-Integration-Infrastructure, Operations, Release-Engineering-Team (Watching / External), Discovery
mpopov moved T171622: Add purge info for Kartographer schema from Needs triage to Tracking on the Discovery-Analysis board.
Tue, Jul 25, 5:14 PM · Patch-For-Review, Analytics-Kanban, Discovery-Analysis, Discovery
mpopov added projects to T171622: Add purge info for Kartographer schema: Discovery, Discovery-Analysis.
Tue, Jul 25, 5:14 PM · Patch-For-Review, Analytics-Kanban, Discovery-Analysis, Discovery
mpopov created T171622: Add purge info for Kartographer schema.
Tue, Jul 25, 5:13 PM · Patch-For-Review, Analytics-Kanban, Discovery-Analysis, Discovery

Mon, Jul 24

mpopov added a comment to T131795: Create a parameterized report template for search team's A/B tests.

Great job with this, @chelsyx!!! This is going to be such a useful tool when it's done! (Which it almost is! :P)

Mon, Jul 24, 11:37 PM · Discovery-Analysis (Current work), Discovery
mpopov added a comment to T170724: Dashboards: using a specific tab url doesn't work.

Nice! Great job @chelsyx! I was so frustrated when the thing supposed to work but didn't.

Mon, Jul 24, 11:28 PM · Discovery-Analysis (Current work), Discovery

Thu, Jul 20

mpopov closed T171187: Public folder on stat1005 for Discovery's A/B test reports as Resolved.

We're okay with that :)

Thu, Jul 20, 5:00 PM · Analytics, Discovery-Analysis, Discovery
mpopov closed T171187: Public folder on stat1005 for Discovery's A/B test reports, a subtask of T131795: Create a parameterized report template for search team's A/B tests, as Resolved.
Thu, Jul 20, 5:00 PM · Discovery-Analysis (Current work), Discovery
mpopov awarded T168683: Upgrade pandoc package to at least 1.12.3 a Party Time token.
Thu, Jul 20, 4:17 PM · Discovery-Analysis, Operations
mpopov moved T171187: Public folder on stat1005 for Discovery's A/B test reports from Needs triage to Tracking on the Discovery-Analysis board.
Thu, Jul 20, 4:15 PM · Analytics, Discovery-Analysis, Discovery
mpopov created T171187: Public folder on stat1005 for Discovery's A/B test reports.
Thu, Jul 20, 4:15 PM · Analytics, Discovery-Analysis, Discovery

Wed, Jul 19

mpopov added a comment to T170494: Reconfigure Discovery-Stats on Analytics Cluster.

I just realized that reworking discovery-stats properly will require R package installation stuff from https://gerrit.wikimedia.org/r/#/c/366170/

Wed, Jul 19, 6:22 PM · Patch-For-Review, Discovery-Analysis (Current work), Discovery
mpopov moved T164854: Search Dashboard: update for engagement - sister projects from Needs review to Backlog on the Discovery-Analysis (Current work) board.
Wed, Jul 19, 6:12 PM · Discovery-Analysis (Current work), Discovery
mpopov moved T170995: Setup a mirror for R language dependencies (CRAN) from Needs triage to Tracking on the Discovery-Analysis board.
Wed, Jul 19, 5:25 PM · Discovery-Analysis, Continuous-Integration-Infrastructure, Operations, Release-Engineering-Team (Watching / External), Discovery
mpopov edited projects for T170995: Setup a mirror for R language dependencies (CRAN), added: Discovery-Analysis; removed Discovery-Analysis (Current work).

@hashar Thank you for making this ticket and emailing the R Foundation/R Development Core Team! Heh, yesterday I emailed @Ottomata & @Gehel asking if setting up our own CRAN mirror would be a reasonable thing.

Wed, Jul 19, 5:25 PM · Discovery-Analysis, Continuous-Integration-Infrastructure, Operations, Release-Engineering-Team (Watching / External), Discovery
mpopov added a comment to T163139: Review map eventlogging.
Wed, Jul 19, 5:10 PM · Discovery-Analysis, Discovery, Maps-Sprint, Maps (Kartographer)

Jul 19 2017

mpopov moved T171020: Maps: figure out usage of layer data in Wikivoyage maps from In progress to Needs review on the Discovery-Analysis (Current work) board.
Jul 19 2017, 2:12 AM · MW-1.30-release-notes (WMF-deploy-2017-08-01_(1.30.0-wmf.12)), Patch-For-Review, Discovery-Analysis (Current work), Maps-Sprint, Discovery
mpopov claimed T171020: Maps: figure out usage of layer data in Wikivoyage maps.

Found where layer selection events are implemented: https://github.com/wikimedia/mediawiki-extensions-Kartographer/blob/master/modules/wikivoyage/WVMapLayers.js

Jul 19 2017, 1:11 AM · MW-1.30-release-notes (WMF-deploy-2017-08-01_(1.30.0-wmf.12)), Patch-For-Review, Discovery-Analysis (Current work), Maps-Sprint, Discovery

Jul 18 2017

mpopov added a comment to T170494: Reconfigure Discovery-Stats on Analytics Cluster.

@mpopov, do I need to bother migrating the existing statistics::discovery stuff then? If possible we should probably remove this class, and then create a new more discovery specific one.

Jul 18 2017, 7:30 PM · Patch-For-Review, Discovery-Analysis (Current work), Discovery

Jul 14 2017

mpopov added a comment to T170183: Investigate mobile/desktop disparity on sister search numbers.

Update: French and Catalan were the only languages that use a community-developed sister search sidebar in addition to ours. I've separated out those two languages into their category but that wasn't it:

Jul 14 2017, 7:04 PM · Patch-For-Review, Discovery-Analysis (Current work), Discovery

Jul 12 2017

mpopov added a comment to T170022: Map analytics .

@debt: first draft: https://people.wikimedia.org/~bearloga/reports/maps-usage.html

Jul 12 2017, 10:33 PM · Patch-For-Review, Discovery-Analysis (Current work), Discovery
mpopov updated subscribers of T170494: Reconfigure Discovery-Stats on Analytics Cluster.

@Ottomata: is it OK if we don't get around to this until after stat1005 goes live?

Jul 12 2017, 9:59 PM · Patch-For-Review, Discovery-Analysis (Current work), Discovery
mpopov created T170494: Reconfigure Discovery-Stats on Analytics Cluster.
Jul 12 2017, 9:57 PM · Patch-For-Review, Discovery-Analysis (Current work), Discovery
mpopov added a comment to T170471: Move statistics::discovery jobs from stat1002 -> stat1005.

I need to repurpose https://github.com/wikimedia/puppet/blob/production/modules/statistics/manifests/discovery.pp to be the thing that runs https://github.com/wikimedia/wikimedia-discovery-golden instead of https://github.com/wikimedia/analytics-discovery-stats (deprecated). I'll ping you and Guillaume for CR when it's ready.

Jul 12 2017, 9:47 PM · Analytics-Kanban, Patch-For-Review, Analytics-Cluster
mpopov added a project to T143366: Removal of {{#coordinates:}} leaves DB entries behind: DBA.

Tagging DBA here because the geo_tag table grows whenever someone adds coordinates but does not shrink when coordinates are removed on-wiki and that's something they should be aware of.

Jul 12 2017, 8:25 PM · DBA, Maps-Sprint, Discovery, GeoData

Jul 11 2017

mpopov added a comment to T164854: Search Dashboard: update for engagement - sister projects.

@debt: so…are we going ahead with the idea to add another language category for languages that already have a sister project search sidebar (e.g. French)?

Jul 11 2017, 10:24 PM · Discovery-Analysis (Current work), Discovery
mpopov updated subscribers of T170022: Map analytics .

@MaxSem I'm going through your discovery-stats repo and currently taking a look at the geo_tag table. I'm noticing that sometimes there are geotags in the database that are no longer present on wiki.

Jul 11 2017, 12:32 AM · Patch-For-Review, Discovery-Analysis (Current work), Discovery

Jul 10 2017

mpopov moved T170022: Map analytics from Backlog to In progress on the Discovery-Analysis (Current work) board.
Jul 10 2017, 10:39 PM · Patch-For-Review, Discovery-Analysis (Current work), Discovery
mpopov updated subscribers of T170183: Investigate mobile/desktop disparity on sister search numbers.

@JKatzWMF would @Tbayer be able to take a look at the Hive query that is generating the dataset and confirm it is correctly counting sister search-referred pageviews by platform, wiki, etc.? Just in case Chelsy or I missed some particular detail when writing/reviewing it. The query is at https://github.com/wikimedia/wikimedia-discovery-golden/blob/master/modules/metrics/search/sister_search_traffic

Jul 10 2017, 10:22 PM · Patch-For-Review, Discovery-Analysis (Current work), Discovery
mpopov moved T168466: Investigate PaulScores for late April and May for full-text searches from In progress to Done on the Discovery-Analysis (Current work) board.

It's because we changed the sampling rates on April 19th, decreasing enwiki and increasing every other wiki. Since enwiki generally has high PaulScore, we effectively lowered the overall PaulScore by decreasing enwiki's contribution.

Jul 10 2017, 10:03 PM · Patch-For-Review, Discovery-Analysis (Current work), Discovery

Jul 8 2017

Gerrit Code Review <gerrit@wikimedia.org> committed R1821:9a203e490b7e: Modified project settings (authored by mpopov).
Modified project settings
Jul 8 2017, 4:46 AM

Jul 7 2017

mpopov claimed T170022: Map analytics .

Links for future Mikhail:

Jul 7 2017, 10:25 PM · Patch-For-Review, Discovery-Analysis (Current work), Discovery
mpopov added a comment to T164854: Search Dashboard: update for engagement - sister projects.

@debt: latest version up on beta https://discovery-beta.wmflabs.org/metrics/#sister_search_traffic :)

Jul 7 2017, 5:41 PM · Discovery-Analysis (Current work), Discovery

Jul 6 2017

mpopov moved T164854: Search Dashboard: update for engagement - sister projects from In progress to Needs review on the Discovery-Analysis (Current work) board.
Jul 6 2017, 11:18 PM · Discovery-Analysis (Current work), Discovery
mpopov moved T169175: What is a reasonable per-IP ratelimit for maps from Needs review to Done on the Discovery-Analysis (Current work) board.
Jul 6 2017, 8:13 PM · Discovery-Analysis, Operations, Traffic, Maps-Sprint, Maps, Discovery
mpopov added a comment to T169175: What is a reasonable per-IP ratelimit for maps.

As a very short summary of @mpopov's analyis:

We would not limit anyone in the sample with:

  • 350 req/sec
  • 2300 req/minute
Jul 6 2017, 6:58 PM · Discovery-Analysis, Operations, Traffic, Maps-Sprint, Maps, Discovery
mpopov moved T169125: [Dashboards] Fix spline smoothing from Needs review to Done on the Discovery-Analysis (Current work) board.
Jul 6 2017, 5:34 PM · Patch-For-Review, Discovery-Analysis (Current work)
mpopov added a comment to T169175: What is a reasonable per-IP ratelimit for maps.

@mpopov I love your graphs! They just look nice!

Jul 6 2017, 5:26 PM · Discovery-Analysis, Operations, Traffic, Maps-Sprint, Maps, Discovery

Jul 5 2017

mpopov closed T168916: Sister search result clicks missing from search event logging as Resolved.
SELECT DATE(LEFT(timestamp, 8)) AS `date`, COUNT(*) AS ssclicks
FROM (
  SELECT DISTINCT timestamp, event_uniqueId
  FROM TestSearchSatisfaction2_16909631
  WHERE
    LEFT(timestamp, 6) >= '201707'
    AND event_subTest IS NULL
    AND event_source = 'fulltext'
    AND event_action = 'ssclick'
) deduped
GROUP BY `date`
ORDER BY `date`
LIMIT 10;
Jul 5 2017, 10:15 PM · MW-1.30-release-notes (WMF-deploy-2017-06-27_(1.30.0-wmf.7)), Patch-For-Review, Discovery-Analysis (Current work), Discovery-Search (Current work), Discovery
mpopov closed T168916: Sister search result clicks missing from search event logging, a subtask of T164854: Search Dashboard: update for engagement - sister projects, as Resolved.
Jul 5 2017, 10:15 PM · Discovery-Analysis (Current work), Discovery
mpopov moved T168916: Sister search result clicks missing from search event logging from Needs review to Done on the Discovery-Analysis (Current work) board.
Jul 5 2017, 10:07 PM · MW-1.30-release-notes (WMF-deploy-2017-06-27_(1.30.0-wmf.7)), Patch-For-Review, Discovery-Analysis (Current work), Discovery-Search (Current work), Discovery
mpopov moved T164854: Search Dashboard: update for engagement - sister projects from Needs review to In progress on the Discovery-Analysis (Current work) board.
Jul 5 2017, 10:07 PM · Discovery-Analysis (Current work), Discovery
mpopov added a comment to T169175: What is a reasonable per-IP ratelimit for maps.

@Gehel said the traffic team needs to take a look at this before we can call it done.

Jul 5 2017, 8:43 PM · Discovery-Analysis, Operations, Traffic, Maps-Sprint, Maps, Discovery

Jun 30 2017

yuvipanda awarded T168683: Upgrade pandoc package to at least 1.12.3 a Evil Spooky Haunted Tree token.
Jun 30 2017, 12:50 AM · Discovery-Analysis, Operations

Jun 29 2017

mpopov moved T169175: What is a reasonable per-IP ratelimit for maps from In progress to Needs review on the Discovery-Analysis (Current work) board.

Here's the distribution of tile counts per IP address per day:

Jun 29 2017, 9:22 PM · Discovery-Analysis, Operations, Traffic, Maps-Sprint, Maps, Discovery
mpopov claimed T169175: What is a reasonable per-IP ratelimit for maps.

Aiming to have results by the end of the day.

Jun 29 2017, 4:24 PM · Discovery-Analysis, Operations, Traffic, Maps-Sprint, Maps, Discovery