Tbayer (Tilman Bayer)
Disabled

User Details

User Since
Oct 20 2014, 11:21 PM (234 w, 4 d)
Roles
Disabled
Availability
Available
IRC Nick
HaeB
LDAP User
Unknown
MediaWiki User
Tbayer (WMF) [ Global Accounts ]

Recent Activity

Tue, Apr 9

Tbayer updated the task description for T216152: AMC Navigation - add new links to main menu with click tracking.
Tue, Apr 9, 5:43 PM · MW-1.34-notes (1.34.0-wmf.3; 2019-04-30), Readers-Web-Backlog (Readers-Web-Kanbanana-Board-2018-19-Q4), Advanced Mobile Contributions
Tbayer updated subscribers of T220236: [REQUEST] Top 150 pageviews in India.

See also in general T207171 (it looks like the query given there could be reused here with minor modifications), also for some data quality caveats by @Nuria - personally I wouldn't expect these to be a big issue for this particular use case, but it's worth being aware.

Tue, Apr 9, 3:08 AM · Product-Analytics
Tbayer added a comment to T207171: Have a way to show the most popular pages per country.

Is this also an issue for the topviews that are shown per language?

Yes, it is an issue with any top list. Now, topviews has a "spam" list, so titles that are known to attract spammy traffic are removed. Those are reported by users, and while the list is great to have, it only removes the major offenders.

Tue, Apr 9, 3:00 AM · Language-strategy, Tool-Pageviews, Analytics

Mon, Apr 8

Tbayer added a comment to T107069: Convert HistoryAction.php to use OOUI and MW's new DateInputWidget.

Hi @Wargo, this is the right place for this specific form.

[..]

Second, even if you might not use the form, doesn't mean that it's not used frequently.

True, but conversely, it also doesn't mean that it is used frequently enough to justify enlarging it at the expense of other page elements ;)

We're trying to get more data behind the usage of those forms and its elements as part of Advanced Mobile Contributions project.

That's a good idea. Does someone happen to have a link to the corresponding data analysis task?
There is T214935, but measuring the usage of those forms has not been in the scope of that ticket (it would likely require new instrumentation).
And in fact, the results there so far point in a very different direction. E.g. in T214935#4917889 we found that for logged-in users on enwiki, only about 6% of clicks on history pages go to "other action=history views". This includes both submissions of this date/tag search form and all clicks to the "older 50" etc. links located at the top and bottom of the revision list.
I.e. usage of the form that has been enlarged here is probably even lower than 6% in that situation. On the other hand, clicks on diff links made up 43%, and links to old revisions 10%. So we already know that the revision list - which has been pushed further down the page by this change - gets more than half of the usage. (It is probably much more than 50%, because it contains other kinds of links - user pages, user talk pages, contributions - which also occur outside the list, so we can't easily determine whether a click on them came from inside or outside the list.)

Mon, Apr 8, 11:12 PM · MW-1.34-notes (1.34.0-wmf.1; 2019-04-16), Patch-For-Review, MW-1.33-notes (1.33.0-wmf.24; 2019-04-02), UI-Standardization-Kanban, User-Jdlrobson, Advanced Mobile Contributions, MediaWiki-History-and-Diffs, UI-Standardization
Tbayer added a parent task for T201502: Ensure that readers of the 2018/19 Audiences annual plan wiki page can find out about canonical metric choices: T215976: Data Dictionary for Core Metrics.
Mon, Apr 8, 7:47 PM · Contributors-Analysis, Product-Analytics
Tbayer added a subtask for T215976: Data Dictionary for Core Metrics: T201502: Ensure that readers of the 2018/19 Audiences annual plan wiki page can find out about canonical metric choices.
Mon, Apr 8, 7:47 PM · Product-Analytics, Better Use Of Data
Tbayer renamed T201502: Ensure that readers of the 2018/19 Audiences annual plan wiki page can find out about canonical metric choices from Ensure that annual plan wiki page represents canonical metric choices to Ensure that readers of the 2018/19 Audiences annual plan wiki page can find out about canonical metric choices.
Mon, Apr 8, 7:46 PM · Contributors-Analysis, Product-Analytics
Restricted Application changed the subtype of T201502: Ensure that readers of the 2018/19 Audiences annual plan wiki page can find out about canonical metric choices from "Deadline" to "Task".

The need outlined in this task still exists; actually it has since become more urgent, as Audiences teams have been making those metrics choices.

Mon, Apr 8, 7:42 PM · Contributors-Analysis, Product-Analytics
Tbayer awarded T220432: Clean up "easter egg" short URLs before extension goes live a Like token.
Mon, Apr 8, 6:01 PM · MediaWiki-extensions-UrlShortener
Tbayer added a comment to T218964: Ingest data from PrefUpdate EventLogging schema into Druid.

That link looks great overall. There seems to be a one-day discrepancy though between the dates given on the x-axis and in the mouseover. Also, I'm having trouble accessing this view (the chart never materializes, the spinner keeps spinning even after waiting for 5-10 minutes - tried both in Firefox and Chromium). Perhaps a general Turnilo issue?

Mon, Apr 8, 5:58 PM · Patch-For-Review, Analytics-Kanban, Product-Analytics, Analytics
Tbayer updated subscribers of T219402: AMC: add Contributions action to toolbar for :User namespace pages.
Mon, Apr 8, 2:00 PM · Readers-Web-Backlog (Design), Advanced Mobile Contributions
Tbayer added a comment to T215675: Provide mechanism to allow dynamically tag log entries.

I think we're done here but want to check with @Tbayer to see if there's anything else we'd like to check in order to resolve this

T215597: QA edit tags for moderation actions is still open; should it be considered a subtask of this?

Mon, Apr 8, 1:24 PM · Readers-Web-Backlog (Readers-Web-Kanbanana-Board-2018-19-Q4), MW-1.33-notes (1.33.0-wmf.22; 2019-03-19), MediaWiki-Logging, Advanced Mobile Contributions, MobileFrontend

Sat, Apr 6

Tbayer closed T218286: Test X-Analytics tag in production as Resolved.

Two further plausibility checks look fine as well:

  • The distribution of special pages viewed seems roughly plausible (even if we perhaps don't anticipate MassMessage or CX to be widely used under the AMC interface).
  • Almost all requests are logged-in.
Sat, Apr 6, 1:53 AM · Readers-Web-Backlog (Tracking), Product-Analytics, Audiences-QA (RW-Test-Cases), Advanced Mobile Contributions
Tbayer closed T218286: Test X-Analytics tag in production, a subtask of T210660: [EPIC] AMC Metrics , as Resolved.
Sat, Apr 6, 1:53 AM · Readers-Web-Backlog (Readers-Web-Kanbanana-Board-2018-19-Q4), Chinese-Sites, Epic, Advanced Mobile Contributions
Tbayer added a comment to T219546: Change permissions for daily traffic anomaly reports on stat1007.

@elukey Once this is puppetized, how much would it continue to depend on particular users retaining continued access?

Sat, Apr 6, 12:24 AM · User-Elukey, Analytics-Kanban, Analytics

Fri, Apr 5

Tbayer added a comment to T207280: Track share button usage.

Filed a whitelist patch. Rather than the Print schema I mentioned above, the ReadingDepth schema turned out to be a better example to follow here (note that in this case we don't track session IDs to begin with).

Fri, Apr 5, 8:32 PM · Patch-For-Review, Readers-Web-Backlog (Tracking), User-Jdlrobson, MW-1.33-notes (1.33.0-wmf.22; 2019-03-19), MinervaNeue
Tbayer added a comment to T218286: Test X-Analytics tag in production.

The tag is present in the webrequest table (as "b%2Camc"; an unencoded "b,amc" would have been slightly more aesthetically pleasing, but it's not a dealbreaker ;). It occurs on exactly the three projects where we expect it after the recent deployment. Will do some further checks.
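
As a small illustration of the encoding point in the comment above, the following sketch decodes the percent-encoded value with Python's standard library. Only the string "b%2Camc" comes from the comment; treating the comma-separated parts as separate labels is an assumption made here for illustration.

```python
from urllib.parse import unquote

# Value as it appears in the webrequest table, per the comment above.
raw_tag = "b%2Camc"

# %2C is the percent-encoding of a comma.
decoded = unquote(raw_tag)

# Assumption: the comma separates individual labels inside the tag value.
parts = decoded.split(",")

print(decoded)
print(parts)
```

A query that filters on this tag would therefore need to match the encoded form unless the value is decoded first.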

Fri, Apr 5, 5:13 PM · Readers-Web-Backlog (Tracking), Product-Analytics, Audiences-QA (RW-Test-Cases), Advanced Mobile Contributions
Tbayer added a comment to T218964: Ingest data from PrefUpdate EventLogging schema into Druid.

Thanks! To me this looks good to go now, except perhaps that the x-axis coordinates seem a bit weird (each day appears twice - "Wed 16 Wed 16'", with the second "Wed 16" actually located at the start of Jan 17).

Fri, Apr 5, 4:08 PM · Patch-For-Review, Analytics-Kanban, Product-Analytics, Analytics
Tbayer added a comment to T207280: Track share button usage.

The schema has three different events (actions), and several fields. For QA, all these need to be checked. I have updated the task description with steps that actually achieve this.

I can confirm I have verified all of these.

OK thanks! Feel free to mark the new QA steps as passed if we have indeed verified the fields for all three actions.

"contains wprov" is not sufficient, it needs to use the parameter value we picked for this, in the format required by Varnish

The value is also there. This I can confirm

I'm not quite sure what you meant by "check if it shows up in a page view table" - wprov parameters don't show up there, only in webrequest

Webrequest is what I'm referring to. My understanding is that there is no webrequest table for the beta cluster so we won't be able to test this unless the code is enabled in production.

Fri, Apr 5, 2:07 AM · Patch-For-Review, Readers-Web-Backlog (Tracking), User-Jdlrobson, MW-1.33-notes (1.33.0-wmf.22; 2019-03-19), MinervaNeue
Tbayer added a comment to T219402: AMC: add Contributions action to toolbar for :User namespace pages.

...

The contributions icon is not recognisable to me to be honest

Agreed - earlier, this (or a similar icon, part of the "Wikifont" effort some years ago) has rather been used to evoke "content" in the movement, see e.g. this or this example.

and inconsistent with desktop which uses a black silhouette.

That black silhouette icon on desktop indicates your own user links (user page, notifications, talk page... ), not for contributions of other users.

Fri, Apr 5, 1:59 AM · Readers-Web-Backlog (Design), Advanced Mobile Contributions
Tbayer added a comment to T219402: AMC: add Contributions action to toolbar for :User namespace pages.

@CKoerner_WMF thanks for pointing that out.

@Tbayer is it possible for us to find out: for a logged-in user on desktop, from a Userpage are they more likely to navigate to History or to Contributions? @Jdlrobson makes a good point about consistency, so I'm not necessarily suggesting that we should base the design on the data entirely, but would be good to know if possible.

Fri, Apr 5, 1:42 AM · Readers-Web-Backlog (Design), Advanced Mobile Contributions
Tbayer added a comment to T207280: Track share button usage.

I've QAed and verified events are shown on share clicks.

The schema has three different events (actions), and several fields. For QA, all these need to be checked. I have updated the task description with steps that actually achieve this.

As for click through links, I can verify that the URL shared contains wprov - is that enough to call this done?

"contains wprov" is not sufficient, it needs to use the parameter value we picked for this, in the format required by Varnish. (On the other hand, I'm not quite sure what you meant by "check if it shows up in a page view table" - wprov parameters don't show up there, only in webrequest. In any case we don't need to debug the Varnish / refinery pipeline here, just make sure we format the URL as specified in its documentation.) I have updated the task description in that regard too.

Fri, Apr 5, 12:05 AM · Patch-For-Review, Readers-Web-Backlog (Tracking), User-Jdlrobson, MW-1.33-notes (1.33.0-wmf.22; 2019-03-19), MinervaNeue

Thu, Apr 4

Tbayer updated the task description for T207280: Track share button usage.
Thu, Apr 4, 11:58 PM · Patch-For-Review, Readers-Web-Backlog (Tracking), User-Jdlrobson, MW-1.33-notes (1.33.0-wmf.22; 2019-03-19), MinervaNeue
Tbayer added a comment to T218964: Ingest data from PrefUpdate EventLogging schema into Druid.

Great - the capsule dimensions look good to me. It doesn't yet seem possible to switch to a daily time series, perhaps that is an artifact of the short test period? (The dataset seems to contain data from both Jan 14 and Jan 15. But splitting by time and selecting 1D granularity results in a chart consisting just of one dot, for Jan 15.)

Thu, Apr 4, 7:58 PM · Patch-For-Review, Analytics-Kanban, Product-Analytics, Analytics
Tbayer created T220148: Count moderation actions.
Thu, Apr 4, 7:46 PM · Product-Analytics, Advanced Mobile Contributions
Tbayer updated the task description for T213461: Define moderation actions.
Thu, Apr 4, 2:43 AM · Advanced Mobile Contributions
Tbayer added a comment to T215597: QA edit tags for moderation actions.

@Edtadros Sure, thanks for checking! I have edited the task description in that regard, let me know if this is helpful.
I also took the opportunity to rewrite the rest somewhat, adding more explanatory links and nuance.

Thu, Apr 4, 2:42 AM · Readers-Web-Backlog (Readers-Web-Kanbanana-Board-2018-19-Q4), Audiences-QA (RW-Test-Cases), Advanced Mobile Contributions
Tbayer updated the task description for T215597: QA edit tags for moderation actions.
Thu, Apr 4, 2:39 AM · Readers-Web-Backlog (Readers-Web-Kanbanana-Board-2018-19-Q4), Audiences-QA (RW-Test-Cases), Advanced Mobile Contributions

Wed, Apr 3

Tbayer updated the task description for T210660: [EPIC] AMC Metrics .
Wed, Apr 3, 11:07 PM · Readers-Web-Backlog (Readers-Web-Kanbanana-Board-2018-19-Q4), Chinese-Sites, Epic, Advanced Mobile Contributions
Tbayer updated the task description for T210660: [EPIC] AMC Metrics .
Wed, Apr 3, 10:40 PM · Readers-Web-Backlog (Readers-Web-Kanbanana-Board-2018-19-Q4), Chinese-Sites, Epic, Advanced Mobile Contributions
Tbayer updated the task description for T210660: [EPIC] AMC Metrics .
Wed, Apr 3, 9:48 PM · Readers-Web-Backlog (Readers-Web-Kanbanana-Board-2018-19-Q4), Chinese-Sites, Epic, Advanced Mobile Contributions
Tbayer updated the task description for T198313: [GOAL] Advanced mobile contributions.
Wed, Apr 3, 9:31 PM · Advanced Mobile Contributions, Goal, Readers-Web-Backlog
Tbayer added a comment to T207280: Track share button usage.

@pmiazga I have added the mandatory schema documentation to the talk page (feel free to rope in other maintainers, or fill out the project name): https://meta.wikimedia.org/wiki/Schema_talk:MobileWebShareButton
It also needs a whitelisting decision. I suggest to follow the example of Schema:Print here too (code).

Wed, Apr 3, 8:19 PM · Patch-For-Review, Readers-Web-Backlog (Tracking), User-Jdlrobson, MW-1.33-notes (1.33.0-wmf.22; 2019-03-19), MinervaNeue
Tbayer updated the task description for T218964: Ingest data from PrefUpdate EventLogging schema into Druid.
Wed, Apr 3, 7:47 PM · Patch-For-Review, Analytics-Kanban, Product-Analytics, Analytics
Tbayer added a comment to T218964: Ingest data from PrefUpdate EventLogging schema into Druid.

Thanks @fdans - we'll also need at least some of the standard fields from the event capsule, as in the case of previous EventLogging ingestions (e.g. the aforementioned T202751, where these had been understood to be included without being listed explicitly in the task description. But I should have done that here for clarity, will do so now).

Wed, Apr 3, 7:44 PM · Patch-For-Review, Analytics-Kanban, Product-Analytics, Analytics
Tbayer updated the task description for T220016: Extend and rename MobileWebMainMenuClickTracking schema.
Wed, Apr 3, 5:30 PM · Advanced Mobile Contributions, Readers-Web-Backlog
Tbayer created T220016: Extend and rename MobileWebMainMenuClickTracking schema.
Wed, Apr 3, 5:03 PM · Advanced Mobile Contributions, Readers-Web-Backlog

Tue, Apr 2

Tbayer updated the task description for T219542: Make data access guidelines public.
Tue, Apr 2, 5:59 PM · Documentation, Product-Analytics

Mon, Apr 1

Tbayer added a comment to T178802: Add Tilman to analytics-admins.

@elukey Sure, that totally makes sense! The end of January estimate from T178802#4647106 turned out a bit optimistic (see again our internal timeline document which I have been trying to keep up to date as information became available to me), but as of a few weeks ago this now indeed looks completed for the foreseeable future. Please remove the bits. What might the turnaround time be to reinstate them if needed?

Mon, Apr 1, 9:45 PM · Patch-For-Review, Operations, SRE-Access-Requests, Analytics-Kanban

Fri, Mar 29

Tbayer added a comment to T219212: Augment ReadingDepth schema with data from Page Lifecycle API.

☝️ If necessary, we could break out migrating from using the unload event to using the pagehide event into another task.

Fri, Mar 29, 4:40 PM · Reading Depth, Product-Analytics, Readers-Web-Backlog

Tue, Mar 26

Tbayer moved T219212: Augment ReadingDepth schema with data from Page Lifecycle API from Triage to Tracking on the Product-Analytics board.
Tue, Mar 26, 11:02 PM · Reading Depth, Product-Analytics, Readers-Web-Backlog
Tbayer moved T218286: Test X-Analytics tag in production from Triage to Next Up on the Product-Analytics board.
Tue, Mar 26, 11:02 PM · Readers-Web-Backlog (Tracking), Product-Analytics, Audiences-QA (RW-Test-Cases), Advanced Mobile Contributions
Tbayer claimed T218286: Test X-Analytics tag in production.
Tue, Mar 26, 11:02 PM · Readers-Web-Backlog (Tracking), Product-Analytics, Audiences-QA (RW-Test-Cases), Advanced Mobile Contributions
Tbayer added a comment to T216883: Document contributors movement metrics.

BTW, it may be worth including references to the Wikistats definitions (https://meta.wikimedia.org/wiki/Research:Wikistats_metrics ) which seem to have several parallels (e.g. https://meta.wikimedia.org/wiki/Research:Wikistats_metrics/Active_editors ) but perhaps with differences.

Tue, Mar 26, 9:15 PM · Contributors-Analysis, Product-Analytics
Tbayer updated the task description for T215976: Data Dictionary for Core Metrics.
Tue, Mar 26, 8:18 PM · Product-Analytics, Better Use Of Data
Tbayer added a comment to T219212: Augment ReadingDepth schema with data from Page Lifecycle API.
  • Can lifecycleactiveLength be lifecycleActiveLength?

Sounds good, I renamed it in the task description.

  • It's my understanding that this change is anticipated to be supported by Chrome only initially. I think that means we have to maintain the existing implementation but I wasn't quite clear from the meeting discussion.

Yes, that's why the task description said "in addition to visibleLength" (which will remain based on the Visibility API).

Tue, Mar 26, 6:55 PM · Reading Depth, Product-Analytics, Readers-Web-Backlog
Tbayer updated the task description for T219212: Augment ReadingDepth schema with data from Page Lifecycle API.
Tue, Mar 26, 6:53 PM · Reading Depth, Product-Analytics, Readers-Web-Backlog
Tbayer added a comment to T172410: Replace the current multisource analytics-store setup.

For the record: Information about how to work with the new setup was added to https://wikitech.wikimedia.org/wiki/Analytics/Data_access#MariaDB_replicas .

Tue, Mar 26, 1:00 AM · Product-Analytics, Analytics, WMDE-Analytics-Engineering, User-Addshore, User-Elukey, Research

Mon, Mar 25

Tbayer renamed T219212: Augment ReadingDepth schema with data from Page Lifecycle API from Augment ReadingDepth schema with to Augment ReadingDepth schema with data from Page Lifecycle API.
Mon, Mar 25, 7:41 PM · Reading Depth, Product-Analytics, Readers-Web-Backlog
Tbayer created T219212: Augment ReadingDepth schema with data from Page Lifecycle API.
Mon, Mar 25, 6:53 PM · Reading Depth, Product-Analytics, Readers-Web-Backlog

Sun, Mar 24

Tbayer added a comment to T192214: Make dumps accessible in PAWS.

Just to clarify for casual readers: the dumps are currently accessible. (I used them successfully in this notebook earlier this month, thanks @Chicocvenancio for fixing this earlier!) I understand this ticket is now about implementing this in a more future-proof manner - should the task description be updated?

Sun, Mar 24, 7:25 AM · PAWS (zero-to-jupyterhub-k8s 0.8.0), cloud-services-team (Kanban), Data-Services

Sat, Mar 23

Tbayer added a comment to T210006: Event counts from Mysql and Hive don't match. Refine is persisting data from crawlers. .

By looking at some of this data I can see that web crawler events are getting into hive but not into mysql (that would be something for us to fix).

Does this (i.e. that T67508 doesn't yet work for the Hive data) apply to all EL schemas? In that case it would seem a valuable addition to the documentation at https://wikitech.wikimedia.org/wiki/Analytics/Systems/EventLogging#Incompatibilities_with_the_MariaDB_setup .

Sat, Mar 23, 12:35 AM · Analytics-Kanban, Product-Analytics, Analytics

Fri, Mar 22

Tbayer created T219040: "Latest X" filter in Turnilo picks the wrong dates.
Fri, Mar 22, 10:23 PM · Analytics
Tbayer closed T198218: Generate list of most used special pages as Resolved.

Results for the third question about the most popular non-mainspace pages for logged-in users, expanding on the enwiki results of T198218#4600385 , are now posted at https://www.mediawiki.org/wiki/Reading/Web/Advanced_mobile_contributions/Special_pages_usage#Top_non-mainspace_pages
Note that as before, this is grouped by (and linking to) page roots, e.g. https://es.wikipedia.org/wiki/Wikipedia:Consultas_de_borrado/Cloud9_(League_of_Legends) counts for https://es.wikipedia.org/wiki/Wikipedia:Consultas_de_borrado . For some page roots, a corresponding page may not exist (e.g. there is https://en.wiktionary.org/wiki/Reconstruction:Proto-Slavic/-ati but no https://en.wiktionary.org/wiki/Reconstruction:Proto-Slavic ).
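
The grouping by page root described above can be sketched roughly as follows. This is a minimal illustration, not the query actually used for the posted results: it assumes the root is everything before the first "/" in the title, and the function names and view counts are made up for the example.

```python
from collections import Counter


def page_root(title):
    """Return the part of a page title before the first slash (assumed root)."""
    return title.split("/", 1)[0]


def views_by_root(view_counts):
    """Sum per-page view counts into per-root totals."""
    totals = Counter()
    for title, views in view_counts.items():
        totals[page_root(title)] += views
    return totals


# Hypothetical counts, for illustration only.
example_counts = {
    "Wikipedia:Consultas_de_borrado/Cloud9_(League_of_Legends)": 40,
    "Wikipedia:Consultas_de_borrado": 10,
    "Reconstruction:Proto-Slavic/-ati": 5,
}
root_totals = views_by_root(example_counts)
print(root_totals.most_common())
```

Note that a root such as "Reconstruction:Proto-Slavic" receives a total here even though, as mentioned above, no page may exist at that title.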

Fri, Mar 22, 5:24 PM · Chinese-Sites, Advanced Mobile Contributions, Reading-analysis, Readers-Web-Backlog (Tracking), Product-Analytics
Tbayer closed T198218: Generate list of most used special pages, a subtask of T198313: [GOAL] Advanced mobile contributions, as Resolved.
Fri, Mar 22, 5:24 PM · Advanced Mobile Contributions, Goal, Readers-Web-Backlog
Tbayer closed T198218: Generate list of most used special pages, a subtask of T210660: [EPIC] AMC Metrics , as Resolved.
Fri, Mar 22, 5:24 PM · Readers-Web-Backlog (Readers-Web-Kanbanana-Board-2018-19-Q4), Chinese-Sites, Epic, Advanced Mobile Contributions
Tbayer updated the task description for T198218: Generate list of most used special pages.
Fri, Mar 22, 5:18 PM · Chinese-Sites, Advanced Mobile Contributions, Reading-analysis, Readers-Web-Backlog (Tracking), Product-Analytics
ovasileva awarded T218964: Ingest data from PrefUpdate EventLogging schema into Druid a Love token.
Fri, Mar 22, 3:15 PM · Patch-For-Review, Analytics-Kanban, Product-Analytics, Analytics

Thu, Mar 21

Tbayer moved T218964: Ingest data from PrefUpdate EventLogging schema into Druid from Triage to Tracking on the Product-Analytics board.
Thu, Mar 21, 11:38 PM · Patch-For-Review, Analytics-Kanban, Product-Analytics, Analytics
Tbayer created T218964: Ingest data from PrefUpdate EventLogging schema into Druid.
Thu, Mar 21, 11:38 PM · Patch-For-Review, Analytics-Kanban, Product-Analytics, Analytics
Tbayer added a comment to T198218: Generate list of most used special pages.

Results for the second question, expanding the earlier result for the top 50 special pages to 14 non-enwiki projects, are now posted on this wiki page (consisting of 15 x 50 = 750 numbers, this information is a bit unwieldy and would not fit well into a table here on Phabricator).

Thu, Mar 21, 5:18 PM · Chinese-Sites, Advanced Mobile Contributions, Reading-analysis, Readers-Web-Backlog (Tracking), Product-Analytics
Tbayer updated the task description for T198218: Generate list of most used special pages.
Thu, Mar 21, 4:52 PM · Chinese-Sites, Advanced Mobile Contributions, Reading-analysis, Readers-Web-Backlog (Tracking), Product-Analytics

Mar 21 2019

Tbayer added a comment to T211197: Build AMC opt-in toggle.

Note though that there seem to be some general concerns about the data quality of the PrefUpdate schema, regarding duplicate events. I have filed T218835 for this. Right now it seems that this problem could be limited to a small number of other preferences (i.e. not the AMC one we're relying on here), but it seems worth checking this later after a larger number of signups have occurred.

Mar 21 2019, 12:17 AM · Audiences-QA (RW-Test-Cases), MW-1.33-notes (1.33.0-wmf.16; 2019-02-05), Patch-For-Review, Readers-Web-Backlog (Readers-Web-Kanbanana-Board-2018-19-Q3), Advanced Mobile Contributions
Tbayer updated the task description for T218835: prefUpdate schema contains multiple identical events for the same preference update.
Mar 21 2019, 12:13 AM · Product-Analytics, Analytics

Mar 20 2019

Tbayer added a comment to T218835: prefUpdate schema contains multiple identical events for the same preference update.

(Tagging this with Analytics considering general EL code stewardship and the current schema maintainer, although I honestly don't know who is in the best position to fix this.)

Mar 20 2019, 11:16 PM · Product-Analytics, Analytics
Tbayer moved T218835: prefUpdate schema contains multiple identical events for the same preference update from Triage to Tracking on the Product-Analytics board.
Mar 20 2019, 11:13 PM · Product-Analytics, Analytics
Tbayer created T218835: prefUpdate schema contains multiple identical events for the same preference update.
Mar 20 2019, 11:12 PM · Product-Analytics, Analytics
Tbayer updated subscribers of T211843: Update Audiences page and Key Product Metrics deck with February 2019 Readers data.

@Neil_P._Quinn_WMF Are you going to take care of posting the entire slide deck on the Audiences page again?

Mar 20 2019, 9:29 PM · Product-Analytics
Tbayer updated the task description for T211843: Update Audiences page and Key Product Metrics deck with February 2019 Readers data.
Mar 20 2019, 9:25 PM · Product-Analytics
Tbayer moved T211843: Update Audiences page and Key Product Metrics deck with February 2019 Readers data from Blocked to Doing on the Product-Analytics board.
Mar 20 2019, 9:23 PM · Product-Analytics
Tbayer moved T203498: Upgrade Hive to ≥ 2.0 from Triage to Tracking on the Product-Analytics board.
Mar 20 2019, 9:19 PM · Product-Analytics, Analytics-Cluster, Analytics
Tbayer added a project to T203498: Upgrade Hive to ≥ 2.0: Product-Analytics.
Mar 20 2019, 9:18 PM · Product-Analytics, Analytics-Cluster, Analytics
Tbayer added a comment to T211197: Build AMC opt-in toggle.

@Edtadros and I worked through testing the 5th acceptance criterion together during our 1:1 today.

Here are the server-side EventLogging events that I captured while opting in and out of AMC mode on http://reading-web-staging.wmflabs.org/wiki/Special:MobileOptions:

[...]

To the "and compatible" part of the AC: note well the "clientValidated": true in the events above.

Thanks! This looks good enough for now, although it still seems worthwhile to run a query later - as soon as this is live in production - to verify that these events show up in the PrefUpdate EL table in Hive or MariaDB. (IIRC events from reading-web-staging are not logged there, correct?)

Mar 20 2019, 7:48 PM · Audiences-QA (RW-Test-Cases), MW-1.33-notes (1.33.0-wmf.16; 2019-02-05), Patch-For-Review, Readers-Web-Backlog (Readers-Web-Kanbanana-Board-2018-19-Q3), Advanced Mobile Contributions

Mar 19 2019

Tbayer updated the task description for T215976: Data Dictionary for Core Metrics.
Mar 19 2019, 10:35 PM · Product-Analytics, Better Use Of Data
Tbayer added a project to T218286: Test X-Analytics tag in production: Product-Analytics.
Mar 19 2019, 7:18 PM · Readers-Web-Backlog (Tracking), Product-Analytics, Audiences-QA (RW-Test-Cases), Advanced Mobile Contributions
Tbayer updated the task description for T218627: Upgrade MobileWebMainMenuClickTracking to have an AMC field.
Mar 19 2019, 5:28 PM · Patch-For-Review, MW-1.33-notes (1.33.0-wmf.24; 2019-04-02), Readers-Web-Backlog (Readers-Web-Kanbanana-Board-2018-19-Q4), Advanced Mobile Contributions
Tbayer updated the task description for T203498: Upgrade Hive to ≥ 2.0.
Mar 19 2019, 1:23 AM · Product-Analytics, Analytics-Cluster, Analytics
Tbayer added a comment to T217851: [SPIKE] Check MobileWebMainMenuClickTracking schema is still functional and determine how we can use it for AMC.

...

Thanks for documenting the sampling on the talk page! Does "0.5 of all clicks ... based on session id" mean 0.5% of sessions or half of them? The low event rate (roughly 1 event/sec on average) would be quite surprising in the former case, considering that we have about 3500 mobile pageviews per second.
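
To make the two readings of "0.5" concrete, here is a back-of-the-envelope sketch using only the two figures quoted above (about 1 event/sec observed, about 3500 mobile pageviews/sec). It assumes that sampled sessions behave like average sessions and that every logged event is one menu click; both are simplifications made here, not statements about the schema.

```python
PAGEVIEWS_PER_SEC = 3500        # figure quoted in the comment above
OBSERVED_EVENTS_PER_SEC = 1.0   # rough average quoted in the comment above


def implied_clicks_per_pageview(sampling_rate):
    """Menu clicks per pageview implied by the observed event rate."""
    unsampled_clicks_per_sec = OBSERVED_EVENTS_PER_SEC / sampling_rate
    return unsampled_clicks_per_sec / PAGEVIEWS_PER_SEC


for label, rate in [("0.5% of sessions", 0.005), ("half of sessions", 0.5)]:
    share = implied_clicks_per_pageview(rate)
    print(f"{label}: {share:.4%} of pageviews would include a menu click")
```

The two readings differ by a factor of 100 in the implied click rate, which is why the interpretation of the sampling setting matters for any later analysis.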

Mar 19 2019, 1:18 AM · Spike, Readers-Web-Backlog, Advanced Mobile Contributions

Mar 16 2019

Tbayer added a comment to T218349: Update translation for "mobile web edit" change tag to "mobile web action".
  • We announce that the tag will be renamed to wikitech-l, wikimedia-l, and on the technical village pump.

I don't think this is Wikimedia-l material. But a notice in Tech News would be a good idea.

Mar 16 2019, 2:21 AM · MobileFrontend, Readers-Web-Backlog
Tbayer added a comment to T218349: Update translation for "mobile web edit" change tag to "mobile web action".

[...]

What is the problem statement here? Distinguishing old data from new data or updating how it appears in the interface? (the latter is easy but the former is a big undertaking and I want to check it's worth all that effort - from what I remember, when apps started using the tag, web didn't change their tag before for this reason).

I understand it's the latter, although from the data analysis perspective it seems preferable to also rename it in the tables (it would be a permanent source of confusion if the tag's name there differs from its English name in the interface).

Mar 16 2019, 2:19 AM · MobileFrontend, Readers-Web-Backlog

Mar 15 2019

Tbayer added a comment to T18691: RFC: Section headings should have a clickable anchor.

...

In the meantime, here are some dump-based numbers for two Wikipedias (I'll try to run this for enwiki too), for mainspace pages:

Mar 15 2019, 11:54 PM · Readers-Web-Backlog (Design), TechCom-RFC, Design, MediaWiki-Interface
Tbayer added a comment to T216297: Develop method for identifying reverts in EventBus data.

@Anomie this is concerned with eventbus data that is real time; most of the data wanted here is less than a couple of months old, as information on whether edits were a revert exists in other datasets for data older than that. So (it seems) that having "rollback" or "undo" tags is actually a good measure of whether a revert has happened. In which case adding tags to this schema should be sufficient:

https://github.com/wikimedia/mediawiki-event-schemas/blob/master/jsonschema/mediawiki/revision/create/3.yaml

Mar 15 2019, 8:00 PM · Core Platform Team Backlog (Watching / External), Contributors-Analysis, Product-Analytics
Tbayer updated the task description for T216297: Develop method for identifying reverts in EventBus data.
Mar 15 2019, 7:48 PM · Core Platform Team Backlog (Watching / External), Contributors-Analysis, Product-Analytics
Tbayer added a comment to T216297: Develop method for identifying reverts in EventBus data.

But I think there are several reasons why people have not been using them for reverts and instead relied on content-based revert detection, e.g. the fact that edits tagged "undo" might not actually be reverts because the user can modify the content before saving.

Another possibility is that the "rollback" and "undo" tags were only added a little over a year ago, so things written before that time wouldn't have been able to use them.

But before that, researchers and analysts were able to instead use the characteristic strings those actions leave in the edit summary (certainly a bit less reliable and convenient, but e.g. the Kittur et al. paper mentioned in https://meta.wikimedia.org/wiki/Research:Revert did something like this in 2007 already).

It also depends on your definition of "revert", whether you count those edited undoes or undoing of a revision older than the most recent while keeping the changes from later revisions.

Sure. There's a good overview in https://meta.wikimedia.org/wiki/Research:Revert. I understand this task is about what is called "identity revert" there, because that is what is most practical and already implemented in mwreverts and in mediawiki_history.
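
For illustration, a minimal sketch of content-based identity revert detection: a revision counts as a revert if its content checksum matches that of a recent earlier revision, with at least one differing revision in between. This is not the mwreverts or mediawiki_history implementation; the function name, the input format, and the `radius` default are assumptions.

```python
import hashlib
from collections import deque


def find_identity_reverts(revisions, radius=15):
    """Find identity reverts in one page's history.

    `revisions` is an iterable of (rev_id, text) in chronological order.
    `radius` (an assumed parameter) is how many preceding revisions are
    searched for a matching checksum.
    Returns a list of (reverting_id, reverted_to_id, reverted_ids).
    """
    recent = deque(maxlen=radius)  # (rev_id, checksum) of recent revisions
    reverts = []
    for rev_id, text in revisions:
        checksum = hashlib.sha1(text.encode("utf-8")).hexdigest()
        history = list(recent)
        # Search from the most recent revision backwards for identical content.
        for i in range(len(history) - 1, -1, -1):
            if history[i][1] == checksum:
                reverted = [rid for rid, _ in history[i + 1:]]
                # No intervening revisions means nothing was reverted.
                if reverted:
                    reverts.append((rev_id, history[i][0], reverted))
                break
        recent.append((rev_id, checksum))
    return reverts
```

By construction this misses the other cases mentioned above, such as an undo that was edited before saving, or the undoing of an older revision while later changes are kept, because the resulting content does not match any earlier revision exactly.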

Mar 15 2019, 7:47 PM · Core Platform Team Backlog (Watching / External), Contributors-Analysis, Product-Analytics
Tbayer added a comment to T18691: RFC: Section headings should have a clickable anchor.

...

Side note: I still support the idea of making this a configurable setting. For Wikipedia it'd be great to get data on what percentage of articles use <h3>, <h4>, etc. — I've reached out to @Tbayer for suggestions on how we might get that data.

There are some old (2015) stats on H5 & H6 usage at Enwiki, at T72004#1407800 which halfak kindly generated for me back then (it was an overnight query on the stats boxes)

Thanks! Do you happen to still have the underlying database query and could post it here so we could re-run it?
In the meantime, here are some dump-based numbers for two Wikipedias (I'll try to run this for enwiki too), for mainspace pages:

Mar 15 2019, 4:43 PM · Readers-Web-Backlog (Design), TechCom-RFC, Design, MediaWiki-Interface

Mar 14 2019

Tbayer added a comment to T218349: Update translation for "mobile web edit" change tag to "mobile web action".

Another thing to consider may be edit filters. I'm not even sure whether it is possible to include tags in edit filter rules, but in any case, at least on enwiki and dewiki, no filters seem to rely on the mobile web edit tag (admin access to view results).

Mar 14 2019, 11:29 PM · MobileFrontend, Readers-Web-Backlog
Tbayer added a comment to T218349: Update translation for "mobile web edit" change tag to "mobile web action".

[...]

Here the tag is being used to calculate the monthly movement metrics - but I assume this would be easy to update (CC @Neil_P._Quinn_WMF ).

PS: just asked Neil in person and he confirmed it wouldn't be a concern.

Mar 14 2019, 11:14 PM · MobileFrontend, Readers-Web-Backlog
Tbayer updated subscribers of T218349: Update translation for "mobile web edit" change tag to "mobile web action".
Mar 14 2019, 11:05 PM · MobileFrontend, Readers-Web-Backlog
Tbayer added a comment to T218349: Update translation for "mobile web edit" change tag to "mobile web action".

Another data point: Nobody seems to have been using it on Quarry so far.

Mar 14 2019, 8:13 PM · MobileFrontend, Readers-Web-Backlog
Tbayer added a comment to T217926: Import of wmfdata fails while trying to access analytics-store .

Resolved after updating (!pip install --upgrade git+https://github.com/neilpquinn/wmfdata.git). I guess this can be closed in favor of T216634?

Mar 14 2019, 12:40 AM · Product-Analytics

Mar 13 2019

Tbayer updated the task description for T216297: Develop method for identifying reverts in EventBus data.
Mar 13 2019, 9:35 PM · Core Platform Team Backlog (Watching / External), Contributors-Analysis, Product-Analytics
Tbayer added a comment to T216297: Develop method for identifying reverts in EventBus data.

Could we even get here (from the UI) whether the user clicked the "revert" button (per @Milimetric's suggestion), and send that to the hook so the event data also has this information? This would not catch the totality of revisions but a big percentage of them, which, hey, is a start.

Mar 13 2019, 9:31 PM · Core Platform Team Backlog (Watching / External), Contributors-Analysis, Product-Analytics
Tbayer added a comment to T212961: Add X-Analytics tag for AMC webrequests.

We still need to check at some point that this is correctly processed on Varnish and stored in the webrequest table (where we need it to conduct the analysis). But that will have to wait until the change is in production, because the domain en.m.wikipedia.beta.wmflabs.org is not captured in webrequest. Probably best to create a separate task for that, and close this one?

Mar 13 2019, 8:30 PM · MW-1.33-notes (1.33.0-wmf.22; 2019-03-19), Audiences-QA (RW-Test-Cases), Patch-For-Review, XAnalytics, Product-Analytics, Readers-Web-Backlog (Readers-Web-Kanbanana-Board-2018-19-Q3), Advanced Mobile Contributions

Mar 12 2019

Tbayer added a comment to T201123: What % of pages feature issues?.

PS: Keep in mind that the above data is, as stated, about usage of templates named "Ambox". Some wikis instead generate the "Ambox" class name manually from differently named templates (e.g. itwiki, see template source, example article) and will thus be affected/improved by the new design even though they show up with 0% in the table above.

Mar 12 2019, 10:42 PM · Product-Analytics, Readers-Web-Backlog (Tracking), Reading-analysis, Page-Issue-Warnings
Tbayer added a comment to T216883: Document contributors movement metrics.

PS: I already added some information about contributors metrics gleaned from the recent Insights presentation deck, but it is incomplete.

Mar 12 2019, 10:16 PM · Contributors-Analysis, Product-Analytics
Tbayer added a comment to T216883: Document contributors movement metrics.

@Neil_P._Quinn_WMF I started a page at https://www.mediawiki.org/wiki/Wikimedia_Audiences/Data_dictionary - feel free to add any links there that may already exist.

Mar 12 2019, 9:51 PM · Contributors-Analysis, Product-Analytics
Tbayer added a comment to T215976: Data Dictionary for Core Metrics.

Started a prototype at https://www.mediawiki.org/wiki/Wikimedia_Audiences/Data_dictionary

Mar 12 2019, 9:26 PM · Product-Analytics, Better Use Of Data
Tbayer claimed T217842: [Research] Get data on page actions usage.
Mar 12 2019, 7:28 PM · Advanced Mobile Contributions

Mar 11 2019

Tbayer updated the task description for T198218: Generate list of most used special pages.
Mar 11 2019, 7:07 PM · Chinese-Sites, Advanced Mobile Contributions, Reading-analysis, Readers-Web-Backlog (Tracking), Product-Analytics
Tbayer added a comment to T215477: Tag Thanks actions with AMC tag.

Leaving this here since we were talking about it earlier today in this context:

Mar 11 2019, 6:31 PM · Readers-Web-Backlog (Readers-Web-Kanbanana-Board-2018-19-Q4), Audiences-QA (RW-Test-Cases), MW-1.33-notes (1.33.0-wmf.22; 2019-03-19), Patch-For-Review, Product-Analytics, Advanced Mobile Contributions, Thanks, Growth-Team