Page MenuHomePhabricator
Feed Advanced Search

Thu, Apr 18

jwang updated the task description for T346979: Report on baseline for interface customization.
Thu, Apr 18, 9:30 PM · Product-Analytics (Kanban), FY2023-24-WE 2.1 Typography and palette customizations, Web-Team-Backlog
jwang moved T361638: Determine number of logged-in editors using each skin on a subset of wikis from Next 2 weeks to Doing on the Product-Analytics (Kanban) board.
Thu, Apr 18, 12:01 AM · Product-Analytics (Kanban), Desktop Improvements (Vector 2022), Web-Team-Backlog
jwang added a project to T361638: Determine number of logged-in editors using each skin on a subset of wikis: Product-Analytics (Kanban).
Thu, Apr 18, 12:01 AM · Product-Analytics (Kanban), Desktop Improvements (Vector 2022), Web-Team-Backlog
jwang updated the task description for T361638: Determine number of logged-in editors using each skin on a subset of wikis.
Thu, Apr 18, 12:00 AM · Product-Analytics (Kanban), Desktop Improvements (Vector 2022), Web-Team-Backlog

Wed, Apr 17

jwang added a comment to T361638: Determine number of logged-in editors using each skin on a subset of wikis.

Both skin preference and global preference reflect the status as of the data collection date, which is April 15, 2024.

Wed, Apr 17, 6:30 PM · Product-Analytics (Kanban), Desktop Improvements (Vector 2022), Web-Team-Backlog

Tue, Apr 16

jwang moved T362453: Migrate the notebook to fetch data for IP mask dashboard to spark from Next 2 weeks to Doing on the Product-Analytics (Kanban) board.
Tue, Apr 16, 4:30 PM · Product-Analytics (Kanban)

Mon, Apr 15

jwang updated the task description for T361638: Determine number of logged-in editors using each skin on a subset of wikis.
Mon, Apr 15, 10:00 PM · Product-Analytics (Kanban), Desktop Improvements (Vector 2022), Web-Team-Backlog

Fri, Apr 12

jwang updated the task description for T362453: Migrate the notebook to fetch data for IP mask dashboard to spark.
Fri, Apr 12, 9:56 PM · Product-Analytics (Kanban)
jwang created T362453: Migrate the notebook to fetch data for IP mask dashboard to spark.
Fri, Apr 12, 9:56 PM · Product-Analytics (Kanban)
jwang closed T359993: Slowdown when querying via Hive as Resolved.
Fri, Apr 12, 6:29 PM · Data-Platform-SRE (2024.03.25 - 2024.04.14), Data-Platform
jwang added a comment to T359993: Slowdown when querying via Hive.

@JAllemandou @BTullis, Thank you very much for detailed explanation! I will move from hive to presto and spark. I am going to mark this ticket as resolved.

Fri, Apr 12, 6:28 PM · Data-Platform-SRE (2024.03.25 - 2024.04.14), Data-Platform

Fri, Apr 5

jwang added a comment to T359418: Analyze usage of desktop text size beta feature.

@ovasileva, here are the analysis result. The answer to the third questions is a very rough estimate. Let me know if you disagree with any of the assumptions.

Fri, Apr 5, 10:26 PM · Product-Analytics (Kanban), FY2023-24-WE 2.1 Typography and palette customizations, Web-Team-Backlog

Wed, Apr 3

jwang updated the task description for T359418: Analyze usage of desktop text size beta feature.
Wed, Apr 3, 5:04 PM · Product-Analytics (Kanban), FY2023-24-WE 2.1 Typography and palette customizations, Web-Team-Backlog

Tue, Apr 2

jwang added a comment to T359418: Analyze usage of desktop text size beta feature.

What is the default font value on vector-2022 ? Regular

Tue, Apr 2, 9:36 PM · Product-Analytics (Kanban), FY2023-24-WE 2.1 Typography and palette customizations, Web-Team-Backlog
jwang triaged T361579: Re-run analysis on which usernames begin with ~2 as High priority.
Tue, Apr 2, 4:43 PM · Product-Analytics (Kanban), Temporary accounts
jwang added a project to T361579: Re-run analysis on which usernames begin with ~2: Product-Analytics (Kanban).
Tue, Apr 2, 4:43 PM · Product-Analytics (Kanban), Temporary accounts

Fri, Mar 29

jwang updated subscribers of T359418: Analyze usage of desktop text size beta feature.

Here is the font size stats on desktop web by skin version. A few questions based on the data

  1. What is the default font value on vector-2022 ?
  2. What do the values 0, 1, 2 , and disabled stand for on vector-2022?
  3. What is the default font value on vector ?
Fri, Mar 29, 10:39 PM · Product-Analytics (Kanban), FY2023-24-WE 2.1 Typography and palette customizations, Web-Team-Backlog

Tue, Mar 26

jwang moved T359418: Analyze usage of desktop text size beta feature from Next 2 weeks to Doing on the Product-Analytics (Kanban) board.
Tue, Mar 26, 6:02 PM · Product-Analytics (Kanban), FY2023-24-WE 2.1 Typography and palette customizations, Web-Team-Backlog
jwang edited projects for T359418: Analyze usage of desktop text size beta feature, added: Product-Analytics (Kanban); removed Product-Analytics.
Tue, Mar 26, 6:01 PM · Product-Analytics (Kanban), FY2023-24-WE 2.1 Typography and palette customizations, Web-Team-Backlog

Mon, Mar 25

jwang updated the task description for T352342: QA WebUIScroll port to the new metrics platform.
Mon, Mar 25, 11:16 PM · Web-Team-Backlog (FY2023-24 Q3 Sprint 3), Data Products (Data Products Sprint 08), Metrics Platform Backlog, Product-Analytics (Kanban), MediaWiki-extensions-WikimediaEvents
jwang added a comment to T352342: QA WebUIScroll port to the new metrics platform.

As a followup, I have documented sample rate at data hub.

Mon, Mar 25, 11:16 PM · Web-Team-Backlog (FY2023-24 Q3 Sprint 3), Data Products (Data Products Sprint 08), Metrics Platform Backlog, Product-Analytics (Kanban), MediaWiki-extensions-WikimediaEvents
jwang updated the task description for T353029: QA desktopwebuiactionstracking schema port to the new metrics platform.
Mon, Mar 25, 11:03 PM · Patch-For-Review, Web-Team-Backlog (FY2023-24 Q3 Sprint 5), Product-Analytics (Kanban), MediaWiki-extensions-WikimediaEvents
jwang added a comment to T353029: QA desktopwebuiactionstracking schema port to the new metrics platform.

As a followup, the sample rate is document at datahub

Mon, Mar 25, 11:02 PM · Patch-For-Review, Web-Team-Backlog (FY2023-24 Q3 Sprint 5), Product-Analytics (Kanban), MediaWiki-extensions-WikimediaEvents
jwang updated the task description for T357542: QA mobilewebuiactionstracking schema port to the new metrics platform.
Mon, Mar 25, 10:45 PM · Web-Team-Backlog (FY2023-24 Q3 Sprint 5), Product-Analytics (Kanban), MediaWiki-extensions-WikimediaEvents
jwang updated the task description for T357542: QA mobilewebuiactionstracking schema port to the new metrics platform.
Mon, Mar 25, 10:45 PM · Web-Team-Backlog (FY2023-24 Q3 Sprint 5), Product-Analytics (Kanban), MediaWiki-extensions-WikimediaEvents
jwang updated the task description for T357542: QA mobilewebuiactionstracking schema port to the new metrics platform.
Mon, Mar 25, 10:44 PM · Web-Team-Backlog (FY2023-24 Q3 Sprint 5), Product-Analytics (Kanban), MediaWiki-extensions-WikimediaEvents
jwang added a comment to T357542: QA mobilewebuiactionstracking schema port to the new metrics platform.

As a followup, I have documented the current sample rate at https://datahub.wikimedia.org/dataset/urn:li:dataset:(urn:li:dataPlatform:hive,event.mobilewebuiactionstracking,PROD)/Documentation?is_lineage_mode=false

Mon, Mar 25, 10:43 PM · Web-Team-Backlog (FY2023-24 Q3 Sprint 5), Product-Analytics (Kanban), MediaWiki-extensions-WikimediaEvents

Mar 21 2024

jwang added a comment to T357771: Analyze how many distinct devices edit per day from a given IP address.
  • Following the inclusion of client hints in the analysis, there was an average increase of 2 in the maximum number of unique user agents on a daily basis.
    • Throughout January 2024, the daily maximum rose from 6 to 8 unique user agents per IP on English Wikipedia.
    • For some days, the increase in maximum after including client info could be as large as 5.
Mar 21 2024, 12:51 AM · Product-Analytics (Kanban), Temporary accounts

Mar 19 2024

jwang moved T322682: Analyze blocked edit attempts from Triage to Epics on the Product-Analytics board.
Mar 19 2024, 6:30 PM · Product-Analytics, Anti-Harassment
jwang edited projects for T322682: Analyze blocked edit attempts, added: Product-Analytics; removed Product-Analytics (Kanban).
Mar 19 2024, 6:30 PM · Product-Analytics, Anti-Harassment

Mar 13 2024

jwang updated the task description for T357542: QA mobilewebuiactionstracking schema port to the new metrics platform.
Mar 13 2024, 9:47 PM · Web-Team-Backlog (FY2023-24 Q3 Sprint 5), Product-Analytics (Kanban), MediaWiki-extensions-WikimediaEvents
jwang updated the task description for T357542: QA mobilewebuiactionstracking schema port to the new metrics platform.
Mar 13 2024, 9:46 PM · Web-Team-Backlog (FY2023-24 Q3 Sprint 5), Product-Analytics (Kanban), MediaWiki-extensions-WikimediaEvents
jwang added a comment to T357542: QA mobilewebuiactionstracking schema port to the new metrics platform.
What has been checkedStatusNoteSnapshot of the result from the old schemaSnapshot of the result from the new schema
Pick one session_id, compare the resultPASSCaptured same number of events.
image.png (704×826 px, 129 KB)
image.png (704×558 px, 68 KB)
Pick one pageview_id, compare the resultPASSCaptured same number of events.
image.png (416×598 px, 35 KB)
image.png (402×558 px, 34 KB)
By datePASSThe new schema captured 0.37% more events than the old schema. The new schema captured 0.34% more sessions than the old schema.
image.png (680×1 px, 187 KB)
By actionPASSBetween March 1st and 10th, The new schema captured 0.95% more click events than old schema. The new schema captured 0.36% more init events than old schema. The new schema captured 2.23% more show events than old schema. They are within a 2.5% acceptable variance.
image.png (300×674 px, 38 KB)
image.png (270×764 px, 40 KB)
By event nameBased on the data collected from 2024-03-01 to 2024-03-10: 1) 176 types of events are captured in new schema or old schema. 2) 31 types of events are captured in new schema, but not in old schema. 3) 2 types of events are captured in old schema, but not in new schema. They are menu.preferences and menu.ve-editevent name diff file
By wiki❓Is a difference of 2.6% on commonswiki OK?Between 2024-03-01 and 2024-03-10: New schema captured 819 wikis, while the old schema captured 820. The missed wiki is nycwikimedia. The new schema captured 0.56% more events than the old schema in average. The new schema captured 0.47% more sessions than the old schema in average. The highest different rate of session count is from small wikis.Among the large wikis, the events on commonswiki is 2.6% more in new schema.
image.png (456×1 px, 98 KB)
By skin name❓is it expected that the new schema captured 'vector' and 'vector-2022' skin with agent.client_platform_family='mobile_browser'.Based on the data collected from 2024-03-01 to 2024-03-10: The new schema captured 0.37% more minerva events than old schema. The new schema captured 0.34% more minerva sessions than old schema. 'vector' and 'vector-2022' skins are not captured in old schema, but captured in new schema with agent.client_platform_family='mobile_browser'. To check with engineer whehter it is expected.
image.png (252×708 px, 29 KB)
image.png (286×776 px, 36 KB)
By user typePASSThe difference is within a 2.5% variance.
image.png (232×882 px, 35 KB)
image.png (234×910 px, 36 KB)
agent typePASS
image.png (252×736 px, 32 KB)
image.png (228×734 px, 32 KB)
edit count bucket❓ Is it expected that performer.edit_count_bucket is NULL in new schema for logged out users, while in old schema, event.editCountBucket is '0 edits'.For loggedin users, editcountbucket difference is within 2.5% variance. For loggedout users, in new schema performer.edit_count_bucket is NULL, while in old schema event.editCountBucket is '0 edits'. Need to confirm whether it is expected.
image.png (432×832 px, 60 KB)
image.png (476×1 px, 81 KB)
pageNamespacepage.namespace_id is NULL in new schema
is_dark_mode_on,❓ Is the null in old schema expected?The difference is within a 2.5% variance. The old schema captured some NULLs, while new schema didnot.For the events with null in event.is_dark_mode_on, their kin is also NULL. To check with engineer
image.png (330×1 px, 68 KB)
is_dark_mode_prepared_by_os❓ Is the null in old schema expected?The different is within a 2.5% variance. The old schema captured some NULLs, their skin field is NULL too. To check with engineer
image.png (286×2 px, 73 KB)
dark_mode_setting❓ Is the null in old and new schemas expected?The differences in dark_mode_setting being 0,1, 2, and NULL are within a 2.5% variance.
image.png (342×1 px, 80 KB)
is_full_widthThe difference is within a 2.5% variance. - The old schema captured some NULLs, their skin fields are NULL too . To check with engineer
image.png (264×1 px, 62 KB)
is_media_viewer_enabledFor is_media_viewer_enabled=true, the difference is within a 2.5% variance. For is_media_viewer_enabled=false, the new schema captured 2.55% more events than the old schema. To check with engineer.
image.png (292×2 px, 71 KB)
is_page_preview_onPASSThe difference is within a 2.5% variance
image.png (288×1 px, 71 KB)
is_pinnedPASSThe difference is within a 2.5% variance
image.png (230×722 px, 29 KB)
image.png (248×764 px, 29 KB)
font❓Is font size 0 expected?The differences in font sizes, being small, regular, and large, are within a 2.5% variance. The difference in font size being large exceeds 2.5%. Given the low volume and small absolute difference, we mark it as PASS. Both schemas captured some events where the font size was 0. To check with the engineer.
image.png (464×1 px, 106 KB)
action_context❓ What's the meaning of the field valueneed to document the meanings of the values: stable, stable,amc
image.png (578×784 px, 60 KB)
sample.rate❌ incorrect100% for all wikis and for all type of users
image.png (164×420 px, 14 KB)
is_botperformer.is_bot is NULL in new schema.
image.png (304×672 px, 32 KB)
image.png (188×710 px, 25 KB)
Mar 13 2024, 9:43 PM · Web-Team-Backlog (FY2023-24 Q3 Sprint 5), Product-Analytics (Kanban), MediaWiki-extensions-WikimediaEvents
jwang added a comment to T353029: QA desktopwebuiactionstracking schema port to the new metrics platform.

Based on the number of events captured in the old and new schema, we believe the new schema is configured with the same sample rate as the old schema, as mentioned in T353029#9621127. However, it is recorded as 100% for all wikis in the new schema.

image.png (142×444 px, 14 KB)

Mar 13 2024, 6:28 PM · Patch-For-Review, Web-Team-Backlog (FY2023-24 Q3 Sprint 5), Product-Analytics (Kanban), MediaWiki-extensions-WikimediaEvents
jwang created T359993: Slowdown when querying via Hive.
Mar 13 2024, 12:46 AM · Data-Platform-SRE (2024.03.25 - 2024.04.14), Data-Platform

Mar 12 2024

jwang moved T357542: QA mobilewebuiactionstracking schema port to the new metrics platform from Next 2 weeks to Doing on the Product-Analytics (Kanban) board.
Mar 12 2024, 7:36 PM · Web-Team-Backlog (FY2023-24 Q3 Sprint 5), Product-Analytics (Kanban), MediaWiki-extensions-WikimediaEvents
jwang added a project to T357542: QA mobilewebuiactionstracking schema port to the new metrics platform: Product-Analytics (Kanban).
Mar 12 2024, 7:36 PM · Web-Team-Backlog (FY2023-24 Q3 Sprint 5), Product-Analytics (Kanban), MediaWiki-extensions-WikimediaEvents

Mar 11 2024

jwang updated the task description for T353029: QA desktopwebuiactionstracking schema port to the new metrics platform.
Mar 11 2024, 11:10 PM · Patch-For-Review, Web-Team-Backlog (FY2023-24 Q3 Sprint 5), Product-Analytics (Kanban), MediaWiki-extensions-WikimediaEvents
jwang added a comment to T353029: QA desktopwebuiactionstracking schema port to the new metrics platform.

@KSarabia-WMF , thanks for the info.

Mar 11 2024, 11:09 PM · Patch-For-Review, Web-Team-Backlog (FY2023-24 Q3 Sprint 5), Product-Analytics (Kanban), MediaWiki-extensions-WikimediaEvents
jwang updated the task description for T353029: QA desktopwebuiactionstracking schema port to the new metrics platform.
Mar 11 2024, 11:05 PM · Patch-For-Review, Web-Team-Backlog (FY2023-24 Q3 Sprint 5), Product-Analytics (Kanban), MediaWiki-extensions-WikimediaEvents
jwang added a comment to T353029: QA desktopwebuiactionstracking schema port to the new metrics platform.
What has been checkedStatusNoteSnapshot of the result from the old schemaSnapshot of the result from the new schema
Pick one session_id, compare the resultPASSCaptured same number of events.
image.png (410×736 px, 61 KB)
image.png (472×582 px, 44 KB)
Pick one pageview_id, compare the resultPASSCaptured same number of events.
image.png (378×632 px, 45 KB)
image.png (468×786 px, 57 KB)
By datePASSThe new schema captured 0.39% more events than the old schema. The new schema captured 0.34% more sessions than the old schema.
image.png (288×524 px, 34 KB)
image.png (284×510 px, 32 KB)
By actionPASSBetween March 1st and Match 5th, the new schema captured 0.18% more click events than old schema. The new schema captured 0.18% more click sessions than old schema. The new schema captured 0.58% more init events than old schema.The new schema captured 0.49% more init sessions than old schema. They are within 2.5% acceptable variance.
image.png (352×780 px, 49 KB)
image.png (334×704 px, 42 KB)
By event name4000+ types of event names in desktopwebuiactionstracking schema schema. Event names contain content info of the pages. . Some event names are in old schema but not in new schema, for example ui.sidebar-toc. Some event names are not in old schema but in new schema, for example, ns=0, most of them are from minerva skineven_name.diff_comparison
By wikiPASSNew schema captured 828 wikis, same as the old schema, in the month of Feb 2024.The highest different rate of session count is from small wikis. The events on nowiktionary is 42.3% fewer in new schema. The difference is reduced to 10% since 2024-02-26. The new schema captured 0.85% more events than the old schema in average.The new schema captured 1.44% more sessions than the old schema in average.
image.png (360×1 px, 51 KB)
By skin name❓ is it expected that the new schema captured 'minerva' skin with agent.client_platform_family='desktop_browser'.Based on the data collected from 20240301 to 20240305. The new schema captured 0.52% more vector events than old schema.The new schema captured 0.50% more vector sessions than old schema. The new schema captured 0.3% more vector2022 events than old schema.The new schema captured 0.25% more vector2022 sessions than old schema. minerva skin is not captured in old schema, but captured in new schema with agent.client_platform_family='desktop_browser'. To check with engineer whehter it is expected.
image.png (222×728 px, 31 KB)
image.png (326×672 px, 38 KB)
By user typePASSBased on the data collected from 20240301 to 20240305. New scheam captured more sessions and events than the old schema, but within 2.5% variance.
image.png (240×874 px, 37 KB)
image.png (240×658 px, 30 KB)
agent typePASS{F42562439}{F42562448}
edit count bucket❓ Is it expected that for logged-out users performer.edit_count_bucket is NULL in new schema, while in old schema, event.editCountBucket is '0 edits'.For logged-in users, editcountbucket difference is within 2.5% variance. For logged-out users, performer.edit_count_bucket is NULL in new schema, while in old schema, event.editCountBucket is '0 edits'. Need to confirm whether it is expected.
image.png (398×770 px, 59 KB)
image.png (460×942 px, 75 KB)
pageNamespacepage.namespace_id is NULL in new schema
image.png (332×800 px, 40 KB)
image.png (170×800 px, 24 KB)
viewportSizeBucketdiff is within 2.5% variance. new schema captured 2620 NULL viewportsizebucket with skin minerva . To check with engineer
image.png (454×806 px, 70 KB)
image.png (514×910 px, 78 KB)
is_dark_mode_on,❓ Is null in old schema expectedThe diff is within 2.5% variance. old schema captured some NULLs, while new schema did not. To check with engineer
image.png (328×968 px, 41 KB)
image.png (226×994 px, 37 KB)
is_dark_mode_prepared_by_os❓ Is null in old schema expectedThe diff is within 2.5% variance. old schema captured some NULLs, while new schema did not. To check with engineer
image.png (310×994 px, 40 KB)
image.png (240×990 px, 37 KB)
dark_mode_setting❓ Is null in old schema expectedThe differences in dark_mode_setting being 0, 2, and NULL are within a 2.5% variance. The difference in dark_mode_setting being 1 is larger than 2.5%. Due to the low volume and small absolute difference, we mark it as a pass.
image.png (330×818 px, 39 KB)
image.png (356×856 px, 41 KB)
is_full_widthThe diff is within a 2.5% variance. The old schema captured some NULLs, while new schema did not.The NULL is from anonymous users. To check with engineer
image.png (328×1 px, 72 KB)
is_media_viewer_enabledPASSThe difference is within a 2.5% variance
image.png (284×1 px, 70 KB)
is_page_preview_onPASSThe difference is within a 2.5% variance
image.png (320×1 px, 70 KB)
is_pinnedPASSThe difference is within a 2.5% variance
image.png (298×1 px, 69 KB)
fontThe diff in font=0,1,2 is within a 2.5% variance.. Some values, like large, null, regular and small, are captured in old schema only. To check with engineer.
image.png (610×1 px, 115 KB)
action_context,❓ is it expectedvalue is desktop for minerva skin in new schema
image.png (350×802 px, 36 KB)
is_botperformer.is_bot is NULL in new schema{F42603014}{F42603033}
sample rateincorrect in new schema
image.png (142×444 px, 14 KB)
Mar 11 2024, 10:54 PM · Patch-For-Review, Web-Team-Backlog (FY2023-24 Q3 Sprint 5), Product-Analytics (Kanban), MediaWiki-extensions-WikimediaEvents

Mar 8 2024

jwang added a comment to T352342: QA WebUIScroll port to the new metrics platform.

@KSarabia-WMF, thanks for checking. Can you also clarify what's the sample rate for logged-in users?

Mar 8 2024, 5:46 AM · Web-Team-Backlog (FY2023-24 Q3 Sprint 3), Data Products (Data Products Sprint 08), Metrics Platform Backlog, Product-Analytics (Kanban), MediaWiki-extensions-WikimediaEvents

Mar 7 2024

jwang moved T353029: QA desktopwebuiactionstracking schema port to the new metrics platform from Next 2 weeks to Doing on the Product-Analytics (Kanban) board.
Mar 7 2024, 4:52 PM · Patch-For-Review, Web-Team-Backlog (FY2023-24 Q3 Sprint 5), Product-Analytics (Kanban), MediaWiki-extensions-WikimediaEvents

Mar 6 2024

jwang added a comment to T352342: QA WebUIScroll port to the new metrics platform.

Hi, @KSarabia-WMF , Can you confirm if below sample rate captured in the new schema is correct?

Mar 6 2024, 9:46 PM · Web-Team-Backlog (FY2023-24 Q3 Sprint 3), Data Products (Data Products Sprint 08), Metrics Platform Backlog, Product-Analytics (Kanban), MediaWiki-extensions-WikimediaEvents
jwang added a comment to T357771: Analyze how many distinct devices edit per day from a given IP address.

@kostajh, please see the findings below.

Methodology

We reviewed the distribution of the number of distinct user agents that appear for a given IP address per day on each pilot wiki candidate and the largest wiki enwiki.
We also reviewed the worst-case scenario: the maximum number of the distinct user agents that appear for a given IP address per day across all wikis.
The analysis is limited to anonymous edits committed between 2024-01-01 and 2024-01-31.

Mar 6 2024, 6:28 PM · Product-Analytics (Kanban), Temporary accounts
jwang added a project to T359418: Analyze usage of desktop text size beta feature: Product-Analytics.
Mar 6 2024, 5:49 PM · Product-Analytics (Kanban), FY2023-24-WE 2.1 Typography and palette customizations, Web-Team-Backlog

Mar 5 2024

jwang updated the task description for T353029: QA desktopwebuiactionstracking schema port to the new metrics platform.
Mar 5 2024, 4:50 PM · Patch-For-Review, Web-Team-Backlog (FY2023-24 Q3 Sprint 5), Product-Analytics (Kanban), MediaWiki-extensions-WikimediaEvents
jwang added a comment to T353029: QA desktopwebuiactionstracking schema port to the new metrics platform.

@KSarabia-WMF, can you also provide the sample rate of the old schema DesktopWebUIActionsTracking? Thanks.

Mar 5 2024, 4:50 PM · Patch-For-Review, Web-Team-Backlog (FY2023-24 Q3 Sprint 5), Product-Analytics (Kanban), MediaWiki-extensions-WikimediaEvents
jwang added a comment to T357542: QA mobilewebuiactionstracking schema port to the new metrics platform.

@KSarabia-WMF, can you also provide the sample rate of the old schema MobileWebUIActionsTracking? Thanks.

Mar 5 2024, 4:49 PM · Web-Team-Backlog (FY2023-24 Q3 Sprint 5), Product-Analytics (Kanban), MediaWiki-extensions-WikimediaEvents
jwang updated the task description for T357542: QA mobilewebuiactionstracking schema port to the new metrics platform.
Mar 5 2024, 4:47 PM · Web-Team-Backlog (FY2023-24 Q3 Sprint 5), Product-Analytics (Kanban), MediaWiki-extensions-WikimediaEvents

Mar 4 2024

jwang claimed T357771: Analyze how many distinct devices edit per day from a given IP address.
Mar 4 2024, 5:53 PM · Product-Analytics (Kanban), Temporary accounts
jwang moved T357771: Analyze how many distinct devices edit per day from a given IP address from Next 2 weeks to Doing on the Product-Analytics (Kanban) board.
Mar 4 2024, 5:53 PM · Product-Analytics (Kanban), Temporary accounts

Feb 29 2024

jwang added a comment to T358685: Investigate best metric to measure or proxy reader retention.

HI @ovasileva, please see my investigation summary below.

Feb 29 2024, 1:26 AM · Product-Analytics (Kanban), Web-Team-Backlog

Feb 28 2024

jwang added a comment to T352342: QA WebUIScroll port to the new metrics platform.

Thanks for checking on it. Regarding 0.2% discrepancy, it can be marked as PASS given 1) it's within variance range , 2.5% variance for daily events across all wikis, that we defined in Metrics Platform Instrument Migration Data QA Process Description ; 2) the new instrumentation is capturing more unique sessions than old instrumentation.

Feb 28 2024, 10:02 PM · Web-Team-Backlog (FY2023-24 Q3 Sprint 3), Data Products (Data Products Sprint 08), Metrics Platform Backlog, Product-Analytics (Kanban), MediaWiki-extensions-WikimediaEvents
jwang moved T358685: Investigate best metric to measure or proxy reader retention from Next 2 weeks to Doing on the Product-Analytics (Kanban) board.
Feb 28 2024, 8:05 PM · Product-Analytics (Kanban), Web-Team-Backlog
jwang added a project to T358685: Investigate best metric to measure or proxy reader retention: Product-Analytics (Kanban).
Feb 28 2024, 8:05 PM · Product-Analytics (Kanban), Web-Team-Backlog

Feb 26 2024

jwang added a comment to T356335: Update WikimediaEvents "is_dark_mode_on" field.

I'll defer to Jennifer about 2 vs auto. I think it's better to do 2 personally in case these definitions ever change in future this will be more resilient to change.

Feb 26 2024, 6:14 PM · Verified, MW-1.42-notes (1.42.0-wmf.20; 2024-02-27), FY2023-24-WE 2.1 Typography and palette customizations, Web-Team-Backlog (FY2023-24 Q3 Sprint 3)

Feb 13 2024

jwang added a comment to T353029: QA desktopwebuiactionstracking schema port to the new metrics platform.

Migration of desktopwebuiactionstracking schema is ready for QA.
The mobilewebuiactionstracking schema is pending for migration.

Feb 13 2024, 11:42 PM · Patch-For-Review, Web-Team-Backlog (FY2023-24 Q3 Sprint 5), Product-Analytics (Kanban), MediaWiki-extensions-WikimediaEvents
jwang claimed T353029: QA desktopwebuiactionstracking schema port to the new metrics platform.
Feb 13 2024, 11:40 PM · Patch-For-Review, Web-Team-Backlog (FY2023-24 Q3 Sprint 5), Product-Analytics (Kanban), MediaWiki-extensions-WikimediaEvents

Feb 2 2024

jwang added a comment to T356335: Update WikimediaEvents "is_dark_mode_on" field.

Hi, thank you for bringing up and clarifying that.

Feb 2 2024, 7:12 PM · Verified, MW-1.42-notes (1.42.0-wmf.20; 2024-02-27), FY2023-24-WE 2.1 Typography and palette customizations, Web-Team-Backlog (FY2023-24 Q3 Sprint 3)
jwang added a comment to T352342: QA WebUIScroll port to the new metrics platform.

@phuedx, Here are some findings from my investigation.

Feb 2 2024, 6:33 PM · Web-Team-Backlog (FY2023-24 Q3 Sprint 3), Data Products (Data Products Sprint 08), Metrics Platform Backlog, Product-Analytics (Kanban), MediaWiki-extensions-WikimediaEvents

Jan 31 2024

jwang added a comment to T346979: Report on baseline for interface customization.

Here are the baselines for devices with a viewport larger than 1200px. @ovasileva , let me know if you have any questions.

Preview disable rate (viewport > 1200px)

Metric: Number of unique sessions with preview off (non-default)/ total number of unique initialized sessions (viewport > 1200px).
The following statistics are based on the data collected between Dec. 21, 2023 and Dec. 31, 2023

User typeDaily averageStd
Loggedin users44.37%0.27%
Anonymous users3.65%0.12%
Jan 31 2024, 9:23 PM · Product-Analytics (Kanban), FY2023-24-WE 2.1 Typography and palette customizations, Web-Team-Backlog

Jan 25 2024

jwang added a comment to T352342: QA WebUIScroll port to the new metrics platform.

@phuedx, Thanks for resolving all the questions. I will further investigate the remaining question of why the numbers of events, sessions and pages are slightly higher in the new schema. Will bring it up to you when I have more data.

Jan 25 2024, 8:03 PM · Web-Team-Backlog (FY2023-24 Q3 Sprint 3), Data Products (Data Products Sprint 08), Metrics Platform Backlog, Product-Analytics (Kanban), MediaWiki-extensions-WikimediaEvents
jwang updated the task description for T352342: QA WebUIScroll port to the new metrics platform.
Jan 25 2024, 7:51 PM · Web-Team-Backlog (FY2023-24 Q3 Sprint 3), Data Products (Data Products Sprint 08), Metrics Platform Backlog, Product-Analytics (Kanban), MediaWiki-extensions-WikimediaEvents

Jan 20 2024

jwang added a comment to T352342: QA WebUIScroll port to the new metrics platform.

Questions to confirm with engineers

  1. The number of events, sessions and pages are slightly higher in the new schema. Is it expected?
  2. Which field is to capture Spider user agent?
  3. Is access_method captured in agent.client_platform_family in the new schema?
  4. Please review the field mapping table below and confirm whether all entries are as expected.
Field in old schemaField in new schemaValue example
actionactionscroll-to-top
action_contextNULL
action_sourceNULL
action_subtypeNULL
web_session_idperformer.session_ide.g. , '2751f1d9e9a0417cbc1x'
meta.dtmeta.dte.g. "2024-01-16T00:17:25.272Z"
page_idpage.id59519
access_methodagent.client_platform_family❓access_method= 'desktop' ; agent.client_platform_family='desktop_browser'
is_anonperformer.is_logged_intrue, false. The old schema captures the status of being an anoymous user, while the new schema captures the status of being a loggedin users.
skinmediawiki.skinvector-2022
user_agent_map['device_family']MISSING ❓Spider
Jan 20 2024, 1:44 AM · Web-Team-Backlog (FY2023-24 Q3 Sprint 3), Data Products (Data Products Sprint 08), Metrics Platform Backlog, Product-Analytics (Kanban), MediaWiki-extensions-WikimediaEvents

Jan 17 2024

jwang added a comment to T346979: Report on baseline for interface customization.

Perhaps the best thing to do here would be to only consider devices with > 1200px for the desktop milestone. What do you think?

Jan 17 2024, 6:26 PM · Product-Analytics (Kanban), FY2023-24-WE 2.1 Typography and palette customizations, Web-Team-Backlog

Jan 12 2024

jwang added a comment to T346979: Report on baseline for interface customization.

I have further investigated the preview disable rate in 1000px-1199px viewport bucket, analyzing it by device families and wikis.
In summary, the high preview disable rate in the 1000px-1199px bucket is influenced primarily by devices in the Mac family, specifically those running Mac OS X with the version details: os_major 10 and os_minor 15.

By device family
Jan 12 2024, 10:07 PM · Product-Analytics (Kanban), FY2023-24-WE 2.1 Typography and palette customizations, Web-Team-Backlog

Jan 10 2024

jwang closed T336899: Capture Special:Mute data for final analysis as Resolved.
Jan 10 2024, 4:41 PM · Product-Analytics (Kanban), Anti-Harassment
jwang moved T346979: Report on baseline for interface customization from Next 2 weeks to Doing on the Product-Analytics (Kanban) board.
Jan 10 2024, 4:41 PM · Product-Analytics (Kanban), FY2023-24-WE 2.1 Typography and palette customizations, Web-Team-Backlog
jwang moved T352342: QA WebUIScroll port to the new metrics platform from Next 2 weeks to Doing on the Product-Analytics (Kanban) board.
Jan 10 2024, 4:41 PM · Web-Team-Backlog (FY2023-24 Q3 Sprint 3), Data Products (Data Products Sprint 08), Metrics Platform Backlog, Product-Analytics (Kanban), MediaWiki-extensions-WikimediaEvents
jwang moved T353970: Track metrics on Portuguese Wikipedia relating to IP-editing turn off from Next 2 weeks to Doing on the Product-Analytics (Kanban) board.
Jan 10 2024, 4:40 PM · Product-Analytics (Kanban), Temporary accounts

Jan 9 2024

jwang updated subscribers of T346979: Report on baseline for interface customization.

@ovasileva, @Jdlrobson, I have reran the analysis using the recent data as we discussed.

Jan 9 2024, 9:56 PM · Product-Analytics (Kanban), FY2023-24-WE 2.1 Typography and palette customizations, Web-Team-Backlog
jwang closed T342698: IP Info monitoring dashboard as Resolved.
Jan 9 2024, 6:40 PM · Product-Analytics (Kanban), Anti-Harassment
jwang updated the task description for T342698: IP Info monitoring dashboard.
Jan 9 2024, 6:40 PM · Product-Analytics (Kanban), Anti-Harassment
jwang added a comment to T342698: IP Info monitoring dashboard.

Dashboard has been published at https://superset.wikimedia.org/superset/dashboard/p/xgaOAD5rz2A/

Jan 9 2024, 6:39 PM · Product-Analytics (Kanban), Anti-Harassment

Jan 4 2024

jwang added a comment to T346979: Report on baseline for interface customization.

Hello @Sj, we only collect data on the viewport size buckets, and these are segmented into six groups. Unfortunately, 1400px is not the threshold to divide groups. I hope the data below still provides insights into how full-width preference varies in each group.

Jan 4 2024, 11:02 PM · Product-Analytics (Kanban), FY2023-24-WE 2.1 Typography and palette customizations, Web-Team-Backlog
jwang added a comment to T346979: Report on baseline for interface customization.

Hi @ovasileva , when I analyzed the data based on the viewport size buckets, I noticed that the preview disable rate in the 1000px-1199px group was significantly higher than in the adjacent bucket groups for both anonymous users and logged-in users. Is there any specific reason for this?

Jan 4 2024, 10:11 PM · Product-Analytics (Kanban), FY2023-24-WE 2.1 Typography and palette customizations, Web-Team-Backlog

Dec 19 2023

jwang updated subscribers of T348033: Provide a recent analysis on usage of width toggle.
Dec 19 2023, 6:28 PM · Web-Team-Backlog, Desktop Improvements (Vector 2022)
jwang added a comment to T348033: Provide a recent analysis on usage of width toggle.

Hi @Sj, we have discussed your questions within the team. The request requires recording the reader's user_id along with the timestamps of their visits and clicks. We don't track such detailed info for readers. The session based stats are the closest approximation we have for readers.

Dec 19 2023, 6:15 PM · Web-Team-Backlog, Desktop Improvements (Vector 2022)
jwang updated the task description for T346979: Report on baseline for interface customization.
Dec 19 2023, 12:10 AM · Product-Analytics (Kanban), FY2023-24-WE 2.1 Typography and palette customizations, Web-Team-Backlog
jwang updated the task description for T346979: Report on baseline for interface customization.
Dec 19 2023, 12:09 AM · Product-Analytics (Kanban), FY2023-24-WE 2.1 Typography and palette customizations, Web-Team-Backlog

Dec 18 2023

jwang added a comment to T346979: Report on baseline for interface customization.

@Sj, please see the broken down of preview, width and media viewer at T346979#9285473.

Dec 18 2023, 5:42 PM · Product-Analytics (Kanban), FY2023-24-WE 2.1 Typography and palette customizations, Web-Team-Backlog

Dec 14 2023

jwang edited projects for T336899: Capture Special:Mute data for final analysis, added: Product-Analytics (Kanban); removed Product-Analytics.
Dec 14 2023, 7:24 PM · Product-Analytics (Kanban), Anti-Harassment
jwang edited projects for T352342: QA WebUIScroll port to the new metrics platform, added: Product-Analytics (Kanban); removed Product-Analytics.
Dec 14 2023, 7:14 PM · Web-Team-Backlog (FY2023-24 Q3 Sprint 3), Data Products (Data Products Sprint 08), Metrics Platform Backlog, Product-Analytics (Kanban), MediaWiki-extensions-WikimediaEvents
jwang triaged T353029: QA desktopwebuiactionstracking schema port to the new metrics platform as High priority.
Dec 14 2023, 7:13 PM · Patch-For-Review, Web-Team-Backlog (FY2023-24 Q3 Sprint 5), Product-Analytics (Kanban), MediaWiki-extensions-WikimediaEvents
jwang moved T350709: QA User Preferences Selection Panel Instrumentation from Next 2 weeks to Doing on the Product-Analytics (Kanban) board.
Dec 14 2023, 7:11 PM · Product-Analytics (Kanban), FY2023-24-WE 2.1 Typography and palette customizations, Web-Team-Backlog
jwang added a project to T350709: QA User Preferences Selection Panel Instrumentation: Product-Analytics (Kanban).
Dec 14 2023, 7:11 PM · Product-Analytics (Kanban), FY2023-24-WE 2.1 Typography and palette customizations, Web-Team-Backlog
jwang added a comment to T350709: QA User Preferences Selection Panel Instrumentation.
Summary of data QA for the data collected on Dec 14, 2023
Dec 14 2023, 7:10 PM · Product-Analytics (Kanban), FY2023-24-WE 2.1 Typography and palette customizations, Web-Team-Backlog

Dec 13 2023

jwang updated the task description for T345674: Migrate the IP masking dashboard staging tables to partitioned tables.
Dec 13 2023, 5:32 PM · Product-Analytics (Kanban)
jwang closed T345674: Migrate the IP masking dashboard staging tables to partitioned tables as Resolved.

All to-dos are done.

Dec 13 2023, 5:31 PM · Product-Analytics (Kanban)

Dec 11 2023

jwang renamed T353029: QA desktopwebuiactionstracking schema port to the new metrics platform from QA *webUIActions schema port to the new metrics platform to QA desktopwebuiactionstracking and mobilewebuiactionstracking schema port to the new metrics platform.
Dec 11 2023, 5:25 PM · Patch-For-Review, Web-Team-Backlog (FY2023-24 Q3 Sprint 5), Product-Analytics (Kanban), MediaWiki-extensions-WikimediaEvents

Dec 5 2023

jwang updated subscribers of T352342: QA WebUIScroll port to the new metrics platform.
Dec 5 2023, 10:11 PM · Web-Team-Backlog (FY2023-24 Q3 Sprint 3), Data Products (Data Products Sprint 08), Metrics Platform Backlog, Product-Analytics (Kanban), MediaWiki-extensions-WikimediaEvents
jwang updated the task description for T350709: QA User Preferences Selection Panel Instrumentation.
Dec 5 2023, 7:20 PM · Product-Analytics (Kanban), FY2023-24-WE 2.1 Typography and palette customizations, Web-Team-Backlog
jwang updated the task description for T352342: QA WebUIScroll port to the new metrics platform.
Dec 5 2023, 7:09 PM · Web-Team-Backlog (FY2023-24 Q3 Sprint 3), Data Products (Data Products Sprint 08), Metrics Platform Backlog, Product-Analytics (Kanban), MediaWiki-extensions-WikimediaEvents
jwang claimed T352342: QA WebUIScroll port to the new metrics platform.
Dec 5 2023, 7:05 PM · Web-Team-Backlog (FY2023-24 Q3 Sprint 3), Data Products (Data Products Sprint 08), Metrics Platform Backlog, Product-Analytics (Kanban), MediaWiki-extensions-WikimediaEvents

Dec 1 2023

jwang added a comment to T346979: Report on baseline for interface customization.

@ovasileva ,please see the analysis of pin rate and overall non-default rate below.

Methodology

Dec 1 2023, 11:48 PM · Product-Analytics (Kanban), FY2023-24-WE 2.1 Typography and palette customizations, Web-Team-Backlog
jwang updated the task description for T346979: Report on baseline for interface customization.
Dec 1 2023, 12:08 AM · Product-Analytics (Kanban), FY2023-24-WE 2.1 Typography and palette customizations, Web-Team-Backlog

Nov 29 2023

jwang updated the task description for T342698: IP Info monitoring dashboard.
Nov 29 2023, 10:53 PM · Product-Analytics (Kanban), Anti-Harassment

Nov 9 2023

jwang added a comment to T346979: Report on baseline for interface customization.

@ovasileva , here is the baseline collection for font size on mobile web. Let me know if you have any questions.

Summary
% of pageview sessions which have set a non-default font size in the Minerva skin (On mobile web)

Metric: Number of unique sessions with regular font size disabled (non-default) / total number of unique initialized sessions
The following statistics are based on the data collected between Nov.4, 2023 and Nov.8 , 2023 (incomplete date) ,

user typeminmaxavgstd
Logged-in users1.18%1.28%1.24%0.04%
Anonymous users0.0139%0.0147%0.0144%0.0003%
Nov 9 2023, 1:17 AM · Product-Analytics (Kanban), FY2023-24-WE 2.1 Typography and palette customizations, Web-Team-Backlog
jwang added a comment to T346978: QA instrumentation for baseline for interface customization.
Summary of data QA for the data collected on Nov 8, 2023
Minerva (Mobile)

Schema: event.MobileWebUIActionsTracking
Instrumentation purpose: collect baseline for % of pageviews which have set a non-default font size in the Minerva skin (On mobile web)

What has been checkedField nameStatusNoteSnapshot of the result
Font sizeevent.font✅ PASSThe expected values are: small, regular, large, xlarge. The numbers of '0' and ‘NULL’ are tapering off, but will take more days.
image.png (504×702 px, 62 KB)
Nov 9 2023, 12:42 AM · Product-Analytics (Kanban), Web-Team-Backlog (Web Team FY2023-24 Q2 Sprint 2), FY2023-24-WE 2.1 Typography and palette customizations