Page MenuHomePhabricator

kzimmerman (Kate Zimmerman)
Director of Data Science, Product Analytics, Wikimedia Foundation

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Friday

  • Clear sailing ahead.

User Details

User Since
Oct 26 2018, 9:14 PM (229 w, 4 d)
Availability
Available
IRC Nick
kzeta
LDAP User
Unknown
MediaWiki User
KZimmerman (WMF) [ Global Accounts ]

Recent Activity

Yesterday

kzimmerman updated subscribers of T328049: Investigate the effects of IP Masking on Data Eng systems.
Tue, Mar 21, 11:18 PM · Data Pipelines (sprint 10)
kzimmerman closed T331463: IE11 User and Traffic Analysis update 2023 as Resolved.
Tue, Mar 21, 5:21 PM · Design-Systems-Team, Browser-Support-Internet-Explorer, Product-Analytics
kzimmerman moved T332205: Clarify definitions around anonymous and temporary editors from Triage to Upcoming Quarter on the Product-Analytics board.
Tue, Mar 21, 5:19 PM · Data-Engineering, IP Masking, Product-Analytics
kzimmerman triaged T332205: Clarify definitions around anonymous and temporary editors as High priority.
Tue, Mar 21, 5:19 PM · Data-Engineering, IP Masking, Product-Analytics
kzimmerman moved T332212: Major (API) versioning of Event Platform streams from Triage to Tracking on the Product-Analytics board.
Tue, Mar 21, 5:14 PM · Metrics-Platform-Planning, Product-Analytics, Data-Engineering, WMF-Architecture-Team, Event-Platform Value Stream
kzimmerman moved T332621: Add log_search to monthly sqoop list from Triage to Tracking on the Product-Analytics board.
Tue, Mar 21, 5:11 PM · Product-Analytics, Data-Engineering
kzimmerman moved T330780: Regional views for Foundation-level metrics from Next 2 weeks to Doing on the Product-Analytics (Kanban) board.
Tue, Mar 21, 5:10 PM · Product-Analytics (Kanban)

Mon, Mar 20

kzimmerman added a comment to T303655: Decision on mobile app Commons pageviews.

Impact of this is low on high level metrics, so we would not prioritize this work from the high-level metrics perspective.

Mon, Mar 20, 6:39 PM · Product-Analytics
kzimmerman added a comment to T277703: Decision & communication around filtering automated traffic from Key Product Metrics.

We are increasingly *also* reporting on user pageviews, because we've seen increases in automated traffic since 2022. However, we only split out automated traffic in 2020, so for longer-term historical views we still need to include automated traffic.

Mon, Mar 20, 6:25 PM · Product-Analytics
kzimmerman closed T294954: Investigate drop in Page Previews data as Declined.
Mon, Mar 20, 6:23 PM · Product-Analytics (Kanban)
kzimmerman closed T327943: Run data visualization code on Jupyter notebooks on statbox as Resolved.

Thank you @HXi-WMF!

Mon, Mar 20, 6:15 PM · Product-Analytics (Kanban)
kzimmerman moved T332399: Content data for regional views from Triage to Tracking on the Product-Analytics board.
Mon, Mar 20, 6:11 PM · Product-Analytics, Research
kzimmerman edited projects for T332399: Content data for regional views, added: Product-Analytics; removed Product-Analytics (Kanban).
Mon, Mar 20, 6:10 PM · Product-Analytics, Research
kzimmerman added a project to T332399: Content data for regional views: Research.
Mon, Mar 20, 6:10 PM · Product-Analytics, Research

Thu, Mar 16

kzimmerman added a comment to T329585: Timeline visualization template.

Thank you @HXi-WMF , I think these look good! Can you also provide an update for active editors - returning vs new using this format?

Thu, Mar 16, 10:05 PM · Product-Analytics (Kanban)

Wed, Mar 15

kzimmerman updated the task description for T332205: Clarify definitions around anonymous and temporary editors.
Wed, Mar 15, 11:06 PM · Data-Engineering, IP Masking, Product-Analytics
kzimmerman added a project to T332205: Clarify definitions around anonymous and temporary editors: Data-Engineering.

Adding Data-Engineering since we will work with them on the technical details.

Wed, Mar 15, 5:52 PM · Data-Engineering, IP Masking, Product-Analytics
kzimmerman created T332205: Clarify definitions around anonymous and temporary editors.
Wed, Mar 15, 5:49 PM · Data-Engineering, IP Masking, Product-Analytics

Fri, Mar 10

kzimmerman reassigned T330780: Regional views for Foundation-level metrics from kzimmerman to Miriam.

Assigning to @Miriam as she'll be putting together the April staff meeting presentation and deciding on how to show the data.

Fri, Mar 10, 7:50 PM · Product-Analytics (Kanban)
kzimmerman added a comment to T329585: Timeline visualization template.

@HXi-WMF updates to charts per our discussion:

  • separate chart files into two groups: Readers (unique devices, content interactions) and Editors (active editors, active editors - returning vs new, net new content). This will make it easier to incorporate the charts into the two separate repos we have right now.
  • add buffer/padding to Y axis, so that the chart's peak and trough do not extend through the full Y axis (per the draft Data visualization guidelines that Maya mentioned: "a good measure is for the line to fill two-thirds of the chart area height"
  • For the data loss data: I agree with your suggestion of putting the data directly into the code as a temporary solution, rather than referencing data from Google docs. Make sure it’s well-commented.
Fri, Mar 10, 7:48 PM · Product-Analytics (Kanban)
kzimmerman closed T329586: Visualizations for pageviews to Wikimedia filtered by agent type user as Resolved.
Fri, Mar 10, 7:22 PM · Product-Analytics (Kanban)
kzimmerman updated the task description for T329586: Visualizations for pageviews to Wikimedia filtered by agent type user.
Fri, Mar 10, 7:21 PM · Product-Analytics (Kanban)
kzimmerman closed T328978: Visualization for Unique Devices as Resolved.
Fri, Mar 10, 7:21 PM · Product-Analytics (Kanban)
kzimmerman updated the task description for T331359: Reader data for regional views.
Fri, Mar 10, 12:29 AM · Product-Analytics (Kanban)
kzimmerman updated the task description for T330780: Regional views for Foundation-level metrics.
Fri, Mar 10, 12:26 AM · Product-Analytics (Kanban)
kzimmerman updated the task description for T330780: Regional views for Foundation-level metrics.
Fri, Mar 10, 12:22 AM · Product-Analytics (Kanban)
kzimmerman triaged T331463: IE11 User and Traffic Analysis update 2023 as Medium priority.

@Volker_E After discussing this with the team, I suggest coming in to office hours to help you get the data needed from Superset. I explored the data and put together this dashboard - https://superset.wikimedia.org/superset/dashboard/403/ - so perhaps that meets your needs? The first chart shows the proportion of pageviews from IE11 out of all pageviews - and has conditional formatting to flag where that proportion is above 0.1%

Fri, Mar 10, 12:13 AM · Design-Systems-Team, Browser-Support-Internet-Explorer, Product-Analytics

Thu, Mar 9

kzimmerman updated the task description for T331359: Reader data for regional views.
Thu, Mar 9, 12:19 AM · Product-Analytics (Kanban)
kzimmerman added a comment to T331359: Reader data for regional views.

We should continue to report on Wikipedia unique devices. Thank you for checking! I'll update the description to specify.

Thu, Mar 9, 12:18 AM · Product-Analytics (Kanban)

Tue, Mar 7

kzimmerman updated the task description for T331361: Editor data for regional views.
Tue, Mar 7, 8:26 PM · Product-Analytics (Kanban)
kzimmerman closed T327027: Massive spike in pageviews for a few enwiki pages beginning with "Index" as Declined.

I've associated this with the ticket about improving automated detection - T280565

Tue, Mar 7, 7:08 PM · Product-Analytics, Data Pipelines, Data-Engineering-Planning, Pageviews-Anomaly
kzimmerman closed T327027: Massive spike in pageviews for a few enwiki pages beginning with "Index", a subtask of T280565: Improve pageview automated traffic detection heuristics, as Declined.
Tue, Mar 7, 7:08 PM · Data-Engineering
kzimmerman added a subtask for T280565: Improve pageview automated traffic detection heuristics: T327027: Massive spike in pageviews for a few enwiki pages beginning with "Index".
Tue, Mar 7, 6:59 PM · Data-Engineering
kzimmerman added a parent task for T327027: Massive spike in pageviews for a few enwiki pages beginning with "Index": T280565: Improve pageview automated traffic detection heuristics.
Tue, Mar 7, 6:59 PM · Product-Analytics, Data Pipelines, Data-Engineering-Planning, Pageviews-Anomaly
kzimmerman moved T328978: Visualization for Unique Devices from Doing to Needs Sign-off on the Product-Analytics (Kanban) board.
Tue, Mar 7, 6:13 PM · Product-Analytics (Kanban)
kzimmerman moved T329586: Visualizations for pageviews to Wikimedia filtered by agent type user from Doing to Needs Sign-off on the Product-Analytics (Kanban) board.
Tue, Mar 7, 6:13 PM · Product-Analytics (Kanban)
kzimmerman closed T330939: February 2023 Wikimedia movement metrics as Resolved.
Tue, Mar 7, 6:05 PM · Product-Analytics (Kanban)
kzimmerman closed T328493: January 2023 Wikimedia movement metrics as Resolved.
Tue, Mar 7, 6:05 PM · Product-Analytics (Kanban)

Mon, Mar 6

kzimmerman created T331361: Editor data for regional views.
Mon, Mar 6, 9:57 PM · Product-Analytics (Kanban)
kzimmerman created T331359: Reader data for regional views.
Mon, Mar 6, 9:55 PM · Product-Analytics (Kanban)
kzimmerman updated the task description for T330780: Regional views for Foundation-level metrics.
Mon, Mar 6, 9:50 PM · Product-Analytics (Kanban)

Tue, Feb 28

kzimmerman created T330780: Regional views for Foundation-level metrics.
Tue, Feb 28, 5:54 PM · Product-Analytics (Kanban)

Mon, Feb 27

kzimmerman updated the task description for T329585: Timeline visualization template.
Mon, Feb 27, 9:33 PM · Product-Analytics (Kanban)

Tue, Feb 21

kzimmerman added a comment to T329585: Timeline visualization template.

@HXi-WMF One revision for Unique Devices: since there's only one line, can you take out the blue "Unique Devices" label? Thank you!

Tue, Feb 21, 8:36 PM · Product-Analytics (Kanban)
kzimmerman added a comment to T329586: Visualizations for pageviews to Wikimedia filtered by agent type user.

@HXi-WMF thank you! I want to go with (C), with the January YoY comparison. Can you also add the yellow circles to flag January 2023 and January 2022?

Tue, Feb 21, 7:57 PM · Product-Analytics (Kanban)
kzimmerman closed T326753: Visualization for Net New Content by Wikipedia, Wikidata, and Commons timeline as Resolved.
Tue, Feb 21, 7:34 PM · Product-Analytics (Kanban)
kzimmerman closed T326279: Visualization for New vs. Returning Editors timeline as Resolved.
Tue, Feb 21, 7:33 PM · Product-Analytics (Kanban)
kzimmerman closed T324372: Visualizations for Content Interactions and Editors timelines as Resolved.
Tue, Feb 21, 7:33 PM · Product-Analytics (Kanban)
kzimmerman closed T329588: Table visualization for user pageviews and active editors by country as Resolved.
Tue, Feb 21, 7:32 PM · Product-Analytics (Kanban)
kzimmerman moved T329588: Table visualization for user pageviews and active editors by country from Doing to Needs Sign-off on the Product-Analytics (Kanban) board.
Tue, Feb 21, 7:31 PM · Product-Analytics (Kanban)
kzimmerman updated the task description for T329588: Table visualization for user pageviews and active editors by country.
Tue, Feb 21, 7:31 PM · Product-Analytics (Kanban)
kzimmerman added a comment to T329588: Table visualization for user pageviews and active editors by country.

Thank you @HXi-WMF !

Tue, Feb 21, 7:31 PM · Product-Analytics (Kanban)
kzimmerman added a comment to T329585: Timeline visualization template.

@HXi-WMF Leaving them here is perfect - thank you! I'm going to start using this ticket for reference when we talk about timeline visuals.

Tue, Feb 21, 5:55 PM · Product-Analytics (Kanban)

Feb 17 2023

kzimmerman updated the task description for T329588: Table visualization for user pageviews and active editors by country.
Feb 17 2023, 9:17 PM · Product-Analytics (Kanban)
kzimmerman added a comment to T328978: Visualization for Unique Devices.

Thank you @HXi-WMF ! I think I prefer (1 - white box) although I'm a little torn. But let's try that one and see how it goes.

Feb 17 2023, 8:15 PM · Product-Analytics (Kanban)
kzimmerman reassigned T329588: Table visualization for user pageviews and active editors by country from Mayakp.wiki to HXi-WMF.

Thank you @Mayakp.wiki and @Iflorez ! Reassigning to @HXi-WMF

Feb 17 2023, 6:28 PM · Product-Analytics (Kanban)
kzimmerman reassigned T329586: Visualizations for pageviews to Wikimedia filtered by agent type user from Mayakp.wiki to HXi-WMF.

Thank you @Mayakp.wiki ! Reassigning to @HXi-WMF

Feb 17 2023, 6:26 PM · Product-Analytics (Kanban)
kzimmerman moved T329585: Timeline visualization template from Next 2 weeks to Doing on the Product-Analytics (Kanban) board.
Feb 17 2023, 6:25 PM · Product-Analytics (Kanban)
kzimmerman added a comment to T329585: Timeline visualization template.

Thank you @HXi-WMF ! I prefer (1). Can you go ahead and make the other charts like that?

Feb 17 2023, 6:25 PM · Product-Analytics (Kanban)

Feb 14 2023

kzimmerman renamed T329586: Visualizations for pageviews to Wikimedia filtered by agent type user from Visualizations for global user pageviews to Visualizations for pageviews to Wikimedia filtered by agent type user.
Feb 14 2023, 12:09 AM · Product-Analytics (Kanban)
kzimmerman removed a project from T329586: Visualizations for pageviews to Wikimedia filtered by agent type user: GlobalUserPage.

@Legoktm ah, this is not about GlobalUserPages. It's about pageviews to all Wikimedia sites, worldwide (not filtered by country), and only filtered to user traffic. Something like this on Wikistats https://stats.wikimedia.org/#/all-projects/reading/total-page-views/normal|line|2-year|agent~user|monthly

Feb 14 2023, 12:08 AM · Product-Analytics (Kanban)

Feb 13 2023

kzimmerman updated the task description for T329585: Timeline visualization template.
Feb 13 2023, 11:56 PM · Product-Analytics (Kanban)
kzimmerman updated the task description for T329588: Table visualization for user pageviews and active editors by country.
Feb 13 2023, 11:51 PM · Product-Analytics (Kanban)
kzimmerman created T329588: Table visualization for user pageviews and active editors by country.
Feb 13 2023, 11:47 PM · Product-Analytics (Kanban)
kzimmerman triaged T329586: Visualizations for pageviews to Wikimedia filtered by agent type user as High priority.
Feb 13 2023, 11:42 PM · Product-Analytics (Kanban)
kzimmerman created T329586: Visualizations for pageviews to Wikimedia filtered by agent type user.
Feb 13 2023, 11:42 PM · Product-Analytics (Kanban)
kzimmerman triaged T329585: Timeline visualization template as High priority.
Feb 13 2023, 11:41 PM · Product-Analytics (Kanban)
kzimmerman created T329585: Timeline visualization template.
Feb 13 2023, 11:41 PM · Product-Analytics (Kanban)
kzimmerman closed T326188: December 2022 Wikimedia movement metrics as Resolved.
Feb 13 2023, 11:39 PM · Product-Analytics (Kanban)
kzimmerman moved T328978: Visualization for Unique Devices from Next 2 weeks to Doing on the Product-Analytics (Kanban) board.
Feb 13 2023, 11:03 PM · Product-Analytics (Kanban)
kzimmerman added a comment to T328978: Visualization for Unique Devices.

Thank you @HXi-WMF ! That is a good point about not drawing too much attention to the blocked out area because that's not the intent of the chart.

Feb 13 2023, 6:11 PM · Product-Analytics (Kanban)

Feb 7 2023

kzimmerman added a comment to T313270: Add new region definitions to canonical data repository.

We are waiting for the GDI tables to be productionized before moving forward with this. Need to check in on T310712

Feb 7 2023, 6:08 PM · Product-Analytics
kzimmerman moved T313270: Add new region definitions to canonical data repository from Triage to Blocked on the Product-Analytics board.
Feb 7 2023, 6:05 PM · Product-Analytics
kzimmerman edited projects for T313270: Add new region definitions to canonical data repository, added: Product-Analytics; removed Product-Analytics (Kanban).
Feb 7 2023, 6:05 PM · Product-Analytics

Feb 6 2023

kzimmerman created T328978: Visualization for Unique Devices.
Feb 6 2023, 10:49 PM · Product-Analytics (Kanban)
kzimmerman closed T327211: Additional viz for Pageviews, a subtask of T324372: Visualizations for Content Interactions and Editors timelines, as Resolved.
Feb 6 2023, 10:29 PM · Product-Analytics (Kanban)
kzimmerman closed T327211: Additional viz for Pageviews as Resolved.
Feb 6 2023, 10:29 PM · Product-Analytics (Kanban)
kzimmerman moved T326279: Visualization for New vs. Returning Editors timeline from Needs Review to Needs Sign-off on the Product-Analytics (Kanban) board.
Feb 6 2023, 10:26 PM · Product-Analytics (Kanban)
kzimmerman moved T324372: Visualizations for Content Interactions and Editors timelines from Needs Review to Needs Sign-off on the Product-Analytics (Kanban) board.
Feb 6 2023, 10:26 PM · Product-Analytics (Kanban)
kzimmerman moved T326753: Visualization for Net New Content by Wikipedia, Wikidata, and Commons timeline from Needs Review to Needs Sign-off on the Product-Analytics (Kanban) board.
Feb 6 2023, 10:26 PM · Product-Analytics (Kanban)
kzimmerman moved T327211: Additional viz for Pageviews from Needs Review to Needs Sign-off on the Product-Analytics (Kanban) board.
Feb 6 2023, 10:26 PM · Product-Analytics (Kanban)

Feb 3 2023

kzimmerman added a comment to T313114: Analyze possible bot traffic for frwiki article Cookie (informatique).

@PBradley-WMF I took a look at the top linking sites for https://fr.wikipedia.org/wiki/Cookie_(informatique), and compared it to https://de.wikipedia.org/wiki/HTTP-Cookie, https://en.wikipedia.org/wiki/Cookie, https://en.wikipedia.org/wiki/HTTP_Cookie, and https://en.wikipedia.org/wiki/HTTP_Cookie. Several of the top linking sites for the French & German cookie pages are the same, actually. One common thread is that the links to Wikipedia appear in the privacy banner on pages (I saw this in several, but not all, of the websites I checked).

Feb 3 2023, 8:16 PM · Wikipedia-iOS-App-Backlog, Wikipedia-Android-App-Backlog, Data Pipelines, Product-Analytics, Data-Engineering-Planning, Pageviews-Anomaly

Feb 1 2023

kzimmerman added a comment to T313114: Analyze possible bot traffic for frwiki article Cookie (informatique).

Wanted to note a couple of things I had explored earlier:

Feb 1 2023, 11:37 PM · Wikipedia-iOS-App-Backlog, Wikipedia-Android-App-Backlog, Data Pipelines, Product-Analytics, Data-Engineering-Planning, Pageviews-Anomaly
kzimmerman added a comment to T328457: Review Superset permissions and assign roles as appropriate.

Thanks @BTullis , I'll ask my team to check and we'll reach out if we need anything else!

Feb 1 2023, 5:11 PM · Data Pipelines, Data-Engineering-Planning

Jan 31 2023

kzimmerman added a comment to T328457: Review Superset permissions and assign roles as appropriate.

@BTullis Can you please review permissions for the Product Analytics team? We should all have sql_lab access if we don't already.

Jan 31 2023, 5:54 PM · Data Pipelines, Data-Engineering-Planning

Jan 27 2023

kzimmerman closed T326660: December 2022 Wikimedia movement metrics as Resolved.
Jan 27 2023, 7:25 PM · Product-Analytics (Kanban)

Jan 26 2023

kzimmerman added a comment to T327221: Update the wiki comparison tool (2022).

@nshahquinn-wmf I think doing the Jan 2023 snapshot is a good solution, so I agree. It's better than trying to calculate the Dec 2022 snapshot based on 11 months of data for some columns, and this way we will still have a full year of seasonal data.

Jan 26 2023, 10:33 PM · Product-Analytics (Kanban)
kzimmerman closed T327814: Graph for growth of Wikimedia projects since 2014 as Resolved.

Thank you @HXi-WMF, the one with the blue works beautifully! I'm marking this as resolved.

Jan 26 2023, 5:55 PM · Product-Analytics (Kanban)

Jan 25 2023

kzimmerman added a comment to T327814: Graph for growth of Wikimedia projects since 2014.
  • Yes, the Wikidata line should be in front
  • I think the Wikidata line needs to stand out more - perhaps make it thicker and/or try different color combinations so it is more visually prominent
  • The color of the Commons label needs to match the color of the line
  • Graph notes: the link should go to Wikistats https://stats.wikimedia.org/
  • Remove the Y axis label; instead, I suggest for the title: "Growth of Wikimedia Projects: Content Items"
Jan 25 2023, 8:15 PM · Product-Analytics (Kanban)
kzimmerman moved T327814: Graph for growth of Wikimedia projects since 2014 from Next 2 weeks to Doing on the Product-Analytics (Kanban) board.
Jan 25 2023, 7:52 PM · Product-Analytics (Kanban)
kzimmerman triaged T327943: Run data visualization code on Jupyter notebooks on statbox as Medium priority.
Jan 25 2023, 6:39 PM · Product-Analytics (Kanban)
kzimmerman created T327943: Run data visualization code on Jupyter notebooks on statbox.
Jan 25 2023, 6:38 PM · Product-Analytics (Kanban)

Jan 24 2023

kzimmerman set Due Date to Jan 27 2023, 8:00 AM on T327814: Graph for growth of Wikimedia projects since 2014.
Jan 24 2023, 8:59 PM · Product-Analytics (Kanban)
kzimmerman assigned T327814: Graph for growth of Wikimedia projects since 2014 to HXi-WMF.
Jan 24 2023, 8:58 PM · Product-Analytics (Kanban)
kzimmerman created T327814: Graph for growth of Wikimedia projects since 2014.
Jan 24 2023, 8:58 PM · Product-Analytics (Kanban)
kzimmerman moved T326753: Visualization for Net New Content by Wikipedia, Wikidata, and Commons timeline from Next 2 weeks to Needs Review on the Product-Analytics (Kanban) board.
Jan 24 2023, 6:08 PM · Product-Analytics (Kanban)
kzimmerman moved T324372: Visualizations for Content Interactions and Editors timelines from Doing to Needs Review on the Product-Analytics (Kanban) board.
Jan 24 2023, 6:07 PM · Product-Analytics (Kanban)
kzimmerman moved T326279: Visualization for New vs. Returning Editors timeline from Doing to Needs Review on the Product-Analytics (Kanban) board.
Jan 24 2023, 6:07 PM · Product-Analytics (Kanban)
kzimmerman moved T327211: Additional viz for Pageviews from Doing to Needs Review on the Product-Analytics (Kanban) board.
Jan 24 2023, 6:07 PM · Product-Analytics (Kanban)
kzimmerman triaged T327211: Additional viz for Pageviews as High priority.
Jan 24 2023, 6:07 PM · Product-Analytics (Kanban)