Fri, Aug 16
Thu, Aug 15
Tue, Aug 13
I paired with @cchen on Friday to complete calculation of the Readers July metrics. Updated numbers and draft analysis notes added to the current staging deck template.
Fri, Aug 9
Thu, Aug 8
Here is a current summary of findings after looking into this the past couple days:
Tue, Aug 6
Mon, Aug 5
For reference, here are the two IE corrections currently applied when calculating the monthly pageviews for the board metrics.
I agree that leaving the tag as is (Option #1) makes the most sense given the complexity and potential issues that might arise with the other options.
Fri, Aug 2
Thu, Aug 1
A few quick checks of the current data:
Wed, Jul 31
Tue, Jul 30
Mon, Jul 29
Fri, Jul 26
Thu, Jul 25
I hashed any tokens or temporary identifiers for the following three active Web EventLogging schemas on the whitelist:
Wed, Jul 24
Tue, Jul 23
I made some small edits to indicate the date when the mobile web and amc tags were added to action in the log table and did some final report cleanup. There are no changes to the numbers previously reported. Please see final report. Let me know if you have any questions.
Jul 19 2019
Jul 18 2019
@ovasileva Here's the current report of the AMC KPIs. Let me know if you have any questions or need any additional info/adjustments for next week's Product meeting.
Jul 16 2019
@pmiazga - As discussed during standup, I agree we should replace userEditCount with editCountBucket (with the values you listed) so it's better suited for druid ingestion if we want the data available in Turnilo. Thanks for catching that!
Jul 15 2019
@ovasileva: Quick question- For the mobile edit rate and moderation action metrics, would you like to look at the overall rate across all target wikis identified for the project as defined here or just across all of the target wikis where AMC has been deployed to date for this report?
Jul 12 2019
Jul 11 2019
@pmiazga - I like the proposed name change: MobileWebUIActionsTracking
Jul 10 2019
Calculations for June 2019 complete. I've added my notes/analysis to the board deck.
Jul 9 2019
Jun 28 2019
Jun 26 2019
@kzimmerman Discussed this with Olga. The team has currently been using a Turnilo board to track the retention rate for opt-in advanced mobile mode users. I've added a link to the mediawiki page.
Jun 25 2019
Jun 13 2019
How many users use the "group" option as a user preference?
Jun 11 2019
@kzimmerman I've completed calculating May 2019 metrics and added the updated numbers and analysis notes to the board deck.
Jun 10 2019
Jun 5 2019
@phuedx - Do you know which user property controls the "group" preference for this question?
How frequently do users use the "group" option either as a filter or as a user preference? priority high
Jun 3 2019
I discussed the sampling rate for this with the Product Analytics team. Since all the eventlogging data was moved to Hadoop, there are fewer concerns about sending too much data from the Analytics Engineering side. Unless there are other concerns (happy to discuss), I think it's ok if we keep the sampling rate to 50% for navigation links outside the hamburger menu as well.
May 31 2019
Good news! I figured out access to the user properties table. I'll work on the questions that require info in this table starting with the following one marked as high priority:
How many users have enabled enhanced RC?
How frequently do users use the "group" option either as a filter or as a user preference?
May 30 2019
May 29 2019
May 28 2019
May 21 2019
Awesome, thanks @mforns! Looks good. Just a few questions/comments:
May 19 2019
May 14 2019
Which filters are used and how frequently?
@phuedx - Just confirming I'm currently working on Which filters are used and how frequently? Will post results soon
Apr 30 2019
I've updated the readers metrics for March 2019 (slides 24 and the readers interaction numbers for slide 25 ). Note: These are for the calendar month of March 2019 and not normalized as was done in previous months for readers data.
Apr 29 2019
Apr 26 2019
Pageview charts are updated through March 2019 and added to the current slide deck.
Apr 25 2019
Apr 24 2019
@kzimmerman - I added a stacked bar chart showing interactions for past four years through Feb 2019 to the draft slide deck. Let me know if you have any questions or need any adjustments for the metrics presentation.
Apr 18 2019
Apr 17 2019
And will create another task to add the user_tenure_field and also the edit_tags (Visualeditor, Wikitext, etc.) that will be added to the next snapshot of MediaWiki history.
If you think of any other dimension or metric to include, you could add it to that task.
Would that be OK?
Sounds good to me!
Data exploration results summarized and posted on meta. Marking as done but let me know if you have any questions!
Apr 16 2019
Thanks @mforns! And sorry for the delay. I'm reviewing the edits_hourly in turnilo and it looks good to me so far. The hourly resolution doesn't seem to be impacting the performance too much when I test adding various splits and filters so I'd recommend keeping it unless there are any major concerns.
Apr 1 2019
Mar 27 2019
Mar 26 2019
Here's the updated edit table schema with suggested transforms to ingest directly from mediawiki_history into Druid.
Mar 12 2019
Thanks for the update re the timeline. A meeting would be great - I’ll set up a time for us and @Neil_P._Quinn_WMF to meet this week if possible. I’ve worked with Neil to identify the simplified list of mediawiki_history dimensions and mapped those to druid expressions. I'll share with you soon and we can discuss at the meeting.
Feb 24 2019
Feb 21 2019
@mforns Thanks! Yes, happy to discuss and coordinate on this. I reviewed this task with @Neil_P._Quinn_WMF today. I'm going to first work on defining our desired dimensions and transformations based on the type of queries we'd want to run and how the data will be used, which might help inform the best method for loading the dataset. I’ll reach out to discuss once we have a better idea of the needed transforms if that works for you.