mforns (Marcel Ruiz Forns)
Software Engineer @ Analytics

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Saturday

  • Clear sailing ahead.

User Details

User Since
Nov 7 2014, 8:52 PM (209 w, 5 d)
Availability
Available
IRC Nick
mforns
LDAP User
Mforns
MediaWiki User
Unknown

Recent Activity

Today

mforns added a comment to T209503: [EventLogging Sanitization] Enable older-than-90-day purging of unsanitized EL database (event) in Hive.

@Tbayer

Just to double-check: The information in the documentation that "Sanitization happens right after events are generated (with a couple hours lag)" is still current, right? In that case I don't think this will be a concern (although we will need to update some queries - CCing @Groceryheist regarding ReadingDepth).

Thu, Nov 15, 2:02 AM · Analytics-EventLogging, Analytics-Kanban

Yesterday

mforns added a comment to T209503: [EventLogging Sanitization] Enable older-than-90-day purging of unsanitized EL database (event) in Hive.

Yup, keeping time range as a filter, but also potentially dropping other fields which we may not need, if any. I will coordinate with Miriam off-this-task and we will give you the clear signal soon.

Wed, Nov 14, 10:00 PM · Analytics-EventLogging, Analytics-Kanban
mforns updated subscribers of T209503: [EventLogging Sanitization] Enable older-than-90-day purging of unsanitized EL database (event) in Hive.

Also, @Neil_P._Quinn_WMF, @nettrom_WMF and @chelsyx, please check out this task. I don't recall there was any pending issue on your side before we can proceed, but just in case. Thanks!

Wed, Nov 14, 8:06 PM · Analytics-EventLogging, Analytics-Kanban
mforns added a comment to T209503: [EventLogging Sanitization] Enable older-than-90-day purging of unsanitized EL database (event) in Hive.

Also, @mpopov, we should probably fix the white-list to include the recent (or any other) renames to EL schema fields T209087, and backfill sanitization before we activate the purging script. Otherwise, data will be lost for those renamed fields.

Wed, Nov 14, 8:03 PM · Analytics-EventLogging, Analytics-Kanban
mforns added a comment to T209503: [EventLogging Sanitization] Enable older-than-90-day purging of unsanitized EL database (event) in Hive.

@leila Yes, makes sense to me! When you say "copy specific parts of the table" you mean specific time ranges, no? Sure, let's do that.

Wed, Nov 14, 7:52 PM · Analytics-EventLogging, Analytics-Kanban
mforns added a comment to T209503: [EventLogging Sanitization] Enable older-than-90-day purging of unsanitized EL database (event) in Hive.

@mpopov @Tbayer @leila @bmansurov
Oh forgot... Please, feel free to subscribe other people that you think might be interested in participating in this discussion to this task. Thanks!

Wed, Nov 14, 5:00 PM · Analytics-EventLogging, Analytics-Kanban
mforns updated subscribers of T209503: [EventLogging Sanitization] Enable older-than-90-day purging of unsanitized EL database (event) in Hive.

@leila @bmansurov
Hi! I'd also like to confirm with you guys that it's OK to activate the script that will delete all events older than 90 days from Hive's event database (unsanitized), so you'll be left only with the sanitized version of it in event_sanitized database. I believe the main point we wan to discuss here is how we keep unsanitized CitationUsage events while we figure out a way to handle those with Legal. On my end, I'd be happy to white-list all fields temporarily, given that we continue an active conversation with Legal to solve this in the short term. Would that be OK with you? Do you see any other concerns in activating the script regarding EL data belonging to the Research team?

Wed, Nov 14, 4:57 PM · Analytics-EventLogging, Analytics-Kanban
mforns updated subscribers of T209503: [EventLogging Sanitization] Enable older-than-90-day purging of unsanitized EL database (event) in Hive.

@mpopov @Tbayer
Hi! I'd like to confirm that you guys are OK with us activating the script that will delete all events older than 90 days in Hive's 'event' database (unsanitized data).
As we discussed in earlier threads:

  • All instances of the app_install_id field are being kept indefinitely in a sanitized form: salted hash with rotating salt every 3 months, see: T198426, T199902.
  • As requested per @mpopov, the old salt is being kept for 2 extra weeks after salt rotation (end of quarter) to allow for consistent backfilling of the event_sanitized database in case of issues: T199899, T199900.
Wed, Nov 14, 4:47 PM · Analytics-EventLogging, Analytics-Kanban
mforns triaged T209503: [EventLogging Sanitization] Enable older-than-90-day purging of unsanitized EL database (event) in Hive as Normal priority.
Wed, Nov 14, 4:34 PM · Analytics-EventLogging, Analytics-Kanban
mforns added a comment to T196066: Add prometheus metrics for varnishkafka instances running on caching hosts.

Thank you @elukey!

Wed, Nov 14, 2:53 PM · Analytics-Kanban, Traffic, Analytics, Operations

Tue, Nov 13

mforns moved T196066: Add prometheus metrics for varnishkafka instances running on caching hosts from Next Up to In Progress on the Analytics-Kanban board.
Tue, Nov 13, 2:18 PM · Analytics-Kanban, Traffic, Analytics, Operations
mforns moved T199836: [EL sanitization] Write and productionize script to drop partitions older than 90 days in events database from In Progress to In Code Review on the Analytics-Kanban board.
Tue, Nov 13, 2:18 PM · Patch-For-Review, Analytics, Analytics-Kanban

Mon, Nov 12

mforns renamed T209087: [EventLogging Sanitization] Update EL sanitization white-list for field renames in EL schemas from [EventLogging Sanitization] Update EL sanitization whit-elist for field renames in EL schemas to [EventLogging Sanitization] Update EL sanitization white-list for field renames in EL schemas.
Mon, Nov 12, 5:22 PM · Product-Analytics, Reading-analysis, Analytics

Fri, Nov 9

mforns added a comment to T209087: [EventLogging Sanitization] Update EL sanitization white-list for field renames in EL schemas.

@Tbayer
I should have looked who that schema's maintainer was, sorry for that.
I intended it just as a heads up, and thought of you, given that you've been our main point of contact in the past, regarding EL Reading schemas in general.
Please, feel free to reasign the task! I also added other analysts to the task so that they can chime in.

Fri, Nov 9, 4:59 PM · Product-Analytics, Reading-analysis, Analytics

Thu, Nov 8

mforns updated subscribers of T209087: [EventLogging Sanitization] Update EL sanitization white-list for field renames in EL schemas.
Thu, Nov 8, 6:52 PM · Product-Analytics, Reading-analysis, Analytics
mforns created T209087: [EventLogging Sanitization] Update EL sanitization white-list for field renames in EL schemas.
Thu, Nov 8, 6:51 PM · Product-Analytics, Reading-analysis, Analytics

Tue, Nov 6

mforns created T208872: [EventLoggingToDruid] Add explicit types to numeric dimensions so that they are ingested as such.
Tue, Nov 6, 6:04 PM · Patch-For-Review, Analytics-Kanban, Analytics

Fri, Nov 2

mforns added a subtask for T203669: Return to real time banner impressions in Druid: T208589: [EventLoggingToDruid] Add support for ingesting subfields of map columns.
Fri, Nov 2, 2:14 PM · Analytics-Kanban, User-Elukey, Analytics
mforns added a parent task for T208589: [EventLoggingToDruid] Add support for ingesting subfields of map columns: T203669: Return to real time banner impressions in Druid.
Fri, Nov 2, 2:14 PM · Analytics
mforns created T208589: [EventLoggingToDruid] Add support for ingesting subfields of map columns.
Fri, Nov 2, 2:14 PM · Analytics
mforns added a comment to T189475: Identify common abuse filters that affect translations.

Oh, so I should just simply create my own table like that, in an SQL script scheduled with report updater? I thought I'd need to do it with a DBA or something :)

Fri, Nov 2, 12:04 PM · Language-Team (Language-2018-October-December), CX-analytics

Mon, Oct 29

mforns added a comment to T189475: Identify common abuse filters that affect translations.

I think it looks good to start!

Mon, Oct 29, 8:23 PM · Language-Team (Language-2018-October-December), CX-analytics

Fri, Oct 26

mforns added a comment to T189475: Identify common abuse filters that affect translations.

@Amire80 In one of our last stand/up meetings, we brought up this task, and some of our team members recalled that Superset was not working properly with labs MySQL databases. We are making sure that's true. @JAllemandou, you said log db was working for you in Superset?

Fri, Oct 26, 10:38 AM · Language-Team (Language-2018-October-December), CX-analytics

Tue, Oct 23

mforns moved T206342: Finalize eventlogging to druid ingestion from In Code Review to In Progress on the Analytics-Kanban board.
Tue, Oct 23, 2:03 PM · Patch-For-Review, Analytics, Analytics-Kanban
mforns renamed T206342: Finalize eventlogging to druid ingestion from Finalize eventlogging to druid ingestion with a whitelist instead of a blacklist to Finalize eventlogging to druid ingestion.
Tue, Oct 23, 2:03 PM · Patch-For-Review, Analytics, Analytics-Kanban
mforns moved T166414: Explore NavigationTiming by faceted properties - EventLogging refine from In Code Review to Done on the Analytics-Kanban board.
Tue, Oct 23, 2:02 PM · Analytics-Kanban, Performance-Team (Radar), Analytics, Patch-For-Review
mforns moved T205562: Ingest data into druid for readingDepth schema from In Code Review to Done on the Analytics-Kanban board.
Tue, Oct 23, 2:02 PM · Readers-Web-Backlog (Tracking), Patch-For-Review, Analytics-Kanban, Analytics
mforns moved T202751: Ingest data from PageIssues EventLogging schema into Druid from Ready to Deploy to Done on the Analytics-Kanban board.
Tue, Oct 23, 2:02 PM · Patch-For-Review, Analytics-Kanban, Product-Analytics, Reading-analysis, Readers-Web-Backlog (Tracking), Page-Issue-Warnings, Analytics
mforns added a comment to T205562: Ingest data into druid for readingDepth schema .

I backfilled the last 3 months of data. This is now productionized!
Data will continue to be imported automatically every hour
(with a 5 hour lag to allow for previous collection and refinement of EL events into Hive).
Next steps are:

  • Write a comprehensive documentation about EventLoggingToDruid ingestion.
  • Remove the confusing Count metric from the datasource in Turnilo, or at least uncheck it by default (and make the default the actual eventCount).
  • Try to add a new metric to the datasource, eventCountPercentage, that normalizes eventCount splits by the total aggregate, so that time measure buckets become percentage-of-total values, instead of frequencies. This way they will not vary with throughput changes or seasonality, and will be a lot easier to follow. (not sure if this will be possible, though)

In any case these items will not be part of this task, I will tackle them as part of T206342.
Will move this task to Done in Analytics-Kanban.
Cheers!

Tue, Oct 23, 2:02 PM · Readers-Web-Backlog (Tracking), Patch-For-Review, Analytics-Kanban, Analytics
mforns added a comment to T202751: Ingest data from PageIssues EventLogging schema into Druid.

I backfilled the last 3 months of data. This is now productionized!
Data will continue to be imported automatically every hour
(with a 5 hour lag to allow for previous collection and refinement of EL events into Hive).
Next steps are:

  • Write a comprehensive documentation about EventLoggingToDruid ingestion.
  • Remove the confusing Count metric from the datasource in Turnilo, or at least uncheck it by default (and make the default the actual eventCount).
  • Try to add a new metric to the datasource, eventCountPercentage, that normalizes eventCount splits by the total aggregate, so that time measure buckets become percentage-of-total values, instead of frequencies. This way they will not vary with throughput changes or seasonality, and will be a lot easier to follow. (not sure if this will be possible, though)

In any case these items will not be part of this task, I will tackle them as part of T206342.
Will move this task to Done in Analytics-Kanban.
Cheers!

Tue, Oct 23, 2:02 PM · Patch-For-Review, Analytics-Kanban, Product-Analytics, Reading-analysis, Readers-Web-Backlog (Tracking), Page-Issue-Warnings, Analytics
mforns added a comment to T166414: Explore NavigationTiming by faceted properties - EventLogging refine.

I backfilled the last 3 months of data. This is now productionized!
Data will continue to be imported automatically every hour
(with a 5 hour lag to allow for previous collection and refinement of EL events into Hive).
Next steps are:

  • Write a comprehensive documentation about EventLoggingToDruid ingestion.
  • Remove the confusing Count metric from the datasource in Turnilo, or at least uncheck it by default (and make the default the actual eventCount).
  • Try to add a new metric to the datasource, eventCountPercentage, that normalizes eventCount splits by the total aggregate, so that time measure buckets become percentage-of-total values, instead of frequencies. This way they will not vary with throughput changes or seasonality, and will be a lot easier to follow. (not sure if this will be possible, though)

In any case these items will not be part of this task, I will tackle them as part of T206342.
Will move this task to Done in Analytics-Kanban.
Cheers!

Tue, Oct 23, 2:01 PM · Analytics-Kanban, Performance-Team (Radar), Analytics, Patch-For-Review
mforns renamed T206342: Finalize eventlogging to druid ingestion from Parametize eventlogging to druid ingestion with a whitelist instead of a blacklist to Finalize eventlogging to druid ingestion with a whitelist instead of a blacklist.
Tue, Oct 23, 2:00 PM · Patch-For-Review, Analytics, Analytics-Kanban

Fri, Oct 19

mforns added a project to T196066: Add prometheus metrics for varnishkafka instances running on caching hosts: Analytics-Kanban.
Fri, Oct 19, 3:19 PM · Analytics-Kanban, Traffic, Analytics, Operations
mforns moved T206342: Finalize eventlogging to druid ingestion from Done to In Code Review on the Analytics-Kanban board.
Fri, Oct 19, 2:41 PM · Patch-For-Review, Analytics, Analytics-Kanban
mforns claimed T196066: Add prometheus metrics for varnishkafka instances running on caching hosts.
Fri, Oct 19, 1:47 PM · Analytics-Kanban, Traffic, Analytics, Operations

Wed, Oct 17

mforns moved T199836: [EL sanitization] Write and productionize script to drop partitions older than 90 days in events database from Next Up to In Progress on the Analytics-Kanban board.
Wed, Oct 17, 3:16 PM · Patch-For-Review, Analytics, Analytics-Kanban
mforns moved T206342: Finalize eventlogging to druid ingestion from In Code Review to Done on the Analytics-Kanban board.
Wed, Oct 17, 3:15 PM · Patch-For-Review, Analytics, Analytics-Kanban

Tue, Oct 16

mforns created T207207: [EL2Druid] Make RefineTarget compatible with Druid and use it from EventLoggingToDruid.
Tue, Oct 16, 7:32 PM · Analytics
mforns added a comment to T207165: eventlogging_db_sanitization script failed.

@Gilles @elukey
Since we changed the EL blacklist that prevented schemas to be loaded to MySQL to a whitelist, new schemas are being loaded only to Hive by default.
So this scenario will happen frequently from now on.

Tue, Oct 16, 1:02 PM · Analytics-Kanban, Analytics

Oct 15 2018

mforns lowered the priority of T177927: Refactor kafka_config.rb and and kafka_cluster_name.rb in puppet to avoid explicit hiera calls from Normal to Low.
Oct 15 2018, 4:39 PM · Analytics, Traffic, Operations, User-Elukey
mforns lowered the priority of T198985: Alarms on Webrequest data processing and pageview volume from High to Normal.
Oct 15 2018, 4:38 PM · Analytics
mforns lowered the priority of T198910: Alarms in Eventlogging hadoop sanitization from High to Normal.
Oct 15 2018, 4:38 PM · Analytics
mforns closed T194702: Read Dashiki annotations into Wikistats as Declined.
Oct 15 2018, 4:37 PM · Analytics, Analytics-Wikistats
mforns closed T194702: Read Dashiki annotations into Wikistats, a subtask of T178015: Beta Release: Wikistats: support annotations in graphs, as Declined.
Oct 15 2018, 4:37 PM · Analytics-Kanban, Analytics, Analytics-Wikistats
mforns lowered the priority of T190700: Automate creation of sqoop list of wikis to import data for from sitematrix from High to Normal.
Oct 15 2018, 4:35 PM · Analytics, Analytics-Wikistats
mforns added a comment to T202751: Ingest data from PageIssues EventLogging schema into Druid.

Just a note that I'm deleting the Druid dataset temporarily, to apply some renames and productionize the final job.
Will be back up within 1 day hopefully.

Oct 15 2018, 2:01 PM · Patch-For-Review, Analytics-Kanban, Product-Analytics, Reading-analysis, Readers-Web-Backlog (Tracking), Page-Issue-Warnings, Analytics
mforns added a comment to T205562: Ingest data into druid for readingDepth schema .

Just a note that I'm deleting the Druid dataset temporarily, to apply some renames and productionize the final job.
Will be back up within 1 day hopefully.

Oct 15 2018, 2:01 PM · Readers-Web-Backlog (Tracking), Patch-For-Review, Analytics-Kanban, Analytics
mforns added a comment to T166414: Explore NavigationTiming by faceted properties - EventLogging refine.

Just a note that I'm deleting the Druid dataset temporarily, to apply some renames and productionize the final job.
Will be back up within 1 day hopefully.

Oct 15 2018, 2:01 PM · Analytics-Kanban, Performance-Team (Radar), Analytics, Patch-For-Review

Oct 11 2018

mforns moved T206342: Finalize eventlogging to druid ingestion from In Progress to In Code Review on the Analytics-Kanban board.
Oct 11 2018, 9:43 AM · Patch-For-Review, Analytics, Analytics-Kanban
mforns moved T166414: Explore NavigationTiming by faceted properties - EventLogging refine from Paused to In Code Review on the Analytics-Kanban board.
Oct 11 2018, 9:43 AM · Analytics-Kanban, Performance-Team (Radar), Analytics, Patch-For-Review
mforns added a comment to T202751: Ingest data from PageIssues EventLogging schema into Druid.

It occurred to me afterwards though that this might be because we were looking at the Count measure instead of the Event Count measure. What is the meaning of the former? Is this documented somewhere? Is it necessary to include in the Turnilo options? This will likely not be the last time that it causes that kind of confusion.

Oct 11 2018, 9:43 AM · Patch-For-Review, Analytics-Kanban, Product-Analytics, Reading-analysis, Readers-Web-Backlog (Tracking), Page-Issue-Warnings, Analytics
mforns added a comment to T202751: Ingest data from PageIssues EventLogging schema into Druid.

Another question: It seems that the dimensions lack e.g. Ua Browser Major and other user agent derived fields (that we have and use in e.g. https://turnilo.wikimedia.org/#pageviews_daily/ ). In the web team we often need these when evaluating EL data, see e.g. this example from earlier today: T204143#4650771 . Could they be added, analogously to the pageviews data?

Oct 11 2018, 9:33 AM · Patch-For-Review, Analytics-Kanban, Product-Analytics, Reading-analysis, Readers-Web-Backlog (Tracking), Page-Issue-Warnings, Analytics

Oct 9 2018

mforns updated subscribers of T202751: Ingest data from PageIssues EventLogging schema into Druid.

@mforns Great to hear that Druid already allows ingestion of array types! But just to clarify, it seems that this involves information reduction of some kind? At least I'm only seeing scalar values in the selection dropdown in Turnilo (below).

Oct 9 2018, 8:24 PM · Patch-For-Review, Analytics-Kanban, Product-Analytics, Reading-analysis, Readers-Web-Backlog (Tracking), Page-Issue-Warnings, Analytics

Oct 8 2018

mforns closed T73710: MD5 checksums missing from pagecounts-all as Declined.

Please reopen if still an issue.

Oct 8 2018, 4:50 PM · Analytics
mforns added a comment to T126281: [Regression] stats.wikipedia.org redirect no longer works ("Domain not served here").

@BBlack ping, bumping this up

Oct 8 2018, 4:45 PM · Traffic, Analytics, Operations, Regression, Analytics-Wikistats
mforns moved T126281: [Regression] stats.wikipedia.org redirect no longer works ("Domain not served here") from Operational Excellence to Radar on the Analytics board.
Oct 8 2018, 4:45 PM · Traffic, Analytics, Operations, Regression, Analytics-Wikistats
mforns lowered the priority of T200717: Scan npm dependencies for vulnerabilities from High to Low.
Oct 8 2018, 4:44 PM · Analytics
mforns added a project to T199693: Table view of timely results in wikistats 2 should be ordered in time descending: Analytics-Kanban.
Oct 8 2018, 4:43 PM · Patch-For-Review, Analytics-Kanban, Analytics
mforns lowered the priority of T199340: Decommission edit analysis dashboard from High to Normal.
Oct 8 2018, 4:41 PM · Patch-For-Review, Analytics-Kanban, Analytics, Contributors-Analysis, Product-Analytics
mforns placed T199340: Decommission edit analysis dashboard up for grabs.
Oct 8 2018, 4:41 PM · Patch-For-Review, Analytics-Kanban, Analytics, Contributors-Analysis, Product-Analytics
mforns lowered the priority of T194706: Organize annotations pages on meta by convention from High to Low.
Oct 8 2018, 4:31 PM · Analytics, Analytics-Wikistats
mforns closed T194702: Read Dashiki annotations into Wikistats, a subtask of T178015: Beta Release: Wikistats: support annotations in graphs, as Declined.
Oct 8 2018, 4:31 PM · Analytics-Kanban, Analytics, Analytics-Wikistats
mforns closed T194702: Read Dashiki annotations into Wikistats as Declined.

We just transfered annotations by hand already.

Oct 8 2018, 4:31 PM · Analytics, Analytics-Wikistats
mforns moved T194428: Pixel ratio messed up on Windows Chrome from Next Up to Paused on the Analytics-Kanban board.
Oct 8 2018, 4:30 PM · Analytics-Kanban, Analytics-Wikistats, Analytics
mforns added a project to T194428: Pixel ratio messed up on Windows Chrome: Analytics-Kanban.
Oct 8 2018, 4:30 PM · Analytics-Kanban, Analytics-Wikistats, Analytics
mforns lowered the priority of T194428: Pixel ratio messed up on Windows Chrome from High to Low.
Oct 8 2018, 4:30 PM · Analytics-Kanban, Analytics-Wikistats, Analytics
mforns lowered the priority of T190855: Enable automatic ingestion from eventlogging into druid for some schemas from High to Normal.
Oct 8 2018, 4:23 PM · Analytics-Kanban
mforns added a comment to T190855: Enable automatic ingestion from eventlogging into druid for some schemas.

This task refers to ingesting any number of schemas with just one job, that ideally reads schema registry / meta schema data.

Oct 8 2018, 4:23 PM · Analytics-Kanban
mforns lowered the priority of T188927: Changes to map projection in wikistats from High to Normal.
Oct 8 2018, 4:19 PM · Analytics-Wikistats, Analytics
mforns assigned T183183: Present Wikistats 2 charts for the period selected by the user. to Nuria.
Oct 8 2018, 4:18 PM · Analytics-Kanban, Analytics, Analytics-Wikistats
mforns set the point value for T183180: roadmap of migration to Wikistats 2 to 0.
Oct 8 2018, 4:17 PM · Analytics-Wikistats, Analytics
mforns raised the priority of T183180: roadmap of migration to Wikistats 2 from High to Needs Triage.
Oct 8 2018, 4:17 PM · Analytics-Wikistats, Analytics
mforns lowered the priority of T178015: Beta Release: Wikistats: support annotations in graphs from High to Normal.
Oct 8 2018, 4:16 PM · Analytics-Kanban, Analytics, Analytics-Wikistats
mforns added a project to T178015: Beta Release: Wikistats: support annotations in graphs: Analytics-Kanban.
Oct 8 2018, 4:15 PM · Analytics-Kanban, Analytics, Analytics-Wikistats
mforns set the point value for T178015: Beta Release: Wikistats: support annotations in graphs to 0.
Oct 8 2018, 4:15 PM · Analytics-Kanban, Analytics, Analytics-Wikistats
mforns raised the priority of T161149: Provide edit tags in the Data Lake edit data from Normal to Needs Triage.
Oct 8 2018, 4:13 PM · Analytics
mforns lowered the priority of T161149: Provide edit tags in the Data Lake edit data from High to Normal.
Oct 8 2018, 4:13 PM · Analytics
mforns changed the point value for T205665: Make wikistats UI family aware: you should be able to select a family in drop down menu and request available metrics for it from 8 to 3.
Oct 8 2018, 4:10 PM · Analytics-Kanban, Analytics-Wikistats, Analytics
mforns moved T206311: Make area metrics collapsible from In Code Review to Paused on the Analytics-Kanban board.
Oct 8 2018, 4:10 PM · Patch-For-Review, Analytics, Analytics-Wikistats, Analytics-Kanban
mforns set the point value for T205665: Make wikistats UI family aware: you should be able to select a family in drop down menu and request available metrics for it to 8.
Oct 8 2018, 4:09 PM · Analytics-Kanban, Analytics-Wikistats, Analytics
mforns set the point value for T205562: Ingest data into druid for readingDepth schema to 5.
Oct 8 2018, 4:09 PM · Readers-Web-Backlog (Tracking), Patch-For-Review, Analytics-Kanban, Analytics
mforns changed the point value for T202751: Ingest data from PageIssues EventLogging schema into Druid from 5 to 3.
Oct 8 2018, 4:09 PM · Patch-For-Review, Analytics-Kanban, Product-Analytics, Reading-analysis, Readers-Web-Backlog (Tracking), Page-Issue-Warnings, Analytics
mforns set the point value for T202751: Ingest data from PageIssues EventLogging schema into Druid to 5.
Oct 8 2018, 4:08 PM · Patch-For-Review, Analytics-Kanban, Product-Analytics, Reading-analysis, Readers-Web-Backlog (Tracking), Page-Issue-Warnings, Analytics
mforns set the point value for T205509: Replace the Analytics Hadoop coordinator - Hive/Oozie/etc... (hardware refresh) to 8.
Oct 8 2018, 4:08 PM · User-Elukey, Patch-For-Review, Analytics-Kanban, Analytics
mforns set the point value for T202490: Automate XML-to-parquet transformation for XML dumps (oozie job) to 8.
Oct 8 2018, 4:08 PM · Patch-For-Review, Analytics-Kanban, Research, Analytics
mforns set the point value for T202489: Copy monthly XML files from public-dumps to HDFS to 5.
Oct 8 2018, 4:07 PM · Patch-For-Review, Analytics-Kanban, Research, Analytics
mforns moved T205644: Improve Dashiki extension messaging from Ready to Deploy to Done on the Analytics-Kanban board.
Oct 8 2018, 4:07 PM · MW-1.32-notes (WMF-deploy-2018-10-16 (1.32.0-wmf.26)), Patch-For-Review, Analytics-Kanban, Analytics
mforns set the point value for T205644: Improve Dashiki extension messaging to 8.
Oct 8 2018, 4:06 PM · MW-1.32-notes (WMF-deploy-2018-10-16 (1.32.0-wmf.26)), Patch-For-Review, Analytics-Kanban, Analytics
mforns set the point value for T206171: Annotations need to use adjustedGraphData to 5.
Oct 8 2018, 4:06 PM · Patch-For-Review, Analytics-Kanban, Analytics
mforns set the point value for T205915: Top metrics. Implement failsafe mechanism for when current month computations are not available to 3.
Oct 8 2018, 4:05 PM · Patch-For-Review, Analytics-Wikistats, Analytics-Kanban
mforns set the point value for T205933: Add caching to wikistats 2 annotations request to 1.
Oct 8 2018, 4:04 PM · Patch-For-Review, Analytics-Kanban, Analytics
mforns closed T198258: [Wikistats2] Bug in Top Viewed Articles since bookmark routing changes as Declined.
Oct 8 2018, 4:04 PM · Analytics-Kanban, Analytics
mforns set the point value for T187414: Wikistats 2.0: "aa.wikipedia.org" exists and has data available, but marked "Invalid" to 5.
Oct 8 2018, 4:03 PM · Patch-For-Review, Analytics-Kanban, Analytics, Analytics-Wikistats
mforns set the point value for T198600: Correct data-removal jobs for mediawiki tables (public and private) to 13.
Oct 8 2018, 4:02 PM · Patch-For-Review, Analytics-Kanban, Analytics
mforns set the point value for T189882: Create reports in wikistats UI for "most prolific editors" (a.k.a "top contributors") to 5.
Oct 8 2018, 4:02 PM · Patch-For-Review, Analytics-Wikistats, Analytics-Kanban, Analytics
mforns set the point value for T205641: Add ability to bucketize integers as part of event ingestion to 5.
Oct 8 2018, 4:01 PM · Patch-For-Review, Analytics-Kanban, Analytics
mforns moved T206331: Git push and pull don't complete from Incoming to Radar on the Analytics board.
Oct 8 2018, 3:46 PM · User-Elukey, Analytics, Analytics-Wikistats
mforns moved T206277: Revision visibility change event sets a wrong performer from Incoming to Radar on the Analytics board.
Oct 8 2018, 3:46 PM · MW-1.32-notes (WMF-deploy-2018-10-16 (1.32.0-wmf.26)), Services (done), Core Platform Team Kanban (Done with CPT), Analytics, EventBus
mforns added a comment to T206269: [Hackathon] Consider converting AQS to TypeScript.

Dan, we saw this task during Groskin' and we'll leave it for when you're here.
Prepare to argue :]

Oct 8 2018, 3:45 PM · Services (watching), Analytics
mforns moved T206267: Create labeled dataset for bot identification from Incoming to Bots on the Analytics board.
Oct 8 2018, 3:44 PM · Analytics-Kanban, Research, Analytics