Page MenuHomePhabricator

Add mediawiki_product_metrics_incident_reporting_system_interaction to the sanitization allowlist
Open, MediumPublic

Description

incident_reporting_system_interaction was instrumented in T372823 using the metrics platform, and data will be deleted after 90 days by default.

We'd like to add non-PII fields from the incident_reporting_system_interaction data stream to the allowlist to retain this data longer.

This schema tracks user interactions with the incident reporting system. I've proposed fields below that need to be scrubbed following data retention guidelines but please let me know if you have any changes.

What to keep

action
action_context
action_source
action_subtype
funnel_event_sequence_position
funnel_name
meta.domain,
meta.dt,
page.title,
page.id,
page.namespace,
performer.language,
normalized_host

Fields that should be hashed

funnel_entry_token
performer.session_id
performer.id

What NOT to keep

agent
http
experiments
element_friendly_name,
element_id,
mediawiki,
performer.pageview_id
performer.groups
performer.edit_count
performer.active_browsing_session_token
user_agent_map
sample

Event Timeline

cchen triaged this task as Medium priority.
cchen moved this task from Triage to Kanban on the Product-Analytics board.
cchen edited projects, added Product-Analytics (Kanban); removed Product-Analytics.
cchen added a subscriber: kostajh.

This seems fine to me. Thanks!

Change #1117633 had a related patch set uploaded (by Conniecc1; author: Conniecc1):

[analytics/refinery@master] Add the Incident reporting System interaction schema to the allowlist

https://gerrit.wikimedia.org/r/1117633

@mforns - I've added you as a reviewer on this patch to add the mediawiki_product_metrics_incident_reporting_system_interaction schema to the allowlist. Here's a link to the instrumentation to fields. Please let me know if you have any questions or suggested revisions. Thank you!

@mforns - I've added you as a reviewer on this patch to add the mediawiki_product_metrics_incident_reporting_system_interaction schema to the allowlist. Here's a link to the instrumentation to fields. Please let me know if you have any questions or suggested revisions. Thank you!

Is this still pending?

Hi! I've been looking for the schema in the schemas-event-secondary repo, but could not find it. Can you please point me to it? 🙏

@kostajh passed me the the instrumentation spec. That's all I need for the review. 🙏

Change #1117633 abandoned by Conniecc1:

[analytics/refinery@master] Add the Incident reporting System interaction schema to the allowlist

https://gerrit.wikimedia.org/r/1117633