We need to add non-PII fields from the new mediawiki_pref_diff schema (created in T261842) to the allowlist so that the data won't get purged after 90 days.
Later we decided to configure a new stream specifically for skin-related preference change events: mediawiki_skin_diff (T289622)
With the new instrumentation, here is our proposal of event sanitization.
Proposal
What NOT to keep
- user_agent_map
- http
- meta:
- request_id
- id
- domain
- dt
What to hash
- user_hash
What to keep
- initial_state
- final_state
- bucketed_user_edit_count
- normalized_host
Done
https://gerrit.wikimedia.org/r/c/analytics/refinery/+/737471