Page MenuHomePhabricator

Add mediawiki_skin_diff to the allowlist
Closed, ResolvedPublic

Description

We need to add non-PII fields from the new mediawiki_pref_diff schema (created in T261842) to the allowlist so that the data won't get purged after 90 days.

Later we decided to configure a new stream specifically for skin-related preference change events: mediawiki_skin_diff (T289622)

With the new instrumentation, here is our proposal of event sanitization.

Proposal

What NOT to keep

  • user_agent_map
  • http
  • meta:
    • request_id
    • id
    • domain
    • dt

What to hash

  • user_hash

What to keep

  • initial_state
  • final_state
  • bucketed_user_edit_count
  • normalized_host
Done

https://gerrit.wikimedia.org/r/c/analytics/refinery/+/737471

Event Timeline

Change 713644 had a related patch set uploaded (by MNeisler; author: MNeisler):

[analytics/refinery@master] Add the mediawiki_pref_diff event platform stream to the allowlist

https://gerrit.wikimedia.org/r/713644

Change 713644 abandoned by MNeisler:

[analytics/refinery@master] Add the mediawiki_pref_diff event platform stream to the allowlist

Reason:

Per discussions with Mikhail, we decided it would be worthwhile to configure a new stream specifically for skin-related preference change events sent by the VectorPrefDiffInstrumentation. This will help ensure that changes to other preferences are kept in a separate stream and different retention policies can be applied. I'm abandoning this change and will submit a new patch once the new stream is configured. Thanks all!

https://gerrit.wikimedia.org/r/713644

Reassigning to @jwang to do once unblocked on work that will be done in T289622

MNeisler added a subscriber: jwang.
jwang renamed this task from Add mediawiki_pref_diff to the allowlist to Add mediawiki_skin_diff to the allowlist.Nov 8 2021, 7:11 PM
jwang updated the task description. (Show Details)
jwang updated the task description. (Show Details)

Change 737471 had a related patch set uploaded (by Jenniferwang; author: Jenniferwang):

[analytics/refinery@master] Add skin_diff schema to sanitize allowlist

https://gerrit.wikimedia.org/r/737471

Hi @mforns, could you review the patch?

Change 737471 merged by Mforns:

[analytics/refinery@master] Add skin_diff schema to sanitize allowlist

https://gerrit.wikimedia.org/r/737471