Page MenuHomePhabricator

Update stream configuration to capture user id
Closed, ResolvedPublic

Description

When reviewing the stream configuration from T370907 as part of updating the instrumentation plan, I noticed that the provide_values list doesn't include the user ID (performer_id). Instead, it's configured to capture the user name (performer_name). Events generated on testwiki match this.

Since usernames can change but user IDs stay consistent, we prefer the latter in data analysis as it makes it easy to join across tables.

I noticed that the measurement specification doesn't specify this, and this wasn't listed in the instrument specification because the spec I had used as a template also didn't list it, which is probably why it wasn't noticed until now. Apologies for that!

Acceptance Criteria

  • Stream configuration also lists performer_id in provide_values

Event Timeline

KStoller-WMF moved this task from Inbox to Up Next (estimated tasks) on the Growth-Team board.

Change #1079475 had a related patch set uploaded (by Cyndywikime; author: Cyndywikime):

[operations/mediawiki-config@master] Update stream configuration to capture user id

https://gerrit.wikimedia.org/r/1079475

Change #1079475 merged by jenkins-bot:

[operations/mediawiki-config@master] GrowthExperiments: update stream configuration to capture user id

https://gerrit.wikimedia.org/r/1079475

Mentioned in SAL (#wikimedia-operations) [2024-10-15T13:04:09Z] <urbanecm@deploy2002> Started scap sync-world: Backport for [[gerrit:1079521|[wikidatawiki] Enable the CampaignEvents extension (T375411)]], [[gerrit:1079475|GrowthExperiments: update stream configuration to capture user id (T376833)]]

Mentioned in SAL (#wikimedia-operations) [2024-10-15T13:11:06Z] <urbanecm@deploy2002> cyndywikime, daimona, urbanecm: Backport for [[gerrit:1079521|[wikidatawiki] Enable the CampaignEvents extension (T375411)]], [[gerrit:1079475|GrowthExperiments: update stream configuration to capture user id (T376833)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug)

Mentioned in SAL (#wikimedia-operations) [2024-10-15T13:23:35Z] <urbanecm@deploy2002> Finished scap sync-world: Backport for [[gerrit:1079521|[wikidatawiki] Enable the CampaignEvents extension (T375411)]], [[gerrit:1079475|GrowthExperiments: update stream configuration to capture user id (T376833)]] (duration: 19m 25s)

Confirmed by generating events on testwiki and querying logged events in the Data Lake that performer.id is now captured correctly in the database table. Closing this as resolved, thanks everyone!