Page MenuHomePhabricator

Refine eventlogging pipeline should not refine data for domains that are not wikimedia's
Closed, DuplicatePublic5 Estimated Story Points

Assigned To
Authored By
Nuria
Apr 1 2019, 6:57 PM
Referenced Files
None
Tokens
"Mountain of Wealth" token, awarded by phuedx."Mountain of Wealth" token, awarded by Jdlrobson.

Description

Refine eventlogging pipeline should not refine data for domains that are not wikimedia's. It is not infrequent that other wikis like www.wikipedia-with-spam.org run a clone of our code and , as such, they endup running our instrumenting code and sending us their eventlogging events.

Those events should probably be dropped (ideally) before they get refined. This is somewhat related to: https://phabricator.wikimedia.org/T219162

and https://github.com/wikimedia/analytics-refinery/commit/58a03f623cd6124fd4de70cb8d7e739a90b58214