Hypothesis statement: If we explore and define Wikimedia-specific methods for a unique device identification model, we will be able to define the collection and storage mechanisms that we can later implement in our anti-abuse workflows to enable more targeted blocking of bad actors.
Three main steps
- how will unique device identification help reducing the false positive rate caused from IP blocking
- what are the features we need to consider from all data we are collecting today that will help identifying unique device, and can we build a dataset with collection and storage mechanism (https://phabricator.wikimedia.org/T360195 shows additional data we just started to collect)
- can we build a model around features that deemed important
