Page MenuHomePhabricator

Add regexps that match the bots that follow the User-Agent policy {hawk}
Closed, ResolvedPublic3 Estimated Story Points

Description

Currently, the PageviewDefinition.java does not have regular expressions that match the bots that follow the conventions specified in https://meta.wikimedia.org/wiki/User-Agent_policy. We should implement this, and tag them as "bot" (or "spider" :-), it's a low hanging fruit I think.

Event Timeline

mforns raised the priority of this task from to Needs Triage.
mforns updated the task description. (Show Details)
mforns added a project: Analytics.
mforns subscribed.
Milimetric triaged this task as Medium priority.Feb 4 2016, 6:08 PM
Milimetric moved this task from Incoming to Analytics Query Service on the Analytics board.
Milimetric set Security to None.
Milimetric subscribed.

research whether the new expressions that we're adding have a match on the cluster

Nuria subscribed.

Moving this task to kanban so it is up for grabs given that it is pretty small

Milimetric set the point value for this task to 3.Mar 3 2016, 5:38 PM

Change 275760 had a related patch set uploaded (by Madhuvishy):
Implement the Wikimedia User Agent policy in setting agent type

https://gerrit.wikimedia.org/r/275760

Change 275760 merged by Joal:
Implement the Wikimedia User Agent policy in setting agent type

https://gerrit.wikimedia.org/r/275760