Add regexps that match the bots that follow the User-Agent policy {hawk}
Closed, Resolved | Public | 3 Estimated Story Points

Description

Currently, PageviewDefinition.java does not have regular expressions that match bots following the conventions specified in https://meta.wikimedia.org/wiki/User-Agent_policy. We should implement this and tag them as "bot" (or "spider" :-). It's low-hanging fruit, I think.
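As a rough illustration of the idea, here is a minimal sketch of how a User-Agent string could be checked against the policy's conventions. It is not the code from PageviewDefinition.java or from the eventual patch: the class name, method name, both patterns, and the rule "name contains 'bot' and some contact information is present" are assumptions made for this example, and the "user" fallback label is likewise hypothetical.

```java
import java.util.regex.Pattern;

// Hypothetical helper, not part of PageviewDefinition.java.
final class UserAgentPolicySketch {

    // Assumption: a policy-following bot names itself with "bot" (any case).
    private static final Pattern BOT_NAME =
        Pattern.compile("bot", Pattern.CASE_INSENSITIVE);

    // Assumption: contact info is a URL, an email address, or a "User:" page reference.
    private static final Pattern CONTACT =
        Pattern.compile("https?://\\S+|[\\w.+-]+@[\\w-]+(\\.[\\w-]+)+|User:\\S+");

    // Returns "bot" when both conventions are met, otherwise "user".
    static String agentType(String userAgent) {
        if (userAgent == null || userAgent.isEmpty()) {
            return "user";
        }
        boolean namedBot = BOT_NAME.matcher(userAgent).find();
        boolean hasContact = CONTACT.matcher(userAgent).find();
        return (namedBot && hasContact) ? "bot" : "user";
    }

    public static void main(String[] args) {
        // Example strings are illustrative only.
        String[] samples = {
            "CoolBot/0.0 (https://example.org/coolbot/; coolbot@example.org) generic-library/0.0",
            "Mozilla/5.0 (X11; Linux x86_64; rv:44.0) Gecko/20100101 Firefox/44.0"
        };
        for (String ua : samples) {
            System.out.println(agentType(ua) + " <- " + ua);
        }
    }
}
```

Requiring both a bot-like name and contact details keeps the sketch conservative; a real implementation would need to be checked against actual traffic before deciding how strict to be.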

Event Timeline

mforns raised the priority of this task to Needs Triage.
mforns updated the task description.
mforns added a project: Analytics.
mforns added a subscriber: mforns.
Milimetric triaged this task as Medium priority. Feb 4 2016, 6:08 PM
Milimetric moved this task from Incoming to Analytics Query Service on the Analytics board.
Milimetric set Security to None.
Milimetric added a subscriber: Milimetric.

Research whether the new expressions that we're adding have any matches on the cluster.

Nuria added a subscriber: Nuria.

Moving this task to kanban so it is up for grabs, given that it is pretty small.

Milimetric set the point value for this task to 3. Mar 3 2016, 5:38 PM

Change 275760 had a related patch set uploaded (by Madhuvishy):
Implement the Wikimedia User Agent policy in setting agent type

https://gerrit.wikimedia.org/r/275760

Change 275760 merged by Joal:
Implement the Wikimedia User Agent policy in setting agent type

https://gerrit.wikimedia.org/r/275760