Running the following query in Hive results in 20,814 results, which is odd to say the least:
USE event; SELECT * FROM citationusage WHERE event.session_token="1d8be34f432aaad1" AND year=2018 AND month=7;
The events are all happening on the same page and the action is also the same 'extClick' (click on an external link). The code that logs these events looks OK.
The user agent isn't identified as a bot, yet so many requests are coming from this single user in a short span of time. The only explanation I currently have is that these events are generated by a bot masquerading as a user and clicking on all external links.
What do you think? Have you seen something similar with other EventLogging schemas?
This is not a single case. We see other similar cases with other session_tokens:
session_token | events |
---|---|
some session token | 7817 |
some session token | 6797 |
some session token | 6115 |
some session token | 5757 |
some session token | 4246 |
some session token | 4241 |
some session token | 4171 |
some session token | 4011 |
some session token | 2479 |
some session token | 2470 |
some session token | 2117 |
some session token | 1762 |
some session token | 1755 |
some session token | 1704 |
some session token | 1663 |
some session token | 1625 |
some session token | 1520 |
some session token | 1494 |
some session token | 1270 |