Page MenuHomePhabricator

Investigate June Unique devices increase of 170% for wikidata
Closed, ResolvedPublic

Event Timeline

So, thanks to @JAllemandou for reminding me that turnilo should be the thing I use to investigate this.

It looks like the maint spike was between May 30th and June 3rd (inclusive) with more requests that are part of the spike tailing off until June 20th.
This can be seen in the graph below, which also shows that the spike came from a single country.

image.png (960×1 px, 96 KB)

turnilo link

Looking at this further it would appear that all requests in the spike came from a single ISP and from a few different ~5 IPs.
The UAs for the requests seems to all be or mostly be unique with various version numbers within the UAs being different.
This results in the requests being detected as different devices hence the spike.

Perhaps some further investigation is needed

Addshore claimed this task.

It looks like this might be some bot or script scraping stuff that isn't identified as a script in any way, and that is rotating UAs...
The requests just seem to be to random entities, some existing, some not existing, but nothing more fancy than that.

Screen Shot 2018-07-13 at 12.43.07 PM.png (1×2 px, 342 KB)

It coincides with a spike of pageviews from thailand, that seems like a bot accessing the desktop size, will investigate a bit as to whether this bot was accepting cookies.

Bot did not accepted cookies, user agent was changing slightly, in 1000 records when this event is happening 995 are part of event and of those about 200 are unique user agents. Still the IP is the same and the volumes of requests so high that I am wondering how these requests did not get throttled. Will look at throttling limits.

@Nuria should I file a follow up ticket about adding an annotation to the graph explaining this spike?

yes , please, I listed issue on dataset page:
We do not yet have annotations in wikistats (we will at the end of quarter) but when we do this is a good one to list. Moving ticket to bot work.

Addshore claimed this task.

Looks great! :)

image.png (469×713 px, 43 KB)