Page MenuHomePhabricator

Artificial spike in offset of unique devices from November to February 6th on wikidata
Closed, DeclinedPublic

Event Timeline

Nuria renamed this task from Artificial spike in offset of unique devices from November 14th to February 6th to Artificial spike in offset of unique devices from November 14th to February 6th on wikidata.May 17 2017, 9:39 AM

Offset spikes and underestimate decreases in the same amount (so total is same) . Best seen in wikidata mobile: https://goo.gl/o8oAzj

Nuria renamed this task from Artificial spike in offset of unique devices from November 14th to February 6th on wikidata to Artificial spike in offset of unique devices from November to February 6th on wikidata.May 17 2017, 10:18 AM
ema renamed this task from Artificial spike in offset of unique devices from November to February 6th on wikidata to Artificial spike in offset of unique devices from November 14th to February 6th on wikidata.May 17 2017, 10:19 AM
ema added a project: Traffic.
ema renamed this task from Artificial spike in offset of unique devices from November 14th to February 6th on wikidata to Artificial spike in offset of unique devices from November to February 6th on wikidata.May 17 2017, 10:20 AM

Summing up from IRC's conversation between @Nuria and @ema:

From the 2nd of November we start seeing a shift of the Unique Devices data per domain. The totals of Unique Devices are mostly not affected but the Unique devices computation is made of two parts: 1) (underestimate) users for whom the cookie is set + 2) (offset) users that have no cookies.

From about November 2nd to February 6th (2017) we see that the proportion of devices on the offset is much bigger than it was prior. And, the proportion of users on underestimate is smaller. What this tells us is that cookies seem to be expiring sooner than they should.

This matches up with varnish4 progressive rollout: https://gerrit.wikimedia.org/r/#/q/topic:varnish4-upgrade+(status:open+OR+status:merged)

Screen Shot 2017-05-17 at 12.23.55 PM.png (1×2 px, 283 KB)

This is the offset data for wikidata mobile, which represents devices coming w/o a last access cookie.

Nuria renamed this task from Artificial spike in offset of unique devices from November to February 6th on wikidata to Artificial spike in offset of unique devices from November to February 6th on wikidata.May 17 2017, 10:31 AM

@ema: has the way we compute nocookies flag on X-Analytics changed? It should take into account "all" cookies not just last access. I think that from the code in github {1] nothing has changed but asking just in case.,

This is probably unrelated but does this way of setting cookies (geoIP) [2] make them visible on the http.cookies object in varnish?

[1] https://github.com/wikimedia/puppet/blob/production/modules/varnish/templates/analytics.inc.vcl.erb#L171

[2]
https://github.com/wikimedia/puppet/blob/production/modules/varnish/templates/geoip.inc.vcl.erb#L179

ArielGlenn triaged this task as Medium priority.Jun 26 2017, 9:18 AM

Is this something we still need answers for, or have we just moved past it into a new normal?

Ladsgroup subscribed.

Three years have passed from this incident and as result, there's no data left from that time to examine. It's not happening anymore (https://stats.wikimedia.org/#/wikidata.org/reading/unique-devices/normal|line|3-month|~total|daily) and we have lots of new means to detect bot traffic these days. I close this as declined. Feel free to reopen.