Page MenuHomePhabricator

GeoIP updates can users to jump to new country in geowiki files
Closed, DeclinedPublic

Description

Editors get geocoded again and again for each day they are
considered active.

So if the GeoIP gets updated, active editors with jump to a
different country from one day to the next.

So consider I edit 5 pages on enwiki on 2013-08-10 using an IP
address that's associated to Italy at that point in time.
On 2013-08-29 the IP->Country database gets updated, and that
previous IP is now marked as United States.
The scripts would count me active for United States from that day
on.

+------------+---------+---------------+

DateAustriaUnited States

+------------+---------+---------------+

2013-08-11*
2013-08-12*
...*
2013-08-28*
2013-08-29*
2013-08-30*
...*
2013-09-10*

+------------+---------+---------------+


Version: unspecified
Severity: normal

Details

Reference
bz54650

Event Timeline

bzimport raised the priority of this task from to Needs Triage.Nov 22 2014, 2:18 AM
bzimport set Reference to bz54650.
bzimport added a subscriber: Unknown Object (MLST).

Mhmm. That table should have been

+------------+---------+---------------+

DateItalyUnited States

+------------+---------+---------------+

2013-08-11*
2013-08-12*
...*
2013-08-28*
2013-08-29*
2013-08-30*
...*
2013-09-10*

+------------+---------+---------------+

That behavior sounds sane enough to me. Do you recommend something alternative?

(In reply to comment #2)

That behavior sounds sane enough to me. Do you recommend something
alternative?

I'd expect a table like:
+------------+---------+---------------+

DateItalyUnited States

+------------+---------+---------------+

2013-08-11*
2013-08-12*
...*
2013-08-28*
2013-08-29*
2013-08-30*
...*
2013-09-10*

+------------+---------+---------------+

If the 5 edits came from an Italian IP address when they were made, those 5
edits should count for Italy for the whole 30-day period, regardless where the
IP "wanders" afterwards.

Of course, I make 5 additional edits from the very same address on 2013-09-05
(that's after the Italy->US switch), the table should become:
+------------+---------+---------------+

DateItalyUnited States

+------------+---------+---------------+

2013-08-11*
2013-08-12*
...*
2013-08-28*
2013-08-29*
2013-08-30*
...*
2013-09-04*
2013-09-05**
2013-09-06**
...**
2013-09-10**

+------------+---------+---------------+

On 2013-09-10, I'd be considered active editor in Italy due to the first 5
editors, when the IP was associated to Italy. And additionally, I am considered
active editor for the United States due to the latter edits.

(That would match the behaviour we're seeing if I do the first 5 edits on an IP that's always Italy, and the second 5 edits on an IP that's always United States)

Aklapper triaged this task as Lowest priority.Nov 2 2015, 2:32 PM

Declining this Analytics-Visualization task, as this project tag was used for Limn. Limn has been superseded by Dashiki.
If this task still makes sense in terms of Dashiki, then please file a new task tagged with Data-Engineering-Dashiki (or reopen and update this task plus update the project tags of this task).
Thanks a lot! (Reference: T274647)