Page MenuHomePhabricator

Check Data inconsistency Banner_close_events
Closed, ResolvedPublic1 Estimated Story Points


There might be an error in the export of #T212840. The numbers of banner_close_events are significantly lower than in another export (see: #T211852) of the same data. An analysis of a daily x-rate (see diagram) suggests that the data in #T212840 is too low.

Event Timeline

Hm, this is the query I used to export the data, I don't really see any way the data could have been incomplete from this. Maybe @gabriel-wmde / @Tonina_Zhelyazkova_WMDE can have another look at this and see if they notice anything odd:

hive -e 'SELECT year, month, day, event.bannerName, event.finalSlide, event.slidesShown, useragent.browser_family, useragent.browser_major, useragent.device_family, useragent.os_family, useragent.os_major, wiki FROM event.wmdebannerevents WHERE year = 2018 AND event.bannerAction = "banner-closed";' | sed 's/[\t]/,/g' > ~/banner_close_events_2018.csv

I have done another export and sent another email with the latest dump. It has a bit different file size but it may be that I cut off a tiny bit myself last time. Please check if that resolves the issue.

Other than that, my best guess would be that maybe the data was too much and the software you used (Microsoft Office?) to work with it cut parts of it off. I cannot open the complete file in LibreOffice, for example.

The data is now consistent with the older export! Thanks!

Tim_WMDE set the point value for this task to 1.Apr 8 2019, 11:27 AM

Okay, not sure what happened there but that's good to hear!