I think it's blocking on the URL path /beacon/event. See https://easylist.to/easylist/easyprivacy.txt and T220627#5638168.
- Queries
- All Stories
- Search
- Advanced Search
- Transactions
- Transaction Logs
Advanced Search
Dec 4 2019
Dec 2 2019
Just to note, we have the same problem for the new CentralNotice data pipeline, which uses EventLogging, as compared to the old pipeline, which uses a custom call to beacon/impression not blocked by AdBlock. In case it's useful: see T236834#5696044 (and the two comments after that).
In T236835#5682487, @Isaac wrote:I'm not sure if this is at all pertinent, but we spent some time trying to debug a situation where we were missing about 10% of EventLogging data from people taking a survey served via the QuickSurveys tool. In essence, for 10% of readers, the tool was displaying the surveys correctly and we knew they had taken the survey but we never get the EventLogging that we should have. We ultimately decided that it was a mixture of adblock (which has settings that allow the javascript etc. to show the survey but blocks EventLogging) and, in our case, people could right-click off the page to take the survey and that wasn't triggered EventLogging as expected.
Windows for log samples:
Nov 27 2019
Differences found in orphaned old pipeline events:
- 28% orphaned GET requests vs. 10% overall GET requests
- 63% orphaned Windows requests vs. 39% overall Windows requests
Just did one-to-one merges using web request logs in Hive, in both directions, using fairly large samples in both cases.
Nov 25 2019
Here are some results:
Nov 21 2019
In T238560#5682048, @Nuria wrote:could you give some examples of issues you expect to see and troubleshoot (maybe some tickets from the past?)?
Nov 20 2019
Hi all! Congrats to all for your work on this...
Nov 19 2019
Nov 18 2019
Windows for log samples:
Nov 14 2019
Nov 12 2019
Here's a summary of the situation regarding discrepancies between the new and old pipelines for Landing Pages (includes also measures obtained in T235284).
Nov 11 2019
FRUEC currently accepts LandingPage events with no language property in the JSON input. However, if the property exists and its value is an empty string, the event is marked invalid and not counted. However, in such a case, the legacy system (DjangoBannerStats) defaults the language to 'en'.
Nov 8 2019
Nov 6 2019
Nov 4 2019
Matching on landing page, country, language, and other event fields, with old log timestamps always earlier than the new log timestamps, by at most 30 seconds, we get:
Trying different options for better matching... Just a small improvement by allowing new log timestamps that are closest to the new old log ones, that is, removing the requirement that the new log event be after the old log one. With this method, we get 136 unmatched events in new log, and 510 in the old one.
I've dug into this some more, improving filtering compared to what was done for T235284. I've got some more clarity about what the differences are, but it's still pretty ugly.
Nov 1 2019
In T235845#5625873, @Christine_Domgoergen_WMDE wrote:Okay, great, thank you for the update! @GoranSMilovanovic @Tim_WMDE Does it work now?
Oct 31 2019
Scheduled to deploy in a few minutes...
In T235845#5623892, @Christine_Domgoergen_WMDE wrote:Great, thank you everyone! @GoranSMilovanovic Can you now get the impression data or do you need anything else?
Thanks so much for flagging this, and many apologies for the noise!! There's now a patch in review for T236627: CentralNotice: Adapt impression event schema for campaign fallback.
Here's a patch!!! Apologies for the trouble...!
Oct 30 2019
Oct 29 2019
Note: the second bullet point from the task description has been spun out as T236845.
Oct 28 2019
I've dug deeper into the Landing Page discrepancy, comparing sequences of log entries from the same IPs in both old and new pipelines.
Oct 27 2019
Oct 25 2019
So actually it seems the problem is not duplicate entries in the old logs, but rather a few IP addresses hammering on the site using some sort of script, which doesn't run client-side code, so it doesn't generate any client-side events.
Made some progress on figuring out what the difference is between old and new logs. It looks like there are a lot of duplicate entries in the old log files.