The current Edit instrumentation is asking for schema 11448630 but getting 11319708 instead. This is causing events to go to the wrong place.
Description
Details
Related Objects
- Mentioned In
- rEEVLa0912d8602a1: api: Send Last-Modified header with revision timestamp
rMEXT97c4107840c3: Updated mediawiki/extensions Project: mediawiki/extensions/EventLogging…
rEEVL37effccb8446: RemoteSchema: Make most methods protected
rMEXT64ad445c88e8: Updated mediawiki/extensions Project: mediawiki/extensions/EventLogging…
rEEVL29f55b80a08b: ResourceLoaderSchemaModule: Use definition hash instead of fake timestamp
rEEVL3bd8a30c7940: ResourceLoaderSchemaModule: Use definition hash instead of fake timestamp
rEEVL63c1c83c12e2: ResourceLoaderSchemaModule: Use definition hash instead of fake timestamp
rMEXT3c2f35b4b529: Updated mediawiki/extensions Project: mediawiki/extensions/EventLogging… - Mentioned Here
- T93586: Make all Event Logging instrumentation target the same Schema:Edit revision
Event Timeline
The events coming into the system are from both schemas, numbers for today:
nuria@vanadium:~/VE-events-march-26$ wc -l Edit_events.txt
2427409 Edit_events.txt
nuria@vanadium:~/VE-events-march-26$ more Edit_events.txt | grep 11448630 | wc -l
1410050
So, as you can see this points to a client side caching issue in SOME clients, others are retrieving your last schema just fine.
I really do not think this ticket belongs in analytics. Probably @Krenair can shed lite into caching issues with mw code.
Or it could also be your newest code is not yet deploed everywhere and it is (lawfully) requesting the older schema, let me do some greps on vanadium data.
See @Krinkle's comment. It may be that the correct schema numbers are all coming from the server-side logging, which naturally is not affected by ResourceLoader/EventLogging interactions.
Never mind my two last comments there, looks like the issues with clienbt side caching are dependent on the js request a user has done, thus if requesting the Schema with newer modules that the user doesn't have the right schema is returned.
See @Krinkle's comment. It may be that the correct schema numbers are all coming from the server-side logging,
No, no, those are from client side only
Change 199986 had a related patch set uploaded (by Krinkle):
api: Send Last-Modified header with revision timestamp
Change 200009 had a related patch set uploaded (by Krinkle):
[WIP] RemoteSchema: Expose timestamp from API
Change 200028 had a related patch set uploaded (by Krinkle):
ResourceLoaderSchemaModule: Use definition hash instead of fake timestamp
Change 200034 had a related patch set uploaded (by Jforrester):
ResourceLoaderSchemaModule: Use definition hash instead of fake timestamp
Change 200035 had a related patch set uploaded (by Jforrester):
ResourceLoaderSchemaModule: Use definition hash instead of fake timestamp
Change 200028 merged by jenkins-bot:
ResourceLoaderSchemaModule: Use definition hash instead of fake timestamp
Change 200034 merged by jenkins-bot:
ResourceLoaderSchemaModule: Use definition hash instead of fake timestamp
Change 200035 merged by jenkins-bot:
ResourceLoaderSchemaModule: Use definition hash instead of fake timestamp
Yes! Thanks very much everyone. I'm running the queries against wikitext data for the first time now, pretty exciting.
Change 199986 merged by jenkins-bot:
api: Send Last-Modified header with revision timestamp