Page MenuHomePhabricator

Migrate WikibaseTermboxInteraction EventLogging Schema to new EventPlatform thingy
Open, HighPublic3 Estimated Story Points

Description

The WikibaseTermboxInteraction schema was created in context of T211015 and we probably want to keep it until at least the new Termbox is available on Desktop, whenever that is.

Thus we should make sure it is migrated to the new platform.

For more context see the parent ticket.

(One of?) the Grafana boards with data from this schema https://grafana.wikimedia.org/d/000000018/eventlogging-schema?orgId=1&var-schema=WikibaseTermboxInteraction

  • 1. Pick a schema to migrate
  • 2. Create a new task to track this schema's migration
  • 3. Create /analytics/legacy/ schema
  • 4. Edit-protect the metawiki Schema page at https://meta.wikimedia.org/wiki/Schema:<SchemaName>
  • 5. Manually evolve the Hive table to use new schema
  • 6. Add entry to wgEventStreams, wgEventLoggingStreamNames and wgEventLoggingSchemas in operations/mediwiki-config
  • 7. Once the legacy stream's data is fully produced through EventGate, switch to using Refine job that uses schema repo instead of meta.wm.org
  • 8. Edit the producer extension.json and set EventLoggingSchemas to the new schema URI
  • 9. Once the producer extension.json is fully deployed, edit wgEventLoggingSchemas in operations/mediawiki-config InitialiseSettings.php and remove the schema's entry.
  • 10. Mark the schema as migrated in the EventLogging Schema Migration Audit spreadsheet

Event Timeline

Restricted Application added a subscriber: Aklapper. · View Herald Transcript
odimitrijevic triaged this task as High priority.
odimitrijevic moved this task from Incoming to Event Platform on the Analytics board.

@Michael Hi!

I'm going to migrate this schema during the next couple weeks.
I need to askk you a couple questions about it.

  1. Do you need to collect IP or geocode information together with this schema? The legacy EventLogging system collects them by default. But in the new system we only collect them if necessary. Please, let me know!
  2. Is the instrumentation that generates this data in the front-end (JS)? Or is it in the back end (PHP)?

Cheers!

@Michael Hi!

I'm going to migrate this schema during the next couple weeks.
I need to askk you a couple questions about it.

  1. Do you need to collect IP or geocode information together with this schema? The legacy EventLogging system collects them by default. But in the new system we only collect them if necessary. Please, let me know!

I don't think so.

  1. Is the instrumentation that generates this data in the front-end (JS)? Or is it in the back end (PHP)?

As you can see from the only patch that is linked to the task mentioned in the description about the creation of this schema, it is tracked in the browser javascript with mw.track. Please see the linked task T211015 for more details.

Change 721010 had a related patch set uploaded (by Mforns; author: Mforns):

[schemas/event/secondary@master] Add WikibaseTermboxInteraction to analytics/legacy

https://gerrit.wikimedia.org/r/721010

Change 721010 merged by jenkins-bot:

[schemas/event/secondary@master] Add WikibaseTermboxInteraction to analytics/legacy

https://gerrit.wikimedia.org/r/721010

EChetty set the point value for this task to 3.Thu, Jun 30, 5:05 PM