
QA Wikilambda instrumentation port to new core interactions metrics platform version
Open, In Progress, Medium, Public

Description

Objective: Analyze the quality of the data collected by the updated Metrics Platform-based Wikilambda instrument by comparing it with the data collected by the existing "monoschema" Wikilambda instrument, and determine whether the migration to Metrics Platform can proceed or whether there are issues that need to be resolved first.

Prerequisites for data QA

  • Mapping of old instrumentation to new instrumentation:
  • Specific QA needs have been identified and agreed upon
      • Analyst decides, in collaboration with engineers/PM, whether to QA the whole instrument or only its key parts, with the rest assumed to be okay.
      • Document the parts and the relevant queries:
          • (1) overall counts by action and sub-action (if applicable)
          • (2) counts by specific identifiers (e.g. by activity and browser session)
          • NOTE: these queries will almost always be limited by time in some way
  • New instrument has been deployed and activated (<link to phab task>)
      • Engineer has verified that events are flowing in and that the instrument is not producing schema validation errors.
          • Verify that events are flowing in:
              • EventGate Grafana dashboard
              • Kafka by Topic Grafana dashboard
              • EventStreams
  • New instrument doesn't have any schema validation errors
  • Analyst and PM have agreed on the priority of the QA work relative to other needs/requests (e.g. PM may need to wait longer for some analysis so that the analyst can do the QA work)
  • Documentation of old and new table names and date of deployment for analyst's reference:
Instrument  Table name                                        Stream deployed (if applicable)  Instrumentation Task
Old         event.wikifunctions_ui                            2023-07-26                       T297511
New         event.mediawiki_product_metrics_wikifunctions_ui  2024-04-11                       T350497
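
For reference, the time-limited "overall counts by action" query from the prerequisites could be sketched roughly as below. This is only an illustration: the table names are taken from the table above, but the column names (`action`, `dt`) and the helper `counts_by_action_query` are assumptions that need to be checked against the actual schemas before use.

```python
from datetime import date

# Table names come from the task description.
OLD_TABLE = "event.wikifunctions_ui"
NEW_TABLE = "event.mediawiki_product_metrics_wikifunctions_ui"


def counts_by_action_query(table: str, start: date, end: date,
                           action_col: str = "action",
                           ts_col: str = "dt") -> str:
    """Build a SQL string counting events per day and action in [start, end).

    `action_col` and `ts_col` are hypothetical column names; the real
    instruments may store these fields under different names or in a struct.
    """
    if start >= end:
        raise ValueError("start must be before end")
    return (
        f"SELECT TO_DATE({ts_col}) AS day, {action_col} AS action, "
        f"COUNT(*) AS n_events\n"
        f"FROM {table}\n"
        f"WHERE {ts_col} >= '{start.isoformat()}' "
        f"AND {ts_col} < '{end.isoformat()}'\n"
        f"GROUP BY TO_DATE({ts_col}), {action_col}\n"
        f"ORDER BY day, action"
    )


if __name__ == "__main__":
    # Same window for both instruments, starting at the new stream's deployment date.
    for table in (OLD_TABLE, NEW_TABLE):
        print(counts_by_action_query(table, date(2024, 4, 11), date(2024, 4, 18)))
        print()
```

Using the same half-open date window for both tables keeps the comparison limited to days on which both instruments were live.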

Data QA checklist

If more than one instrument is being migrated, these steps need to be completed for each one.

  • Count the daily number of schema validation errors for
    • Old instrument
    • New instrument
  • Compare counts of events by action and sub-action (as defined in the mapping from prerequisites)
    • If relevant, compare counts by specific identifier (as defined in the prerequisites)
  • Upload QA notebooks to GitLab, making sure to follow data publication guidelines
  • Document any issues or notable observations on this ticket
  • Resolve this ticket
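
The count comparison in the checklist above could be sketched as follows. This is a minimal illustration, not the agreed QA method: the function name `compare_counts`, the 5% tolerance, and the shape of the inputs (plain dicts of action name to event count over the same time window) are all assumptions, and the numbers in the usage example are made up.

```python
def compare_counts(old_counts, new_counts, mapping=None, tolerance=0.05):
    """Compare per-action event counts from the old and new instruments.

    `mapping` translates old action names to new ones (from the
    old-to-new instrumentation mapping); unmapped names are kept as-is.
    An action is flagged when the relative difference exceeds `tolerance`.
    """
    mapping = mapping or {}
    # Re-key the old counts by their new action names, summing collisions.
    old_mapped = {}
    for action, n in old_counts.items():
        key = mapping.get(action, action)
        old_mapped[key] = old_mapped.get(key, 0) + n

    rows = []
    for action in sorted(set(old_mapped) | set(new_counts)):
        old_n = old_mapped.get(action, 0)
        new_n = new_counts.get(action, 0)
        denom = max(old_n, new_n)
        rel_diff = abs(new_n - old_n) / denom if denom else 0.0
        rows.append({
            "action": action,
            "old": old_n,
            "new": new_n,
            "rel_diff": rel_diff,
            "flag": rel_diff > tolerance,
        })
    return rows


if __name__ == "__main__":
    # Made-up counts, for illustration only.
    old = {"view": 100, "edit": 50}
    new = {"view": 98, "edit": 40, "call": 5}
    for row in compare_counts(old, new):
        print(row)
```

Actions that appear in only one instrument get a count of 0 on the other side and are always flagged, which makes gaps in the mapping visible.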

NOTE: If any issues were identified that require fixing the new instrument, data QA of the fixed instrument will need to be filed as a new Phab task. Some of the checked prerequisites will carry over.

Event Timeline

MNeisler renamed this task from Q Wikilambda instrumentation port to new core interactions metrics platform version to QA Wikilambda instrumentation port to new core interactions metrics platform version. (Mar 1 2024, 3:50 PM)
MNeisler triaged this task as Medium priority.
MNeisler added a project: Product-Analytics.
Jdforrester-WMF changed the task status from Open to In Progress. (Mon, Apr 8, 5:56 PM)
Jdforrester-WMF assigned this task to MNeisler.
Jdforrester-WMF moved this task from Backlog to In Progress on the Abstract Wikipedia team board.

QA is currently pending deployment of a patch to add the stream to $wgEventLoggingStreamNames. The patch is expected to be deployed tomorrow, after which data should start to be available in event.mediawiki_product_metrics_wikifunctions_ui for QA.

MNeisler updated the task description.