Page MenuHomePhabricator

[Analytics] [Request] Set up GraphQL superset dashboard
Open, Stalled, Needs TriagePublic

Description

Wikidata Analytics Request

This task was generated using the Wikidata Analytics request form. Please use the task templates linked on our project page to create tasks for the team. Thank you!

Purpose

Please provide as much context as possible as well as what the produced insights or services will be used for.

The Wikibase Reuse team will release GraphQL v1 in Feb 2026 and we'd like to have a superset dashboard to track the usage.

Desired Outputs

The desired outputs of this task are listed as check boxes and confirmed as being finished below.

The following metrics should be available:

  • Total number of requests monthly
  • Total number of distinct actors monthly
  • Total number of requests per distinct actor monthly

Deadline

Please make the time sensitivity of this request clear with a date that it should be completed by. If there is no specific date, then the task will be triaged based on its priority.

28.2.2026


Information below this point is filled out by the task assignee.

Assignee Planning

Sub Tasks

A full breakdown of the steps to complete this task.

  • Define terms needed for the task
    • actors: distinct user agents and ips
    • GraphQL endpoint: uri_query LIKE '?action=wbgraphql%' (comes from T417026)
  • Write create table and job queries
    • Jobs are one for deriving metadata that deletes data after two months and another for monthly metrics
    • Queries written except for information needed above, which will take 2 mins to add in :)
  • Test queries on local schema (can be done once GraphQL endpoint is released)
  • Write DAG for metric generation orchestration (data-engineering/airflow-dags/merge_requests/1983)
  • Deploy DAG
  • Add charts to WD API metrics dashboard (three columns for comparison with REST and Action APIs)

Estimation

Estimate: 1-2 days
Actual:

Data

The tables that will be referenced in this task and the samples from them that will be used.

  • wmf.webrequest
    • sample_size: No need to sample as the request load is very low (yet to be released)

Notes

Things that came up during the completion of this task, questions to be answered and follow up tasks.

  • Note

Event Timeline

AndrewTavis_WMDE updated the task description. (Show Details)
AndrewTavis_WMDE updated the task description. (Show Details)

MR for the DAG is up in !1983, which we'll bring in post the queries in analytics/hql/airflow_jobs/wd_graphql_api being finalized and tested :)

AndrewTavis_WMDE changed the task status from Open to In Progress.Tue, Feb 10, 3:47 PM
AndrewTavis_WMDE updated the task description. (Show Details)
AndrewTavis_WMDE updated the task description. (Show Details)
AndrewTavis_WMDE changed the task status from In Progress to Stalled.Tue, Feb 17, 10:25 AM
AndrewTavis_WMDE moved this task from In Progress to Stalled on the Wikidata Analytics (Kanban) board.

Stalling for now until work on T417026 is finalized to and the endpoint is set. Once we know it, we'll be able to finish the queries, test them and deploy the DAG :)

action=wbgraphql is what we're going with for now!

Thanks @Ifrahkhanyaree_WMDE! I've switched over the queries to now have uri_query LIKE '?action=wbgraphql%'. I'm ready to test them and deploy once we have a release of the GraphQL endpoint! 😊