Page MenuHomePhabricator

Wikidata's new Analytics repo needs work
Closed, ResolvedPublic1 Estimated Story Points

Description

We have a new Analytics repo to hold our HQL queries and potentially other future code/queries related to analyzing metrics and logs.

https://gitlab.wikimedia.org/repos/wikidata-platform/analytics

(It currently holds a query referred to by an Airflow DAG.)

It needs some cleanup.

AC:

  • Repo has a license
  • Repo has a README
  • (if it makes sense) Set up SQL linting
  • (after decision with team) protect the "main" branch

Details

Related Changes in GitLab:
TitleReferenceAuthorSource BranchDest Branch
Add Sqlfluff linting and a CI pipelinerepos/wikidata-platform/analytics!1lericksoncicdmain
Customize query in GitLab

Event Timeline

I've added the apache 2.0 license and a very small README. I'll work on the other items next week.

DSantamaria changed the task status from Open to In Progress.Apr 8 2026, 12:25 PM
DSantamaria added a project: Essential-Work.

This is all done. I added linting in a pipeline for MRs in the Analytics repo. I also added a license and readme. Turns out the "main" branch of the repo was already protected, too. So, all done with this cleanup.

lerickson set the point value for this task to 3.Apr 13 2026, 3:51 PM
trueg changed the point value for this task from 3 to 2.Apr 13 2026, 4:09 PM
lerickson changed the point value for this task from 2 to 1.Apr 13 2026, 6:24 PM