Wikidata Analytics Request
This task was generated using the Wikidata Analytics request form. Please use the task template linked on our project page to create issues for the team. Thank you!
Purpose
Please provide as much context as possible as well as what the produced insights or services will be used for.
As Wikidata Product Managers, we would like to understand better how Wikidata's new REST API is used. T366621: [Analytics] Analysis of REST API user agents for May 2024 was very useful for this and we would like to get this information regularly.
Note to Lydia: To improve the quality of the data further, we would need to take steps to strengthen our users' compliance with the user agent policy (T329044).
Specific Results
Please detail the specific results that the task should deliver.
Wikidata PMs in the nda LDAP group (with PII access) receive the information from T366621: [Analytics] Analysis of REST API user agents for May 2024 once per quarter (if a manual step is involved) or once per month (if this can be fully automated).
Optional: add a split by read/write operations (but don't bother if this is complicated)
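One possible way to approach the optional read/write split is to classify each request by its HTTP method. The sketch below is an assumption about how this could be done, not a description of the existing analysis: the function names and the "other" bucket are hypothetical, and it assumes the request data exposes the HTTP method of each request.

```python
from collections import Counter
from typing import Iterable

# Assumption: safe methods count as reads, state-changing methods as writes.
READ_METHODS = {"GET", "HEAD", "OPTIONS"}
WRITE_METHODS = {"POST", "PUT", "PATCH", "DELETE"}


def classify_operation(http_method: str) -> str:
    """Return "read", "write" or "other" for a given HTTP method."""
    method = http_method.strip().upper()
    if method in READ_METHODS:
        return "read"
    if method in WRITE_METHODS:
        return "write"
    return "other"


def split_by_operation(http_methods: Iterable[str]) -> Counter:
    """Count requests per operation type (hypothetical helper)."""
    return Counter(classify_operation(m) for m in http_methods)
```

For example, `split_by_operation(["GET", "PATCH", "GET"])` counts two reads and one write.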
Desired Outputs
Please list the desired outputs of this task.
An easy-to-access spreadsheet or CSV file (anything relying on our normal UCS, SUL or Wikitech logins will do, but it should not require Kerberos authentication)
Deadline
Please make the time sensitivity of this task clear with a date that it should be completed by. If there is no specific date, then the task will be triaged based on its priority.
DD.MM.YYYY
Information below this point is filled out by the task assignee.
Assignee Planning
Sub Tasks
A full breakdown of the steps to complete this task.
- Write a DAG to call a query for Wikidata REST API user agents
- Write a query for user agents and total requests for a given month and the month before
- Create the needed table and make sure that the analytics-wmde user has access to it
- Test the query in a Spark SQL instance to make sure it runs properly
- Test the DAG to make sure it finishes properly
- Send out to Lydia and Ifrah for confirmation (ensure that Ifrah gets the right permissions first, if necessary)
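The month-over-month logic behind the query step can be sketched in plain Python. This is only an illustration under stated assumptions: `RequestRow`, `previous_month` and `monthly_totals` are hypothetical names, the rows stand in for whatever the real table provides, and the actual task would express this as a Spark SQL query called from the DAG.

```python
from collections import defaultdict
from typing import Iterable, List, NamedTuple, Tuple


class RequestRow(NamedTuple):
    """Hypothetical row shape: one request count per user agent and month."""
    year: int
    month: int
    user_agent: str
    requests: int


def previous_month(year: int, month: int) -> Tuple[int, int]:
    """Return the (year, month) before the given one, handling January."""
    if not 1 <= month <= 12:
        raise ValueError(f"month must be in 1..12, got {month}")
    return (year - 1, 12) if month == 1 else (year, month - 1)


def monthly_totals(
    rows: Iterable[RequestRow], year: int, month: int
) -> List[Tuple[str, int, int]]:
    """Total requests per user agent for the given month and the month before.

    Returns (user_agent, current_total, previous_total) tuples, sorted by
    current_total descending, then by user agent name.
    """
    prev = previous_month(year, month)
    totals = defaultdict(lambda: {"current": 0, "previous": 0})
    for row in rows:
        key = (row.year, row.month)
        if key == (year, month):
            totals[row.user_agent]["current"] += row.requests
        elif key == prev:
            totals[row.user_agent]["previous"] += row.requests
    result = [(ua, c["current"], c["previous"]) for ua, c in totals.items()]
    return sorted(result, key=lambda t: (-t[1], t[0]))
```

Keeping the previous month next to the current one in each output row makes it easy to spot user agents whose traffic changed sharply between reports.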
Estimation
Estimate: Half a day
Actual: One day (mostly waiting for it to finish while working on other things)
Data
The tables that will be referenced in this task.
Notes
Things that came up during the completion of this task, questions to be answered and follow up tasks.
- Note