Page MenuHomePhabricator

Create a method to automatically sync data from the database to the sheet
Open, In Progress, Needs TriagePublicSpike

Description

Context

the metrics application collects and processes data from known self-hosted wikibase instances. our metrics sheet allows mere mortals to be able to access this data, conduct analyses, and build dashboards to keep track of key metrics. the two are currently entirely disconnected except for when someone manually moves data between them.

Goal

automate the movement of key data from the database into google sheets so that the users of the sheet can expect to find up-to-date information there.

Acceptance Criteria

  • update existing sheet w latest data (decide on some reasonable period, something in the range of daily is probably fine)

the data to add:

  • total items, lexemes, properties, triples
  • mediawiki major version, whether it's LTS
  • can we find the sparql endpoint? y/n
  • user counts for last month (human, bot, total log)
  • connectivity stats (specifically: Total Connections, avg distance, Connectivity)

(if you discover any key data in the db that is missing please consider noting it in the above list and including it.)

also add, but likely need their own tabs in the sheet:

  • property popularity
  • extension data

Note: this is the automation equivalent to this manual step: T369909