Page MenuHomePhabricator

Easy dimensional data visualization
Open, LowPublic

Description

Example use case: @Isaac needed a dashboard for a dataset many dimensions. There was no easy way to set up a public visualization with our current tools, so he set up https://turnilo-public.wmcloud.org/ on a cloud instance he's managing.

I think it's a good solution for now, but I think the use case is generic and we should eliminate one-offs.
Dashiki was supposed to be our go-to tool for this kind of data, but we haven't followed through with it lately. This task should decide on a plan going forward, even if that's just accepting that we'll deal with one-off solutions until we can make a better plan.

Event Timeline

fdans triaged this task as Medium priority.Apr 19 2021, 4:34 PM
fdans moved this task from Incoming to Data Exploration Tools on the Analytics board.

We talked this over, some quick notes:

  • leaving this as-is for now, keeping an eye on how it scales. If we see problems we can split the data into "the last 6 months" and "all time but simplified schema"
  • discuss with our director when they start
  • options:
    • public Turnilo with care paid to caching so we don't overwhelm the druid cluster
    • clean up Dashiki / AirFlow
    • integrate in a new section of Wikistats

All of those options make sense to me long-term. For now, it's pretty easy to deploy a new Turnilo instance on Cloud VPS (all the requirements etc. are handled via a shell script) so I can always help people build their own if that's necessary before we arrive at a more general solution: https://github.com/wikimedia/research-api-endpoint-template/tree/turnilo

odimitrijevic lowered the priority of this task from Medium to Low.Nov 20 2021, 12:30 AM
odimitrijevic moved this task from Incoming (new tickets) to Visualize on the Data-Engineering board.