Page MenuHomePhabricator

Process experiment exposures and conversions and output them to a Presto table in a format that GrowthBook can ingest
Open, Needs TriagePublic

Description

Background

GrowthBook can connect to a handful of data sources that we support running in production out of the box: MariaDB, Postgres, and Presto. We will need a process (an Airflow DAG?) that transforms the data gathered using the schemas from T365796 and T365807 into a format the GrowthBook can ingest.

Resources
  1. https://docs.growthbook.io/warehouses/mysql-or-mariadb
  2. https://docs.growthbook.io/warehouses/postgres
  3. https://docs.growthbook.io/warehouses/prestodb-or-trino
AC
  • TBD

Event Timeline

phuedx added a subscriber: mforns.

@mforns: I've not written a task to create an Airflow DAG (or similar data processing pipeline) before. What kind of AC would you expect to see?