Present "Notebooks in Airflow" solution to PA and discuss ownership of different steps
Open, Needs Triage, Public

Description

  • Ownership Map -> What we will own vs What others will own
    • Show the idea (a NotebookOperator in an Airflow DAG, using a single conda env automatically packaged by CI) to Product Analytics and ask whether it would work for them.
    • Discuss who would take care of writing DAGs, testing DAGs, reviewing DAG code, merging, deploying Airflow, receiving alerts, troubleshooting failed DAGs, updating the notebooks' conda env with new libraries, etc.

Details

Other Assignee
mforns

Event Timeline

EChetty triaged this task as High priority.
EChetty updated Other Assignee, added: mforns.
EChetty moved this task from To be prioritised to Discussed (Radar) on the Data Pipelines board.
Aklapper added a subscriber: EChetty.

Removing inactive assignee (please do so as part of team offboarding!).

mpopov raised the priority of this task from High to Needs Triage.
mpopov subscribed.

The description probably needs to be updated, since the proposal we're going with covers a lot of the ownership questions. There are still open questions, though, around what notebook DAGs look like, where artifacts are stored, how notebooks are versioned, private repos, etc.