Page MenuHomePhabricator

Write a job entirely in Airflow with spark and/or sparkSQL
Closed, ResolvedPublic

Description

We should write and test a full Airflow DAG that mirrors a job we have in production.
It should have a SparkSQL (converted from Hive) query and a Spark job.
If we need, we should have 2 jobs that test both of these connectors.

Event Timeline

mforns triaged this task as High priority.Jun 28 2021, 3:47 PM
mforns moved this task from Incoming to Airflow on the Analytics board.
mforns updated the task description. (Show Details)

Change 702668 had a related patch set uploaded (by Mforns; author: Mforns):

[analytics/refinery@master] Add airflow DAG for anomaly detection (POC)

https://gerrit.wikimedia.org/r/702668

Change 707489 had a related patch set uploaded (by Ottomata; author: Ottomata):

[operations/puppet@production] airflow - set default smtp settings

https://gerrit.wikimedia.org/r/707489

Change 707489 merged by Ottomata:

[operations/puppet@production] airflow - set default smtp settings

https://gerrit.wikimedia.org/r/707489

Change 707517 had a related patch set uploaded (by Mforns; author: Mforns):

[analytics/refinery/source@master] Simplify RSVD anomaly detection job for Airflow POC

https://gerrit.wikimedia.org/r/707517

Change 708314 had a related patch set uploaded (by Ottomata; author: Ottomata):

[operations/puppet@production] Move airflow-analytics-test instance to an-test-client1001

https://gerrit.wikimedia.org/r/708314

Change 708314 merged by Ottomata:

[operations/puppet@production] Move airflow-analytics-test instance to an-test-client1001

https://gerrit.wikimedia.org/r/708314

Change 707517 merged by jenkins-bot:

[analytics/refinery/source@master] Simplify RSVD anomaly detection job for Airflow POC

https://gerrit.wikimedia.org/r/707517

Change 702668 abandoned by Mforns:

[analytics/refinery@master] Add airflow DAG for anomaly detection (POC)

Reason:

This code has been migrated to https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags

https://gerrit.wikimedia.org/r/702668