We have 3 Oozie jobs that load pageview data to Druid:
- Pageview druid hourly
- Pageview druid daily
- Pageview druid monthly
We can group them in 1 DAG file with:
- hurly DAG
- daily DAG
- monthly DAG
We have 3 Oozie jobs that load pageview data to Druid:
We can group them in 1 DAG file with:
Subject | Repo | Branch | Lines +/- | |
---|---|---|---|---|
Migrate pageview druid load hql queries to Airflow | analytics/refinery | master | +515 -0 |
Title | Reference | Author | Source Branch | Dest Branch | |
---|---|---|---|---|---|
Migrate pageview druid loading jobs to airflow | repos/data-engineering/airflow-dags!365 | ebysans | T334104_pageview_druid | main |
Status | Subtype | Assigned | Task | ||
---|---|---|---|---|---|
Resolved | JArguello-WMF | T324485 [Airflow] Migrate Druid loading Oozie jobs - Parent task | |||
Resolved | Snwachukwu | T334104 [Airflow] Migrate pageview-related Druid loading Oozie jobs |
We should add the referer information to the Druid datasources as requested in https://phabricator.wikimedia.org/T331028 !
Change 910520 had a related patch set uploaded (by Snwachukwu; author: Snwachukwu):
[analytics/refinery@master] Migrate pageview druid load hql queries to Airflow
ebysans opened https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/365
Migrate pageview druid loading jobs to airflow
milimetric merged https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/365
Migrate pageview druid loading jobs to airflow
Change 910520 merged by Snwachukwu:
[analytics/refinery@master] Migrate pageview druid load hql queries to Airflow