We have 2 Oozie jobs that load sampled webrequest data into Druid
An hourly one and a daily one.
We can group them both in the same DAG file.
Description
Description
Details
Details
Status | Subtype | Assigned | Task | ||
---|---|---|---|---|---|
Resolved | JArguello-WMF | T324485 [Airflow] Migrate Druid loading Oozie jobs - Parent task | |||
Resolved | mforns | T334106 [Airflow] Migrate webrequest sampled 128 druid loading jobs |
Event Timeline
Comment Actions
Change 911890 had a related patch set uploaded (by Mforns; author: Mforns):
[analytics/refinery@master] Migrate queries for webrequest_sampled_128 to /hql (Airflow/Spark3)
Comment Actions
And here's the airflow-dags MR:
https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/364
Comment Actions
Change 911890 merged by Milimetric:
[analytics/refinery@master] Migrate queries for webrequest_sampled_128 to /hql (Airflow/Spark3)
Comment Actions
Change 916537 had a related patch set uploaded (by Mforns; author: Mforns):
[analytics/refinery@master] Fix webrequest sampled 128 druid loading queries
Comment Actions
Change 916537 merged by Milimetric:
[analytics/refinery@master] Fix webrequest sampled 128 druid loading queries