
[Airflow] Migrate Netflow Druid loading jobs
Closed, Resolved | Public | 9 Estimated Story Points

Description

We have 6 systemd timers that trigger Druid ingestion of netflow-related datasets.
We should migrate them to Airflow.

The systemd timers are:

  1. netflow druid hourly loading
  2. netflow druid daily loading
  3. netflow druid sanitization
  4. network_flows_internal druid hourly loading
  5. network_flows_internal druid daily loading
  6. network_flows_internal druid sanitization

We can group them into 2 DAG files:

  1. Netflow DAG file, containing:
    • Hourly DAG
    • Daily DAG
    • Sanitization DAG
  2. Network_flows_internal DAG file, containing:
    • Hourly DAG
    • Daily DAG
    • Sanitization DAG
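
The grouping above can be sketched in plain Python, without importing Airflow, to check that the six timers map onto two DAG files with three DAGs each. The file names and DAG ids below are hypothetical; the task does not specify them.

```python
# Hypothetical naming scheme; the task does not define file names or DAG ids.
DATASETS = ["netflow", "network_flows_internal"]
JOB_KINDS = ["hourly", "daily", "sanitization"]


def plan_dag_files(datasets, job_kinds):
    """Return {dag_file_name: [dag_id, ...]}, one DAG file per dataset."""
    return {
        f"{dataset}_druid_load_dag.py": [
            f"{dataset}_druid_load_{kind}" for kind in job_kinds
        ]
        for dataset in datasets
    }


plan = plan_dag_files(DATASETS, JOB_KINDS)

# Print the layout: each DAG file followed by the DAGs it would contain.
for dag_file, dag_ids in plan.items():
    print(dag_file)
    for dag_id in dag_ids:
        print(f"  {dag_id}")
```

Each entry in the resulting plan corresponds to one of the six systemd timers listed earlier, so the migration is complete once every DAG id has a working Airflow counterpart.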

Event Timeline

mforns set the point value for this task to 9. (Apr 5 2023, 3:07 PM)

Change 906660 had a related patch set uploaded (by Mforns; author: Mforns):

[operations/puppet@production] analytics::refinery::job::druid_load: absent all jobs

https://gerrit.wikimedia.org/r/906660

Change 906662 had a related patch set uploaded (by Mforns; author: Mforns):

[operations/puppet@production] analytics::refinery::job::druid_load: Remove remaining jobs

https://gerrit.wikimedia.org/r/906662

Change 906660 abandoned by Mforns:

[operations/puppet@production] analytics::refinery::job::druid_load: absent all jobs

Reason:

Messed up creating this on top of an old change.

https://gerrit.wikimedia.org/r/906660

Change 906662 abandoned by Mforns:

[operations/puppet@production] analytics::refinery::job::druid_load: Remove remaining jobs

Reason:

Messed up creating this on top of an old change.

https://gerrit.wikimedia.org/r/906662

Change 906665 had a related patch set uploaded (by Mforns; author: Mforns):

[operations/puppet@production] ::analytics::refinery::job::druid_load: absent remaining jobs

https://gerrit.wikimedia.org/r/906665

Change 906667 had a related patch set uploaded (by Mforns; author: Mforns):

[operations/puppet@production] ::analytics::refinery::job::druid_load: remove remaining jobs

https://gerrit.wikimedia.org/r/906667

Change 906665 merged by Ottomata:

[operations/puppet@production] ::analytics::refinery::job::druid_load: absent remaining jobs

https://gerrit.wikimedia.org/r/906665