Page MenuHomePhabricator

Orchestrate dumps v1 from an airflow instance
Closed, ResolvedPublic

Description

We would like to orchestrate the dumps v1 jobs from an airflow instance, which would give us observability, retries, etc, for free. For this to happen, we need to write a DAG that would fetch the CronJob spec provisioned in T352650, extract the pod spec from it and start it via KubernetesPodOperator.

Details

Related Changes in Gerrit:
SubjectRepoBranchLines +/-
operations/deployment-chartsmaster+26 -12
operations/deployment-chartsmaster+30 -1
operations/deployment-chartsmaster+17 -9
operations/deployment-chartsmaster+3 -3
operations/deployment-chartsmaster+5 -5
operations/deployment-chartsmaster+186 -1
operations/dumpsmaster+4 -4
operations/dumpsmaster+10 -3
operations/dumpsmaster+21 -2
operations/deployment-chartsmaster+401 -4
operations/deployment-chartsmaster+2 -0
operations/deployment-chartsmaster+151 -3
operations/dumpsmaster+13 -1
operations/deployment-chartsmaster+1 -1
operations/deployment-chartsmaster+1 -1
operations/dumpsmaster+5 -5
operations/deployment-chartsmaster+13 -155
operations/deployment-chartsmaster+5 -5
operations/deployment-chartsmaster+4 -5
operations/deployment-chartsmaster+28 -1
operations/deployment-chartsmaster+16 -0
operations/puppetproduction+67 -0
operations/deployment-chartsmaster+4 -1
operations/deployment-chartsmaster+2 -2
operations/deployment-chartsmaster+43 -0
operations/deployment-chartsmaster+651 -13
operations/deployment-chartsmaster+5 -5
operations/deployment-chartsmaster+2 -0
operations/deployment-chartsmaster+1 -0
operations/deployment-chartsmaster+1 -1
operations/deployment-chartsmaster+3 -3
operations/deployment-chartsmaster+11 -1
operations/deployment-chartsmaster+2 -2
Show related patches Customize query in gerrit

Related Objects

StatusSubtypeAssignedTask
OpenNone
ResolvedBTullis
Resolvedbrouberol
Resolvedbrouberol
Resolvedbrouberol
ResolvedBTullis
Resolvedbrouberol
ResolvedBTullis
Resolvedbrouberol
ResolvedBTullis
ResolvedBTullis
ResolvedBTullis
Resolvedbrouberol
Resolvedbrouberol
ResolvedBTullis
Resolvedbrouberol
Resolvedbrouberol
Resolvedbrouberol
Resolvedbrouberol
Resolvedbrouberol
Resolvedbrouberol
Resolvedbrouberol
Resolvedbrouberol
Resolvedbrouberol
Resolvedbrouberol
Resolvedbrouberol
Resolvedbrouberol
Resolvedbrouberol
Resolvedbrouberol
Resolvedbrouberol
ResolvedBTullis
Resolvedbrouberol
Resolvedbrouberol
Resolvedbrouberol
Resolvedbrouberol
ResolvedBTullis
ResolvedBTullis
Resolvedbrouberol
Resolvedbrouberol
Resolvedbrouberol
Resolvedbrouberol
Resolvedbrouberol
ResolvedBTullis
Resolvedbrouberol
ResolvedBTullis
Resolvedbrouberol
ResolvedBTullis
Resolvedbrouberol
Resolvedbrouberol

Event Timeline

There are a very large number of changes, so older changes are hidden. Show Older Changes

Change #1143495 had a related patch set uploaded (by Btullis; author: Btullis):

[operations/deployment-charts@master] Bump the resources available to airflow kubernetes pod operator tasks

https://gerrit.wikimedia.org/r/1143495

Change #1143495 merged by jenkins-bot:

[operations/deployment-charts@master] Bump the resources available to airflow kubernetes pod operator tasks

https://gerrit.wikimedia.org/r/1143495

Change #1143531 had a related patch set uploaded (by Btullis; author: Btullis):

[operations/deployment-charts@master] Reduce the limits on the default kubernetes pod operator tasks

https://gerrit.wikimedia.org/r/1143531

Change #1143531 merged by jenkins-bot:

[operations/deployment-charts@master] Reduce the limits on the default kubernetes pod operator tasks

https://gerrit.wikimedia.org/r/1143531

Change #1154248 had a related patch set uploaded (by Btullis; author: Btullis):

[operations/deployment-charts@master] Airflow: Add local settings to enable the xcom_sidecar functionality

https://gerrit.wikimedia.org/r/1154248

Change #1156377 had a related patch set uploaded (by Btullis; author: Btullis):

[operations/deployment-charts@master] Bump up the CPU and RAM resources for airflow related namespaces

https://gerrit.wikimedia.org/r/1156377

Change #1156377 merged by jenkins-bot:

[operations/deployment-charts@master] Bump up the CPU and RAM resources for airflow related namespaces

https://gerrit.wikimedia.org/r/1156377

Change #1154248 merged by jenkins-bot:

[operations/deployment-charts@master] Airflow: Add local settings to enable the xcom_sidecar functionality

https://gerrit.wikimedia.org/r/1154248

brouberol merged https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/1437

test_k8s/dumps: Adjust the SQL/XML dumps DAGs after their first full scale run

brouberol opened https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/1440

(fix) test_k8s/dumps/xml_sql: define both the pod_override and pod_template_file executor configs

brouberol merged https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/1440

(fix) test_k8s/dumps/xml_sql: define both the pod_override and pod_template_file executor configs

Change #1162954 had a related patch set uploaded (by Brouberol; author: Brouberol):

[operations/deployment-charts@master] mediawiki-dumps-legacy: deploy the sync toolbox with the sync image

https://gerrit.wikimedia.org/r/1162954

Change #1162954 merged by Brouberol:

[operations/deployment-charts@master] mediawiki-dumps-legacy: deploy the sync toolbox with the sync image

https://gerrit.wikimedia.org/r/1162954

brouberol opened https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/1456

test_k8s/dumps: rely on the dagrun logical date instead of start date to craft folder names

brouberol merged https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/1456

test_k8s/dumps: rely on the dagrun data interval end date instead of start date to craft folder names

BTullis subscribed.

I'm calling this done, as we have made the switch from bare-metal to Airflow in T397848: Switch over the Dumps_v1 system to run from Airflow instead of snapshot servers