We would like to orchestrate the dumps v1 jobs from an airflow instance, which would give us observability, retries, etc, for free. For this to happen, we need to write a DAG that would fetch the CronJob spec provisioned in T352650, extract the pod spec from it and start it via KubernetesPodOperator.
Description
Details
Event Timeline
brouberol opened https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/1274
Add new wiki recently introduced in noc
brouberol merged https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/1274
Add new wiki recently introduced in noc
Change #1143495 had a related patch set uploaded (by Btullis; author: Btullis):
[operations/deployment-charts@master] Bump the resources available to airflow kubernetes pod operator tasks
Change #1143495 merged by jenkins-bot:
[operations/deployment-charts@master] Bump the resources available to airflow kubernetes pod operator tasks
Change #1143531 had a related patch set uploaded (by Btullis; author: Btullis):
[operations/deployment-charts@master] Reduce the limits on the default kubernetes pod operator tasks
Change #1143531 merged by jenkins-bot:
[operations/deployment-charts@master] Reduce the limits on the default kubernetes pod operator tasks
Change #1154248 had a related patch set uploaded (by Btullis; author: Btullis):
[operations/deployment-charts@master] Airflow: Add local settings to enable the xcom_sidecar functionality
Change #1156377 had a related patch set uploaded (by Btullis; author: Btullis):
[operations/deployment-charts@master] Bump up the CPU and RAM resources for airflow related namespaces
Change #1156377 merged by jenkins-bot:
[operations/deployment-charts@master] Bump up the CPU and RAM resources for airflow related namespaces
Change #1154248 merged by jenkins-bot:
[operations/deployment-charts@master] Airflow: Add local settings to enable the xcom_sidecar functionality
brouberol opened https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/1437
test_k8s/dumps: tag the DAGs with 'partial' or 'full'
brouberol merged https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/1437
test_k8s/dumps: Adjust the SQL/XML dumps DAGs after their first full scale run
brouberol opened https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/1440
(fix) test_k8s/dumps/xml_sql: define both the pod_override and pod_template_file executor configs
brouberol merged https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/1440
(fix) test_k8s/dumps/xml_sql: define both the pod_override and pod_template_file executor configs
Change #1162954 had a related patch set uploaded (by Brouberol; author: Brouberol):
[operations/deployment-charts@master] mediawiki-dumps-legacy: deploy the sync toolbox with the sync image
Change #1162954 merged by Brouberol:
[operations/deployment-charts@master] mediawiki-dumps-legacy: deploy the sync toolbox with the sync image
btullis opened https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/1455
Dumps_v1: Configure data-engineering-alerts@w.o for all dump alerts
brouberol opened https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/1456
test_k8s/dumps: rely on the dagrun logical date instead of start date to craft folder names
btullis merged https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/1455
Dumps_v1: Configure data-engineering-alerts@w.o for all dump alerts
brouberol merged https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/1456
test_k8s/dumps: rely on the dagrun data interval end date instead of start date to craft folder names
brouberol opened https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/1457
test_k8s/dumps/xml_sql: remove obsolete comment
brouberol merged https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/1457
test_k8s/dumps/xml_sql: remove obsolete comment
brouberol opened https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/1458
test_k8s/dumps/xml_sql: cleanup task definition
brouberol merged https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/1458
test_k8s/dumps/xml_sql: cleanup task definition
btullis opened https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/1461
Dumps_v1: Tidy and make uniform the fetch_job_pod_spec tasks
btullis merged https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/1461
Dumps_v1: Tidy and make uniform the fetch_job_pod_spec tasks
btullis opened https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/1463
Dumps_v1: DRY up the parallel-rsync default arguments
btullis merged https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/1463
Dumps_v1: DRY up the parallel-rsync default arguments
btullis opened https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/1465
Dumps_v1: Miscellaneous schedule_interval fixes
btullis merged https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/1465
Dumps_v1: Miscellaneous schedule_interval fixes
brouberol opened https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/1473
test_k8s/dumps: mark all dumps are active
brouberol merged https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/1473
test_k8s/dumps: mark all dumps are active
I'm calling this done, as we have made the switch from bare-metal to Airflow in T397848: Switch over the Dumps_v1 system to run from Airflow instead of snapshot servers