Page MenuHomePhabricator

Migrate the airflow-platform-eng scheduler to Kubernetes
Closed, ResolvedPublic

Description

Following the procedure outlined on https://wikitech.wikimedia.org/wiki/Data_Platform/Systems/Airflow/Kubernetes/Operations#Migrate_the_scheduler_and_kerberos_components_to_Kubernetes

  • Create the kerberos principals and the base64 representation of the instance keytab
  • Create the S3 user
  • add the secrets
  • Create the S3 bucket
  • Sync all the scheduler and DAG task logs to S3
  • stop all the airflow systemd services and sync the logs again
  • deploy the helmfile.d changes (to be deployed)
  • deploy the puppet changes ervices_ensure: absent (to be deployed
  • Verify Dags are running

Event Timeline

Gehel triaged this task as High priority.Nov 25 2024, 1:33 PM

Change #1125424 had a related patch set uploaded (by Stevemunene; author: Stevemunene):

[operations/deployment-charts@master] airflow-product-eng: migrate scheduler and db to k8s

https://gerrit.wikimedia.org/r/1125424

Instance owners have requested us to postpone our move to dse-k8s until they are done testing and running currently blocked pipelines related to https://phabricator.wikimedia.org/T387470 and https://phabricator.wikimedia.org/T386255

Change #1133367 had a related patch set uploaded (by Stevemunene; author: Stevemunene):

[operations/puppet@production] airflow-platform-eng: set up all services except systemd

https://gerrit.wikimedia.org/r/1133367

Change #1125424 merged by jenkins-bot:

[operations/deployment-charts@master] airflow-product-eng: migrate scheduler and db to k8s

https://gerrit.wikimedia.org/r/1125424

Change #1133367 merged by Stevemunene:

[operations/puppet@production] airflow-platform-eng: set up all services except systemd

https://gerrit.wikimedia.org/r/1133367

Stevemunene updated the task description. (Show Details)

Scheduler is moved and all the DAGS have been verified to be running as expected