The Data-Platform-SRE has been working on a migration of our Airflow services from host-based to Kubernetes, for around a year.
The project was originally defined in Airflow: High-Availability Strategy - A design for maximum uptime of our pipelines
The work was tracked in: T362788: Migrate Airflow to the dse-k8s cluster
We presented our work at FOSDEM 2025 in Enhancing Airflow for Analytics, Data Engineering, and ML at Wikimedia
We have also presented a deep-dive session internally to the Data-Platform group: in DPE Deep Dive - Airflow on K8s
We believe that it would make a good post for the tech blog.