Right now, we have the spark-submit executable in the airflow image, but it cannot run due to java not being installed.
airflow@airflow-scheduler-66d44f5d5b-9vcsc:/opt/airflow$ spark-submit --help JAVA_HOME is not set airflow@airflow-scheduler-66d44f5d5b-9vcsc:/opt/airflow$ which java airflow@airflow-scheduler-66d44f5d5b-9vcsc:/opt/airflow$
Our DAGs also refer to spark3-submit in the Skein jobs they run (example), and not spark-submit, so we need to make sure both work.