
Content_gap_metrics stage of knowledge_gaps job failing repeatedly
Closed, Resolved (Public)

Description

During the 2024-02-01 run of the knowledge_gaps job, the content_gap_metrics stage has now failed six times, each time with a similar error:

[2024-02-26, 15:58:26 UTC] {taskinstance.py:1937} ERROR - Task failed with exception
Traceback (most recent call last):
  File "/usr/lib/airflow/lib/python3.10/site-packages/airflow/providers/apache/spark/operators/spark_submit.py", line 160, in execute
    self._hook.submit(self._application)
  File "/srv/deployment/airflow-dags/research/wmf_airflow_common/hooks/spark.py", line 435, in submit
    return self._skein_hook.submit()
  File "/srv/deployment/airflow-dags/research/wmf_airflow_common/hooks/skein.py", line 280, in submit
    raise AirflowException(str(self))
airflow.exceptions.AirflowException: SkeinHook Airflow SparkSkeinSubmitHook skein launcher knowledge_gaps__content_gap_metrics__20240101 application_1707226456123_106278
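
The exception above only names the Skein launcher and the YARN application ID, so the underlying error is presumably in the YARN application logs rather than in the Airflow task log. As a minimal sketch for anyone triaging this, the snippet below pulls the application ID out of the message and prints the standard `yarn logs` command for it. The helper name `extract_application_id` is hypothetical, not part of wmf_airflow_common, and the regex assumes the usual `application_<cluster timestamp>_<sequence>` ID format.

```python
import re
from typing import Optional

# Assumption: YARN application IDs look like application_<cluster timestamp>_<sequence number>.
APP_ID_PATTERN = re.compile(r"application_\d+_\d+")


def extract_application_id(message: str) -> Optional[str]:
    """Return the first YARN application ID found in message, or None if there is none."""
    match = APP_ID_PATTERN.search(message)
    return match.group(0) if match else None


# Exception text copied from the traceback in this task.
error_message = (
    "airflow.exceptions.AirflowException: SkeinHook Airflow SparkSkeinSubmitHook "
    "skein launcher knowledge_gaps__content_gap_metrics__20240101 "
    "application_1707226456123_106278"
)

app_id = extract_application_id(error_message)
if app_id is not None:
    # Print the command to run on a host with YARN client access; nothing is executed here.
    print(f"yarn logs -applicationId {app_id}")
```

The printed command would need to be run as a user with permission to read the job's logs.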

This is currently blocking production of the Jan 2024 movement metrics.