Every night at between 00:01:30 to 00:03:30 airflow scheduler from an-aluncher1002 is facing DB conneciton issue
Mar 06 00:01:33 an-launcher1002 airflow-scheduler@analytics[5803]: The above exception was the direct cause of the following exception: Mar 06 00:01:33 an-launcher1002 airflow-scheduler@analytics[5803]: MySQLdb._exceptions.OperationalError: (2026, 'SSL connection error: Error in the pull function.') Mar 06 00:01:33 an-launcher1002 airflow-scheduler@analytics[5803]: super().__init__(*args, **kwargs2) Mar 06 00:01:33 an-launcher1002 airflow-scheduler@analytics[5803]: File "/usr/lib/airflow/lib/python3.7/site-packages/MySQLdb/connections.py", line 185, in __init__ Mar 06 00:01:33 an-launcher1002 airflow-scheduler@analytics[5803]: return Connection(*args, **kwargs) Mar 06 00:01:33 an-launcher1002 airflow-scheduler@analytics[5803]: File "/usr/lib/airflow/lib/python3.7/site-packages/MySQLdb/__init__.py", line 130, in Connect Mar 06 00:01:33 an-launcher1002 airflow-scheduler@analytics[5803]: return self.dbapi.connect(*cargs, **cparams) Mar 06 00:01:33 an-launcher1002 airflow-scheduler@analytics[5803]: File "/usr/lib/airflow/lib/python3.7/site-packages/sqlalchemy/engine/default.py", line 508, in connect Mar 06 00:01:33 an-launcher1002 airflow-scheduler@analytics[5803]: return dialect.connect(*cargs, **cparams) Mar 06 00:01:33 an-launcher1002 airflow-scheduler@analytics[5803]: File "/usr/lib/airflow/lib/python3.7/site-packages/sqlalchemy/engine/strategies.py", line 114, in connect Mar 06 00:01:33 an-launcher1002 airflow-scheduler@analytics[5803]: connection = pool._invoke_creator(self) Mar 06 00:01:33 an-launcher1002 airflow-scheduler@analytics[5803]: File "/usr/lib/airflow/lib/python3.7/site-packages/sqlalchemy/pool/base.py", line 656, in __connect Mar 06 00:01:33 an-launcher1002 airflow-scheduler@analytics[5803]: raise exception Mar 06 00:01:33 an-launcher1002 airflow-scheduler@analytics[5803]: File "/usr/lib/airflow/lib/python3.7/site-packages/sqlalchemy/util/compat.py", line 182, in raise_ Mar 06 00:01:33 an-launcher1002 airflow-scheduler@analytics[5803]: with_traceback=exc_tb, Mar 06 00:01:33 an-launcher1002 airflow-scheduler@analytics[5803]: File "/usr/lib/airflow/lib/python3.7/site-packages/sqlalchemy/util/langhelpers.py", line 70, in __exit__ Mar 06 00:01:33 an-launcher1002 airflow-scheduler@analytics[5803]: pool.logger.debug("Error on connect(): %s", e) Mar 06 00:01:33 an-launcher1002 airflow-scheduler@analytics[5803]: File "/usr/lib/airflow/lib/python3.7/site-packages/sqlalchemy/pool/base.py", line 661, in __connect Mar 06 00:01:33 an-launcher1002 airflow-scheduler@analytics[5803]: self.__connect(first_connect_check=True) Mar 06 00:01:33 an-launcher1002 airflow-scheduler@analytics[5803]: File "/usr/lib/airflow/lib/python3.7/site-packages/sqlalchemy/pool/base.py", line 440, in __init__ Mar 06 00:01:33 an-launcher1002 airflow-scheduler@analytics[5803]: return _ConnectionRecord(self) Mar 06 00:01:33 an-launcher1002 airflow-scheduler@analytics[5803]: File "/usr/lib/airflow/lib/python3.7/site-packages/sqlalchemy/pool/base.py", line 309, in _create_connection Mar 06 00:01:33 an-launcher1002 airflow-scheduler@analytics[5803]: return self._create_connection() Mar 06 00:01:33 an-launcher1002 airflow-scheduler@analytics[5803]: File "/usr/lib/airflow/lib/python3.7/site-packages/sqlalchemy/pool/impl.py", line 137, in _do_get Mar 06 00:01:33 an-launcher1002 airflow-scheduler@analytics[5803]: raise exception Mar 06 00:01:33 an-launcher1002 airflow-scheduler@analytics[5803]: File "/usr/lib/airflow/lib/python3.7/site-packages/sqlalchemy/util/compat.py", line 182, in raise_ Mar 06 00:01:33 an-launcher1002 airflow-scheduler@analytics[5803]: with_traceback=exc_tb, Mar 06 00:01:33 an-launcher1002 airflow-scheduler@analytics[5803]: File "/usr/lib/airflow/lib/python3.7/site-packages/sqlalchemy/util/langhelpers.py", line 70, in __exit__ Mar 06 00:01:33 an-launcher1002 airflow-scheduler@analytics[5803]: self._dec_overflow() Mar 06 00:01:33 an-launcher1002 airflow-scheduler@analytics[5803]: File "/usr/lib/airflow/lib/python3.7/site-packages/sqlalchemy/pool/impl.py", line 140, in _do_get Mar 06 00:01:33 an-launcher1002 airflow-scheduler@analytics[5803]: rec = pool._do_get() Mar 06 00:01:33 an-launcher1002 airflow-scheduler@analytics[5803]: File "/usr/lib/airflow/lib/python3.7/site-packages/sqlalchemy/pool/base.py", line 495, in checkout Mar 06 00:01:33 an-launcher1002 airflow-scheduler@analytics[5803]: fairy = _ConnectionRecord.checkout(pool) Mar 06 00:01:33 an-launcher1002 airflow-scheduler@analytics[5803]: File "/usr/lib/airflow/lib/python3.7/site-packages/sqlalchemy/pool/base.py", line 778, in _checkout Mar 06 00:01:33 an-launcher1002 airflow-scheduler@analytics[5803]: return _ConnectionFairy._checkout(self) Mar 06 00:01:33 an-launcher1002 airflow-scheduler@analytics[5803]: File "/usr/lib/airflow/lib/python3.7/site-packages/sqlalchemy/pool/base.py", line 364, in connect Mar 06 00:01:33 an-launcher1002 airflow-scheduler@analytics[5803]: return fn() Mar 06 00:01:33 an-launcher1002 airflow-scheduler@analytics[5803]: File "/usr/lib/airflow/lib/python3.7/site-packages/sqlalchemy/engine/base.py", line 2336, in _wrap_pool_connect Mar 06 00:01:33 an-launcher1002 airflow-scheduler@analytics[5803]: Traceback (most recent call last): Mar 06 00:01:33 an-launcher1002 airflow-scheduler@analytics[5803]: Process DagFileProcessor346386-Process: Mar 06 00:01:27 an-launcher1002 airflow-scheduler@analytics[5803]: (Background on this error at: http://sqlalche.me/e/13/e3q8)
This issue can lead to jobs being reported in failure while they have run successfully
Mar 03 00:03:41 an-launcher1002 airflow-scheduler@analytics[5803]: [2023-03-03 00:03:41,218] {scheduler_job.py:655} INFO - Setting task instance <TaskInstance: pageview_actor_hourly.compute_pageview_actor_hourly 2023-03-02 22:00:00+00:00 [queued]> state to failed as reported by executor Mar 03 00:03:41 an-launcher1002 airflow-scheduler@analytics[5803]: [2023-03-03 00:03:41,217] {scheduler_job.py:648} ERROR - Executor reports task instance <TaskInstance: pageview_actor_hourly.compute_pageview_actor_hourly 2023-03-02 22:00:00+00:00 [queued]> finished (failed) although the task says its queued. (Info: None) Was the task killed externally? ... DB ISSUE ... Mar 03 00:01:31 an-launcher1002 airflow-scheduler@analytics[5803]: <TaskInstance: pageview_actor_hourly.compute_pageview_actor_hourly 2023-03-02 22:00:00+00:00 [scheduled]> Mar 03 00:01:31 an-launcher1002 airflow-scheduler@analytics[5803]: <TaskInstance: pageview_actor_hourly.compute_pageview_actor_hourly 2023-03-02 22:00:00+00:00 [scheduled]>