You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@airflow.apache.org by "Kaxil Naik (Jira)" <ji...@apache.org> on 2020/03/04 00:12:00 UTC

[jira] [Resolved] (AIRFLOW-6040) Airflow scheduler with kubernetes executor fails :- Unknown error in KubernetesJobWatcher

     [ https://issues.apache.org/jira/browse/AIRFLOW-6040?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Kaxil Naik resolved AIRFLOW-6040.
---------------------------------
    Fix Version/s: 1.10.10
       Resolution: Fixed

> Airflow scheduler with kubernetes executor fails :- Unknown error in KubernetesJobWatcher
> -----------------------------------------------------------------------------------------
>
>                 Key: AIRFLOW-6040
>                 URL: https://issues.apache.org/jira/browse/AIRFLOW-6040
>             Project: Apache Airflow
>          Issue Type: Bug
>          Components: contrib, executor-kubernetes, scheduler
>    Affects Versions: 1.10.6
>            Reporter: Ashutosh Srivastava
>            Assignee: Daniel Imberman
>            Priority: Major
>             Fix For: 1.10.10
>
>
> I am trying to set up airflow with the kubernetes executor. I have cloned airflow 1.10.6 and am building the docker image and then deploying it with kube. The pods are running, the service airflow also starts. The webserver is working fine. But when I check the logs for the scheduler I get the following error.
>  
> {{ERROR - Error while health checking kube watcher process. Process died for unknown reasons
> INFO - Event: and now my watch begins starting at resource_version: 0
> ERROR - Unknown error in KubernetesJobWatcher. Failing
> Traceback (most recent call last):
>   File "/usr/local/lib/python2.7/dist-packages/airflow/contrib/executors/kubernetes_executor.py", line 333, in run
>     self.worker_uuid, self.kube_config)
>   File "/usr/local/lib/python2.7/dist-packages/airflow/contrib/executors/kubernetes_executor.py", line 358, in _run
>     **kwargs):
>   File "/usr/local/lib/python2.7/dist-packages/kubernetes/watch/watch.py", line 144, in stream
>     for line in iter_resp_lines(resp):
>   File "/usr/local/lib/python2.7/dist-packages/kubernetes/watch/watch.py", line 48, in iter_resp_lines
>     for seg in resp.read_chunked(decode_content=False):
>   File "/usr/local/lib/python2.7/dist-packages/urllib3/response.py", line 781, in read_chunked
>     self._original_response.close()
>   File "/usr/lib/python2.7/contextlib.py", line 35, in __exit__
>     self.gen.throw(type, value, traceback)
>   File "/usr/local/lib/python2.7/dist-packages/urllib3/response.py", line 439, in _error_catcher
>     raise ReadTimeoutError(self._pool, None, "Read timed out.")
> ReadTimeoutError: HTTPSConnectionPool(host='10.0.0.1', port=443): Read timed out.}}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)