Posted to commits@airflow.apache.org by "Ashutosh Srivastava (Jira)" <ji...@apache.org> on 2019/11/22 10:10:00 UTC

[jira] [Created] (AIRFLOW-6040) Airflow scheduler with kubernetes executor fails :- Unknown error in KubernetesJobWatcher

Ashutosh Srivastava created AIRFLOW-6040:
--------------------------------------------

             Summary: Airflow scheduler with kubernetes executor fails :- Unknown error in KubernetesJobWatcher
                 Key: AIRFLOW-6040
                 URL: https://issues.apache.org/jira/browse/AIRFLOW-6040
             Project: Apache Airflow
          Issue Type: Bug
          Components: contrib, executor-kubernetes, scheduler
    Affects Versions: 1.10.6
            Reporter: Ashutosh Srivastava
            Assignee: Daniel Imberman


I am trying to set up Airflow with the Kubernetes executor. I cloned Airflow 1.10.6, built the Docker image, and deployed it to Kubernetes. The pods are running, the airflow service starts, and the webserver works fine. However, the scheduler logs show the following error:

 

{{ERROR - Error while health checking kube watcher process. Process died for unknown reasons
INFO - Event: and now my watch begins starting at resource_version: 0
ERROR - Unknown error in KubernetesJobWatcher. Failing
Traceback (most recent call last):
  File "/usr/local/lib/python2.7/dist-packages/airflow/contrib/executors/kubernetes_executor.py", line 333, in run
    self.worker_uuid, self.kube_config)
  File "/usr/local/lib/python2.7/dist-packages/airflow/contrib/executors/kubernetes_executor.py", line 358, in _run
    **kwargs):
  File "/usr/local/lib/python2.7/dist-packages/kubernetes/watch/watch.py", line 144, in stream
    for line in iter_resp_lines(resp):
  File "/usr/local/lib/python2.7/dist-packages/kubernetes/watch/watch.py", line 48, in iter_resp_lines
    for seg in resp.read_chunked(decode_content=False):
  File "/usr/local/lib/python2.7/dist-packages/urllib3/response.py", line 781, in read_chunked
    self._original_response.close()
  File "/usr/lib/python2.7/contextlib.py", line 35, in __exit__
    self.gen.throw(type, value, traceback)
  File "/usr/local/lib/python2.7/dist-packages/urllib3/response.py", line 439, in _error_catcher
    raise ReadTimeoutError(self._pool, None, "Read timed out.")
ReadTimeoutError: HTTPSConnectionPool(host='10.0.0.1', port=443): Read timed out.}}
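
For illustration: the traceback above shows the watch stream ending with a ReadTimeoutError instead of an event, after which the watcher fails. The following is a minimal sketch of one way a watcher could tolerate such a timeout, by reopening the stream from the last resource version it saw. It is not the Airflow code and not a confirmed fix for this issue. The names watch_with_retry and stream_factory are hypothetical, and the local ReadTimeoutError class is only a stand-in for the urllib3 exception so the sketch runs without any dependencies.

```python
import time


class ReadTimeoutError(Exception):
    """Stand-in for urllib3.exceptions.ReadTimeoutError (assumption: used
    here only so the sketch is runnable without urllib3 installed)."""


def watch_with_retry(stream_factory, max_retries=3, backoff_seconds=0.0):
    """Yield events from a watch stream, reopening it after a read timeout.

    stream_factory is a hypothetical callable: given a resource version, it
    returns an iterator of (resource_version, event) pairs that come after
    that version.
    """
    resource_version = "0"  # same starting point as in the log above
    retries = 0
    while True:
        try:
            for resource_version, event in stream_factory(resource_version):
                retries = 0  # a successful read resets the retry budget
                yield event
            return  # the stream ended without an error
        except ReadTimeoutError:
            retries += 1
            if retries > max_retries:
                raise  # give up and surface the timeout to the caller
            time.sleep(backoff_seconds)
```

The retry budget is reset after every event that is read successfully, so only consecutive timeouts count toward max_retries. Resuming from the last seen resource version is meant to avoid replaying events that were already handled.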


