Posted to issues@beam.apache.org by "Jens Wiren (Jira)" <ji...@apache.org> on 2021/03/11 13:02:00 UTC

[jira] [Created] (BEAM-11959) Python Beam SDK Harness hangs when install pip packages

Jens Wiren created BEAM-11959:
---------------------------------

             Summary: Python Beam SDK Harness hangs when install pip packages
                 Key: BEAM-11959
                 URL: https://issues.apache.org/jira/browse/BEAM-11959
             Project: Beam
          Issue Type: Bug
          Components: sdk-py-harness
    Affects Versions: 2.28.0, 2.27.0
         Environment: Kubernetes v1.19.6
            Reporter: Jens Wiren


When running a Beam pipeline with Flink as the backend, the Python SDK harness hangs when trying to install pip packages. Tested with Flink 1.10.3.

 

Specifically, this was tested by running a TFX pipeline. The pipeline is submitted and registered as expected, but the SDK harness hangs while installing the extra package:

2021/03/10 12:16:20 Initializing python harness: /opt/apache/beam/boot --id=1-1 --logging_endpoint=localhost:39795 --artifact_endpoint=localhost:34095 --provision_endpoint=localhost:42999 --control_endpoint=localhost:38129
2021/03/10 12:16:20 Found artifact: tfx_ephemeral-0.27.0.tar.gz
2021/03/10 12:16:20 Found artifact: extra_packages.txt
2021/03/10 12:16:20 Installing setup packages ...
2021/03/10 12:16:20 Installing extra package: tfx_ephemeral-0.27.0.tar.gz

Nothing else is logged after this, regardless of how long the harness is left running. Installing the TFX package manually, by exec-ing into the container, takes less than 3 minutes.
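For anyone trying to reproduce the manual check, a minimal sketch of timing the install from inside the container follows. It is not part of the Beam boot code; the helper names `timed_run` and `pip_install_cmd` are hypothetical, and the exact pip invocation used by the harness is an assumption here.

```python
import subprocess
import sys
import time


def timed_run(cmd, timeout_s):
    """Run cmd and return (returncode, elapsed_seconds).

    returncode is None if the command was still running after
    timeout_s seconds and had to be killed.
    """
    start = time.monotonic()
    try:
        proc = subprocess.run(
            cmd,
            timeout=timeout_s,
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
        )
        returncode = proc.returncode
    except subprocess.TimeoutExpired:
        # subprocess.run kills the child before raising this exception.
        returncode = None
    return returncode, time.monotonic() - start


def pip_install_cmd(package_path):
    # Assumption: a plain "pip install <local artifact>" with the current
    # interpreter; the harness may pass additional flags.
    return [sys.executable, "-m", "pip", "install", package_path]
```

Calling `timed_run(pip_install_cmd(path), 600)` with `path` pointing at the staged `tfx_ephemeral-0.27.0.tar.gz` (its location in the container is not given in this report) would show whether the install finishes or is killed at the timeout.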

The Flink task manager then sits idle and periodically logs:

2021-03-10 11:29:26,287 INFO org.apache.beam.runners.fnexecution.environment.ExternalEnvironmentFactory - Still waiting for startup of environment from localhost:50000 for worker id 1-1
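To spot affected workers in a longer task manager log, the message above can be matched with a small script. This is only a sketch: the pattern is derived from the single log line quoted in this report, and `parse_waiting_line` is a hypothetical helper name.

```python
import re

# Pattern based on the one "Still waiting" line quoted above; other Beam
# versions may word the message differently.
WAITING_RE = re.compile(
    r"Still waiting for startup of environment from (?P<endpoint>\S+) "
    r"for worker id (?P<worker_id>\S+)"
)


def parse_waiting_line(line):
    """Return (endpoint, worker_id) for a 'Still waiting' line, else None."""
    match = WAITING_RE.search(line)
    if match is None:
        return None
    return match.group("endpoint"), match.group("worker_id")
```

Applied to each line of the task manager log, this yields the endpoint and worker id of every environment that has not finished starting.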



--
This message was sent by Atlassian Jira
(v8.3.4#803005)