Posted to issues@beam.apache.org by "Brian Hulette (Jira)" <ji...@apache.org> on 2021/04/13 19:12:00 UTC

[jira] [Commented] (BEAM-12163) Python GHA PreCommits flake with grpc.FutureTimeoutError on SDK harness startup

    [ https://issues.apache.org/jira/browse/BEAM-12163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17320480#comment-17320480 ] 

Brian Hulette commented on BEAM-12163:
--------------------------------------

I couldn't find anything suspicious in the commits before the first failure.

A grpcio release (1.37.0) roughly coincided with the first failure, but I checked and that version was not used in the first failing run.

I diffed the dependencies between the first failure and last success (https://github.com/apache/beam/runs/2274621182?check_suite_focus=true) and found:
{code}
❯ diff good bad
12c12
< docker==4.4.4
---
> docker==5.0.0
57c57
< SQLAlchemy==1.4.5
---
> SQLAlchemy==1.4.6
{code}

SQLAlchemy is probably innocuous, but I bet the docker 5.0.0 release could cause this. I'll try pinning docker to <5.
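
For anyone who wants to repeat the comparison above without eyeballing raw diff output, here is a minimal sketch. It assumes the two dependency lists are `pip freeze`-style text with `name==version` lines; the helper names `parse_freeze` and `diff_pins` are made up for illustration and are not part of Beam or pip, and the `example-unchanged` entry is a placeholder, not a real dependency from the runs.

```python
def parse_freeze(text):
    """Map lowercased package name -> pinned version from `name==version` lines."""
    pins = {}
    for line in text.splitlines():
        line = line.strip()
        # Skip blanks, comments, and anything that is not a plain `==` pin.
        if not line or line.startswith("#") or "==" not in line:
            continue
        name, version = line.split("==", 1)
        pins[name.lower()] = version
    return pins


def diff_pins(good, bad):
    """Return {name: (good_version, bad_version)} for packages whose pins differ."""
    names = sorted(set(good) | set(bad))
    return {
        name: (good.get(name), bad.get(name))
        for name in names
        if good.get(name) != bad.get(name)
    }


# The two changed pins come from the diff above; `example-unchanged` is a
# hypothetical entry standing in for the dependencies that did not change.
good = parse_freeze("docker==4.4.4\nSQLAlchemy==1.4.5\nexample-unchanged==1.0.0\n")
bad = parse_freeze("docker==5.0.0\nSQLAlchemy==1.4.6\nexample-unchanged==1.0.0\n")

for name, (old, new) in diff_pins(good, bad).items():
    print(f"{name}: {old} -> {new}")
```

A package that appears in only one list shows up with `None` on the other side, so newly added or dropped dependencies are reported as well as version bumps.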

> Python GHA PreCommits flake with grpc.FutureTimeoutError on SDK harness startup
> -------------------------------------------------------------------------------
>
>                 Key: BEAM-12163
>                 URL: https://issues.apache.org/jira/browse/BEAM-12163
>             Project: Beam
>          Issue Type: Bug
>          Components: sdk-py-core, test-failures
>            Reporter: Brian Hulette
>            Assignee: Brian Hulette
>            Priority: P2
>              Labels: currently-failing, flake
>
> Example stacktrace:
> {code}
>   File "/home/runner/work/beam/beam/sdks/python/apache_beam/runners/portability/fn_api_runner/worker_handlers.py", line 639, in start_worker
>     raise RuntimeError("Error starting worker: %s" % response.error)
> RuntimeError: Error starting worker: Traceback (most recent call last):
>   File "/home/runner/work/beam/beam/sdks/python/apache_beam/runners/worker/worker_pool_main.py", line 154, in StartWorker
>     data_buffer_time_limit_ms=self._data_buffer_time_limit_ms)
>   File "/home/runner/work/beam/beam/sdks/python/apache_beam/runners/worker/sdk_worker.py", line 193, in __init__
>     grpc.channel_ready_future(self._control_channel).result(timeout=60)
>   File "/home/runner/work/beam/beam/sdks/python/target/.tox/py36/lib/python3.6/site-packages/grpc/_utilities.py", line 140, in result
>     self._block(timeout)
>   File "/home/runner/work/beam/beam/sdks/python/target/.tox/py36/lib/python3.6/site-packages/grpc/_utilities.py", line 86, in _block
>     raise grpc.FutureTimeoutError()
> grpc.FutureTimeoutError
> {code}
> First failure on master branch: https://github.com/apache/beam/runs/2283782613?check_suite_focus=true


