You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@beam.apache.org by "Kenneth Knowles (Jira)" <ji...@apache.org> on 2020/08/25 13:45:00 UTC

[jira] [Updated] (BEAM-10808) StreamingDataflowWorker streaming rpcs due not always observe stream failures, until timeout

     [ https://issues.apache.org/jira/browse/BEAM-10808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Kenneth Knowles updated BEAM-10808:
-----------------------------------
    Status: Open  (was: Triage Needed)

> StreamingDataflowWorker streaming rpcs due not always observe stream failures, until timeout
> --------------------------------------------------------------------------------------------
>
>                 Key: BEAM-10808
>                 URL: https://issues.apache.org/jira/browse/BEAM-10808
>             Project: Beam
>          Issue Type: Bug
>          Components: runner-dataflow
>            Reporter: Sam Whittle
>            Assignee: Sam Whittle
>            Priority: P2
>
> The StreamingDataflow worker communicates to Streaming Engine with grpc streaming rpcs. Occasionaly with service failures, the error is not observed by the streaming rpc until the deadline is exceeded.  As the deadline is multiple minutes, this adds unnecessary latency to pipelines that are otherwise keeping up with seconds delay.
> Testing has shown that grpc channel health checks are not sufficient but that application level health checks ensure that streaming rpc health issues are detected earlier than the stream timeout.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)