You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@beam.apache.org by Janek Bevendorff <ja...@uni-weimar.de> on 2022/09/22 09:01:29 UTC

Beam on Flink: GOAWAY with error code ENHANCE_YOUR_CALM and debug data equal to "too_many_pings"

Hi,

There are multiple issue reports about this or similar issues on 
GitHub/Jira but all of them without any proper solution, so maybe you 
can help me.

I am running Beam on Flink (using the Portable runner via Beam's Flink 
job server) and when something takes a bit longer than expected or the 
shuffle size gets a bit larger, my workers keep failing randomly with 
the following error:

E0922 08:50:52.814447061     222 chttp2_transport.cc:1167] Received a 
GOAWAY with error code ENHANCE_YOUR_CALM and debug data equal to 
"too_many_pings"

I have already tried adding

("grpc.http2.max_pings_without_data", 0),
("grpc.http2.max_ping_strikes", 0)

to DEFAULT_OPTIONS in 
sdks/python/apache_beam/runners/worker/channel_factory.py, but without 
success. Are there any other places where gRPC connections are 
established that need these extra options? Are there any other options 
that I overlooked?

The most relevant (unsolved) issue report is probably this one here: 
https://github.com/apache/beam/issues/21598

This issue is pretty serious, since it pretty much prevents me from 
running jobs with more than a handful of workers or large data.

Many thanks
Janek