Posted to github@beam.apache.org by GitBox <gi...@apache.org> on 2022/06/03 22:59:55 UTC

[GitHub] [beam] kennknowles opened a new issue, #19236: Bound the amount of data on the data plane by time.

kennknowles opened a new issue, #19236:
URL: https://github.com/apache/beam/issues/19236

   Bounding buffered data by time is especially important for Fn API reads, where each element represents a shard to read and may be very expensive to process, yet many such elements may be waiting in the Fn API buffer.
   
   The need for this will be mitigated with full SDF support for liquid sharding over the Fn API, but not eliminated unless the runner can "unread" elements it has already sent. 
   
   This also matters for Dataflow jobs that start out small but then detect that they need more workers (e.g. due to the initial inputs being an SDF).
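
To make the request concrete, here is a minimal sketch of what bounding by time rather than by element count could look like on the sending side. It is not Beam code: `TimeBoundedSender`, its methods, and the exponential moving average used for the per-element estimate are all hypothetical names and choices made for illustration, and the actual Fn API data plane may be structured quite differently.

```python
from collections import deque


class TimeBoundedSender:
    """Hypothetical runner-side gate (not a Beam API).

    Limits the elements in flight by their estimated total processing
    time instead of by a fixed element count.
    """

    def __init__(self, budget_seconds, initial_estimate_seconds=1.0):
        self.budget_seconds = budget_seconds
        # Running estimate of how long one element takes to process.
        self.estimate_seconds = initial_estimate_seconds
        self.pending = deque()  # elements not yet sent to the SDK harness
        self.in_flight = 0      # elements sent but not yet acknowledged

    def offer(self, element):
        """Queue an element; it is sent only when the budget allows."""
        self.pending.append(element)

    def next_batch(self):
        """Return the elements that fit within the time budget right now."""
        batch = []
        while (self.pending and
               (self.in_flight + 1) * self.estimate_seconds
               <= self.budget_seconds):
            batch.append(self.pending.popleft())
            self.in_flight += 1
        # Always allow one element when nothing is in flight, so a single
        # element estimated to exceed the budget cannot stall the pipeline.
        if not batch and self.pending and self.in_flight == 0:
            batch.append(self.pending.popleft())
            self.in_flight = 1
        return batch

    def ack(self, observed_seconds, alpha=0.5):
        """Record a finished element and update the per-element estimate."""
        self.in_flight -= 1
        self.estimate_seconds = (alpha * observed_seconds
                                 + (1 - alpha) * self.estimate_seconds)
```

With a 10 second budget and cheap elements, many elements are sent at once; as soon as elements turn out to be expensive (for example, each one is a shard to read), the estimate rises and the sender holds the remaining elements back, where they can still be handed to other workers.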
   
   Imported from Jira [BEAM-5791](https://issues.apache.org/jira/browse/BEAM-5791). Original Jira may contain additional context.
   Reported by: robertwb.

