You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@beam.apache.org by "Ben Sidhom (JIRA)" <ji...@apache.org> on 2018/05/03 00:21:00 UTC
[jira] [Commented] (BEAM-4228) The FlinkRunner shouldn't require
all of the values for a key to fit in memory
[ https://issues.apache.org/jira/browse/BEAM-4228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16461769#comment-16461769 ]
Ben Sidhom commented on BEAM-4228:
----------------------------------
For context, see [https://github.com/apache/beam/pull/5226/files#r185652571.]
> The FlinkRunner shouldn't require all of the values for a key to fit in memory
> ------------------------------------------------------------------------------
>
> Key: BEAM-4228
> URL: https://issues.apache.org/jira/browse/BEAM-4228
> Project: Beam
> Issue Type: New Feature
> Components: runner-flink
> Reporter: Thomas Groh
> Priority: Major
>
> The use of a reducer that adds all of the elements that it consumes to a list is the primary way in which this occurs - if instead, we produce a filtered iterable, or a collection of filtered iterables, we can lazily iterate over all of the contained elements without having to buffer all of the elements.
>
> For an example of where this occurs, see {{Concatenate}} in {{FlinkBatchPortablePipelineTranslator}}.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)