You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@beam.apache.org by "Ben Sidhom (JIRA)" <ji...@apache.org> on 2018/05/03 00:21:00 UTC

[jira] [Commented] (BEAM-4228) The FlinkRunner shouldn't require all of the values for a key to fit in memory

    [ https://issues.apache.org/jira/browse/BEAM-4228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16461769#comment-16461769 ] 

Ben Sidhom commented on BEAM-4228:
----------------------------------

For context, see [https://github.com/apache/beam/pull/5226/files#r185652571.]

> The FlinkRunner shouldn't require all of the values for a key to fit in memory
> ------------------------------------------------------------------------------
>
>                 Key: BEAM-4228
>                 URL: https://issues.apache.org/jira/browse/BEAM-4228
>             Project: Beam
>          Issue Type: New Feature
>          Components: runner-flink
>            Reporter: Thomas Groh
>            Priority: Major
>
> The use of a reducer that adds all of the elements that it consumes to a list is the primary way in which this occurs - if instead, we produce a filtered iterable, or a collection of filtered iterables, we can lazily iterate over all of the contained elements without having to buffer all of the elements.
>  
> For an example of where this occurs, see {{Concatenate}} in  {{FlinkBatchPortablePipelineTranslator}}.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)