Posted to commits@beam.apache.org by "Thomas Groh (JIRA)" <ji...@apache.org> on 2016/07/14 20:00:23 UTC

[jira] [Comment Edited] (BEAM-435) DirectRunner GBK -- task per key?

    [ https://issues.apache.org/jira/browse/BEAM-435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15378247#comment-15378247 ] 

Thomas Groh edited comment on BEAM-435 at 7/14/16 7:59 PM:
-----------------------------------------------------------

Yes. After a GroupByKey, bundle processing is keyed so that per-key state is accessed only while processing the key it belongs to. Writing a limited number of output files therefore requires merging bundles, either in the runner itself or through an intervening transform added to the pipeline.
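
As a rough illustration of the point above, the following sketch models per-key bundles and an intervening merge step in plain Python. It does not use the Beam API: group_by_key, merge_bundles and num_shards are hypothetical names chosen for this example, and round-robin assignment is only one assumed way of merging bundles.

```python
from collections import defaultdict


def group_by_key(pairs):
    """Group (key, value) pairs so that each key ends up in its own bundle."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    # One bundle per key, mimicking keyed bundle processing after a GroupByKey.
    return [(key, values) for key, values in grouped.items()]


def merge_bundles(bundles, num_shards):
    """Merge per-key bundles into at most num_shards shards (hypothetical helper)."""
    shards = [[] for _ in range(num_shards)]
    for index, bundle in enumerate(bundles):
        # Round-robin assignment; a real runner or transform may choose differently.
        shards[index % num_shards].append(bundle)
    # Drop empty shards so the result never exceeds the number of bundles.
    return [shard for shard in shards if shard]


pairs = [("a", 1), ("b", 2), ("a", 3), ("c", 4), ("d", 5)]
bundles = group_by_key(pairs)                  # four bundles, one per key
shards = merge_bundles(bundles, num_shards=2)  # two shards, two keys each
```

Without the merge step, a writer that emits one file per bundle would produce one file per key; with it, the number of files is bounded by num_shards.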


was (Author: tgroh):
Yes. Bundle processing is keyed after a GroupByKey to ensure that per-key state is accessed only by the appropriate key.

> DirectRunner GBK -- task per key?
> ---------------------------------
>
>                 Key: BEAM-435
>                 URL: https://issues.apache.org/jira/browse/BEAM-435
>             Project: Beam
>          Issue Type: Bug
>          Components: runner-direct
>    Affects Versions: 0.1.0-incubating, 0.2.0-incubating
>            Reporter: Daniel Halperin
>            Assignee: Thomas Groh
>
> See [BEAM-434] -- is the direct runner producing a bundle per-key in the GBK output?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)