You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@beam.apache.org by "Robert Bradshaw (JIRA)" <ji...@apache.org> on 2017/06/16 21:47:00 UTC
[jira] [Updated] (BEAM-2453) The Java DirectRunner should exercise
all parts of a CombineFn
[ https://issues.apache.org/jira/browse/BEAM-2453?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Robert Bradshaw updated BEAM-2453:
----------------------------------
Summary: The Java DirectRunner should exercise all parts of a CombineFn (was: The DirectRunner should exercise all parts of a CombineFn)
> The Java DirectRunner should exercise all parts of a CombineFn
> --------------------------------------------------------------
>
> Key: BEAM-2453
> URL: https://issues.apache.org/jira/browse/BEAM-2453
> Project: Beam
> Issue Type: Bug
> Components: runner-direct
> Reporter: Thomas Groh
> Assignee: Thomas Groh
>
> Specifically it should:
> Create some number of accumulators; add elements to these accumulators, merge the created accumulators, and extract the output.
> This can be performed by replacing the {{Combine.perKey}} composite transform with a multi-step combine {{CombineBundles -> GroupByKey -> MergeAccumulators}}
> Where {{CombineBundles}} is a {{ParDo}} which takes input {{KV<K, InputT>}} and produces {{KV<K, AccumT>}}, outputting in {{FinishBundle}} (this can only be performed if the Combine takes no side inputs or does not have merging windows). {{MergeAccumulators}} takes in {{KV<K, Iterable<AccumT>>}} and produces {{KV<K, OutputT>}} by merging all of the accumulators and extracting the output.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)