Posted to commits@beam.apache.org by "Kenneth Knowles (JIRA)" <ji...@apache.org> on 2017/08/21 21:58:00 UTC
[jira] [Updated] (BEAM-2620) Revisit GroupByKey.createWithFewKeys
[ https://issues.apache.org/jira/browse/BEAM-2620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Kenneth Knowles updated BEAM-2620:
----------------------------------
Component/s: (was: beam-model-runner-api)
             (was: sdk-java-core)
> Revisit GroupByKey.createWithFewKeys
> ------------------------------------
>
> Key: BEAM-2620
> URL: https://issues.apache.org/jira/browse/BEAM-2620
> Project: Beam
> Issue Type: Bug
> Components: beam-model
> Reporter: Thomas Groh
>
> {{GroupByKey.createWithFewKeys}} has no counterpart in the GroupByKeyPayload, so there is currently no way to send it through the Runner API.
> It is almost always created inside a {{Combine.globally()}}.
> It's potentially useful as an optimizer hint. The Dataflow Runner in streaming mode disables combiner lifting unless the GroupByKey has the fewKeys property set to true.
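
For readers unfamiliar with combiner lifting, the following is a minimal sketch in plain Java. It does not use the Beam SDK or the Dataflow Runner. It assumes that lifting means partially combining values within each bundle before the shuffle, and it models a global combine as a combine over a single key. The class `CombinerLiftingSketch`, its `Result` type, and the key name `"all"` are hypothetical names chosen for illustration only. It requires Java 9 or later for `List.of` and `Map.entry`.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class CombinerLiftingSketch {

    /** Final per-key sums plus the number of records that crossed the simulated shuffle. */
    static final class Result {
        final Map<String, Long> sums;
        final int shuffledRecords;

        Result(Map<String, Long> sums, int shuffledRecords) {
            this.sums = sums;
            this.shuffledRecords = shuffledRecords;
        }
    }

    /** No lifting: every input record is shuffled to its key, then summed. */
    static Result withoutLifting(List<List<Map.Entry<String, Long>>> bundles) {
        Map<String, Long> sums = new HashMap<>();
        int shuffled = 0;
        for (List<Map.Entry<String, Long>> bundle : bundles) {
            for (Map.Entry<String, Long> kv : bundle) {
                shuffled++; // each raw record crosses the shuffle
                sums.merge(kv.getKey(), kv.getValue(), Long::sum);
            }
        }
        return new Result(sums, shuffled);
    }

    /** Lifting: each bundle is pre-combined per key, so only partial sums are shuffled. */
    static Result withLifting(List<List<Map.Entry<String, Long>>> bundles) {
        Map<String, Long> sums = new HashMap<>();
        int shuffled = 0;
        for (List<Map.Entry<String, Long>> bundle : bundles) {
            Map<String, Long> partial = new HashMap<>();
            for (Map.Entry<String, Long> kv : bundle) {
                partial.merge(kv.getKey(), kv.getValue(), Long::sum);
            }
            shuffled += partial.size(); // one record per key per bundle
            for (Map.Entry<String, Long> e : partial.entrySet()) {
                sums.merge(e.getKey(), e.getValue(), Long::sum);
            }
        }
        return new Result(sums, shuffled);
    }

    public static void main(String[] args) {
        // A single key stands in for the one key of a global combine (an assumption of this sketch).
        List<List<Map.Entry<String, Long>>> bundles = List.of(
            List.of(Map.entry("all", 1L), Map.entry("all", 2L), Map.entry("all", 3L)),
            List.of(Map.entry("all", 4L), Map.entry("all", 5L)));

        Result unlifted = withoutLifting(bundles);
        Result lifted = withLifting(bundles);
        System.out.println("unlifted: " + unlifted.sums + " shuffled=" + unlifted.shuffledRecords);
        System.out.println("lifted:   " + lifted.sums + " shuffled=" + lifted.shuffledRecords);
    }
}
```

Both variants produce the same sums. With few distinct keys, the lifted variant shuffles at most one record per key per bundle, which is the situation a few-keys hint describes.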
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)