You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@beam.apache.org by "Kenneth Knowles (JIRA)" <ji...@apache.org> on 2019/03/19 03:24:00 UTC
[jira] [Commented] (BEAM-2620) Revisit GroupByKey.createWithFewKeys
[ https://issues.apache.org/jira/browse/BEAM-2620?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16795604#comment-16795604 ]
Kenneth Knowles commented on BEAM-2620:
---------------------------------------
[~lcwik] [~CraigChambersG] doing massive backlog of triaging and found this. For your consideration.
> Revisit GroupByKey.createWithFewKeys
> ------------------------------------
>
> Key: BEAM-2620
> URL: https://issues.apache.org/jira/browse/BEAM-2620
> Project: Beam
> Issue Type: Bug
> Components: beam-model
> Reporter: Thomas Groh
> Priority: Major
> Labels: portability
>
> This doesn't have a parallel within the GroupByKeyPayload, so there's currently no way to send it through the Runner API.
> The place it will almost always be created is in a {{Combine.globally()}}.
> It's potentially useful as an optimizer hint. The Dataflow Runner in streaming mode disables combiner lifting unless the GroupByKey has the fewKeys property set to true.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)