You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@beam.apache.org by "Kenneth Knowles (JIRA)" <ji...@apache.org> on 2019/03/19 03:24:00 UTC

[jira] [Commented] (BEAM-2620) Revisit GroupByKey.createWithFewKeys

    [ https://issues.apache.org/jira/browse/BEAM-2620?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16795604#comment-16795604 ] 

Kenneth Knowles commented on BEAM-2620:
---------------------------------------

[~lcwik] [~CraigChambersG] doing massive backlog of triaging and found this. For your consideration.

> Revisit GroupByKey.createWithFewKeys
> ------------------------------------
>
>                 Key: BEAM-2620
>                 URL: https://issues.apache.org/jira/browse/BEAM-2620
>             Project: Beam
>          Issue Type: Bug
>          Components: beam-model
>            Reporter: Thomas Groh
>            Priority: Major
>              Labels: portability
>
> This doesn't have a parallel within the GroupByKeyPayload, so there's currently no way to send it through the Runner API.
> The place it will almost always be created is in a {{Combine.globally()}}.
> It's potentially useful as an optimizer hint. The Dataflow Runner in streaming mode disables combiner lifting unless the GroupByKey has the fewKeys property set to true. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)