You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pig.apache.org by "Cheolsoo Park (JIRA)" <ji...@apache.org> on 2014/03/25 23:27:15 UTC

[jira] [Created] (PIG-3835) Optimize union followed by store

Cheolsoo Park created PIG-3835:
----------------------------------

             Summary: Optimize union followed by store
                 Key: PIG-3835
                 URL: https://issues.apache.org/jira/browse/PIG-3835
             Project: Pig
          Issue Type: Sub-task
          Components: tez
    Affects Versions: tez-branch
            Reporter: Cheolsoo Park
             Fix For: tez-branch


PIG-3742 implements union using VertexGroup. Currently, union is a blocking operator meaning that a new vertex is added for its succeeding operators.

But if there is only one store in the succeeding vertex, MROutput could be directly attached to VertexGroup instead of adding a new vertex for it. Then, each union source vertex will write directly to the destination, and therefore, it will be faster.



--
This message was sent by Atlassian JIRA
(v6.2#6252)