You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pig.apache.org by "Daniel Dai (JIRA)" <ji...@apache.org> on 2008/08/15 02:07:44 UTC

[jira] Updated: (PIG-362) Limit can not push in front of ForEach with flatten

     [ https://issues.apache.org/jira/browse/PIG-362?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Daniel Dai updated PIG-362:
---------------------------

    Attachment: PIG-362.patch

> Limit can not push in front of ForEach with flatten
> ---------------------------------------------------
>
>                 Key: PIG-362
>                 URL: https://issues.apache.org/jira/browse/PIG-362
>             Project: Pig
>          Issue Type: Bug
>          Components: impl
>    Affects Versions: types_branch
>            Reporter: Daniel Dai
>            Assignee: Daniel Dai
>             Fix For: types_branch
>
>         Attachments: PIG-362.patch
>
>
> Currently logical optimizer will push Limit in front of ForEach with flatten. It is based on the assumption that ForEach with a flatten always increase the number of records. However, this is a false assumption. In the case that there is empty bags inside input tuple, the number of output records can be 0, thus less than input records. 
> We have no way to know whether there is an empty bag in the input at optimization time. So the only solution is not to push Limit in front of ForEach with flatten

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.