Posted to dev@pig.apache.org by "Alan Gates (JIRA)" <ji...@apache.org> on 2008/02/25 22:18:51 UTC

[jira] Resolved: (PIG-110) Jobs using the combiner and flatten of group keys produce wrong results

     [ https://issues.apache.org/jira/browse/PIG-110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Alan Gates resolved PIG-110.
----------------------------

       Resolution: Fixed
    Fix Version/s: 0.1.0

Fix checked in as revision 630997. Thanks, Johannes, for contributing the tests.

> Jobs using the combiner and flatten of group keys produce wrong results
> -----------------------------------------------------------------------
>
>                 Key: PIG-110
>                 URL: https://issues.apache.org/jira/browse/PIG-110
>             Project: Pig
>          Issue Type: Bug
>          Components: impl
>    Affects Versions: 0.0.0
>            Reporter: Alan Gates
>            Assignee: Antonio Magnaghi
>             Fix For: 0.1.0
>
>         Attachments: PIG-110.patch, PIG-110_test.patch, PIG-110_test_v2.patch
>
>
> A job such as the following:
> a = load 'mydata';
> b = group a by ($0, $1);
> c = foreach b generate flatten(group), COUNT($1);
> currently produces wrong results. Instead of returning the count in the third column, it repeats the value from the second column. The problem does not occur when the combiner is forced off, so I assume the bug is somewhere in the combiner logic.
> This bug was introduced in revision 617338 (the checkin for PIG-32).
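
To make the expected output shape of the script above concrete, here is a minimal Python sketch. It is not Pig code and not the fix from PIG-110.patch; it only emulates the group-and-count semantics. The sample rows and the function name group_and_count are hypothetical and do not come from the issue.

```python
from collections import Counter

# Hypothetical sample rows standing in for 'mydata'; not taken from the issue.
rows = [
    ("a", "x", 1),
    ("a", "x", 2),
    ("a", "y", 3),
    ("b", "x", 4),
]

def group_and_count(rows):
    """Emulate: group by ($0, $1), then generate flatten(group), COUNT($1)."""
    counts = Counter((r[0], r[1]) for r in rows)
    # flatten(group) spreads the two key fields into columns 1 and 2;
    # the count of grouped rows belongs in column 3.
    return sorted((k0, k1, n) for (k0, k1), n in counts.items())

for row in group_and_count(rows):
    print(row)
```

With the behavior described in the report, the third column would instead repeat the second key field, for example ('a', 'x', 'x') rather than ('a', 'x', 2).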
