You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pig.apache.org by "Alan Gates (JIRA)" <ji...@apache.org> on 2008/02/25 22:18:51 UTC
[jira] Resolved: (PIG-110) Jobs using the combiner and flatten of
group keys produce wrong results
[ https://issues.apache.org/jira/browse/PIG-110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Alan Gates resolved PIG-110.
----------------------------
Resolution: Fixed
Fix Version/s: 0.1.0
Fix checked in as revision 630997. Thanks Johannes for contributing the tests.
> Jobs using the combiner and flatten of group keys produce wrong results
> -----------------------------------------------------------------------
>
> Key: PIG-110
> URL: https://issues.apache.org/jira/browse/PIG-110
> Project: Pig
> Issue Type: Bug
> Components: impl
> Affects Versions: 0.0.0
> Reporter: Alan Gates
> Assignee: Antonio Magnaghi
> Fix For: 0.1.0
>
> Attachments: PIG-110.patch, PIG-110_test.patch, PIG-110_test_v2.patch
>
>
> A job such as the following:
> a = load 'mydata';
> b = group a by ($0, $1);
> c = foreach b generate flatten(group), COUNT($1)
> currently produces wrong results. Instead of returning the count in the 3rd column, it returns a repeat of the value in the second column. If the combiner is forced off, this does not occur, so I assume this is in the combiner logic somewhere.
> This bug was introduced in revision 617338 (the checkin for PIG-32).
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.