You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pig.apache.org by "Pradeep Kamath (JIRA)" <ji...@apache.org> on 2008/10/09 02:33:44 UTC
[jira] Updated: (PIG-465) PERFORMANCE: removing keys from the value
[ https://issues.apache.org/jira/browse/PIG-465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Pradeep Kamath updated PIG-465:
-------------------------------
Assignee: Pradeep Kamath
Status: Patch Available (was: Open)
Attached patch
> PERFORMANCE: removing keys from the value
> -----------------------------------------
>
> Key: PIG-465
> URL: https://issues.apache.org/jira/browse/PIG-465
> Project: Pig
> Issue Type: Improvement
> Affects Versions: types_branch
> Reporter: Olga Natkovich
> Assignee: Pradeep Kamath
> Fix For: types_branch
>
> Attachments: PIG-465.patch
>
>
> Currently, reducers get the key data twice: once in the key and once in the value. If grouping key is the large part of the value, this causes large data replication and performance loss.
> The key should not be sent as part of the value. Instead, a metadata should used to assist in reconstructing the row from the key and the remaining data
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.