You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pig.apache.org by "Alan Gates (JIRA)" <ji...@apache.org> on 2010/06/01 18:23:37 UTC

[jira] Commented: (PIG-1426) Change the size of Tuple from Int to VInt when Serialize Tuple

    [ https://issues.apache.org/jira/browse/PIG-1426?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12874092#action_12874092 ] 

Alan Gates commented on PIG-1426:
---------------------------------

I think that's the right approach.  We'll need to make sure Zebra and BinStorage have a way to ask for the old TupleFactory.  

> Change the size of Tuple from Int to VInt when Serialize Tuple
> --------------------------------------------------------------
>
>                 Key: PIG-1426
>                 URL: https://issues.apache.org/jira/browse/PIG-1426
>             Project: Pig
>          Issue Type: Improvement
>          Components: data
>    Affects Versions: 0.8.0
>            Reporter: Jeff Zhang
>            Assignee: Jeff Zhang
>             Fix For: 0.8.0
>
>         Attachments: PIG_1426.patch
>
>
> Most of  time,  the size of tuple is not very large, one byte is enough for store the size of tuple. So I suggest to use VInt instead of Int for the size of tuple when doing Serialization. Because the key type of map output is Tuple, so this can reduce the amount of data transferred from mapper to reducer. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.