Posted to dev@crunch.apache.org by "Chao Shi (JIRA)" <ji...@apache.org> on 2014/03/22 03:59:42 UTC

[jira] [Created] (CRUNCH-368) TupleWritable.Comparator

Chao Shi created CRUNCH-368:
-------------------------------

             Summary: TupleWritable.Comparator
                 Key: CRUNCH-368
                 URL: https://issues.apache.org/jira/browse/CRUNCH-368
             Project: Crunch
          Issue Type: Improvement
          Components: Core
    Affects Versions: 0.8.3
            Reporter: Chao Shi
            Assignee: Chao Shi


This patch should improve comparison performance on TupleWritables by avoiding the deserialization overhead. It is particularly useful when the input tuples are large, e.g. when they contain long strings.

Please note that this changes the binary format of TupleWritable: it adds a var-int indicating the size of the field after each type code. This is needed because of a limitation of the Writable system: we do not know the size of each field until we have fully deserialized it.
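
To illustrate the idea, here is a minimal, self-contained sketch of comparing length-prefixed fields directly on their serialized bytes. It is not the code from the patch. The class name, the field-count header, the type code value, and the simple 7-bit var-int encoding are all assumptions made for the example, and they do not match the actual TupleWritable wire format or Hadoop's own var-int encoding.

```java
import java.io.ByteArrayOutputStream;
import java.nio.charset.StandardCharsets;

// Hypothetical sketch: not the real TupleWritable format.
public class LengthPrefixedTupleSketch {

    // Assumed layout: [field count] then, per field,
    // [type code][var-int payload length][payload bytes].
    static byte[] encode(String... fields) {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        writeVarInt(out, fields.length);
        for (String f : fields) {
            byte[] payload = f.getBytes(StandardCharsets.UTF_8);
            out.write(1); // made-up type code meaning "string"
            writeVarInt(out, payload.length);
            out.write(payload, 0, payload.length);
        }
        return out.toByteArray();
    }

    // Simple 7-bits-per-byte var-int (an assumption, not Hadoop's encoding).
    static void writeVarInt(ByteArrayOutputStream out, int v) {
        while ((v & ~0x7F) != 0) {
            out.write((v & 0x7F) | 0x80);
            v >>>= 7;
        }
        out.write(v);
    }

    // Reads a var-int at pos[0] and advances pos[0] past it.
    static int readVarInt(byte[] buf, int[] pos) {
        int result = 0;
        int shift = 0;
        while (true) {
            int b = buf[pos[0]++] & 0xFF;
            result |= (b & 0x7F) << shift;
            if ((b & 0x80) == 0) {
                return result;
            }
            shift += 7;
        }
    }

    // Compares two encoded tuples field by field without building any objects.
    static int compareRaw(byte[] a, byte[] b) {
        int[] pa = {0};
        int[] pb = {0};
        int na = readVarInt(a, pa);
        int nb = readVarInt(b, pb);
        int n = Math.min(na, nb);
        for (int i = 0; i < n; i++) {
            int ta = a[pa[0]++] & 0xFF;
            int tb = b[pb[0]++] & 0xFF;
            if (ta != tb) {
                return Integer.compare(ta, tb);
            }
            // The length prefix tells us where each field ends.
            int la = readVarInt(a, pa);
            int lb = readVarInt(b, pb);
            int c = compareBytes(a, pa[0], la, b, pb[0], lb);
            if (c != 0) {
                return c;
            }
            pa[0] += la;
            pb[0] += lb;
        }
        return Integer.compare(na, nb);
    }

    // Unsigned lexicographic comparison of two byte ranges.
    static int compareBytes(byte[] a, int aOff, int aLen, byte[] b, int bOff, int bLen) {
        int n = Math.min(aLen, bLen);
        for (int i = 0; i < n; i++) {
            int x = a[aOff + i] & 0xFF;
            int y = b[bOff + i] & 0xFF;
            if (x != y) {
                return Integer.compare(x, y);
            }
        }
        return Integer.compare(aLen, bLen);
    }
}
```

Without the length prefix, the comparator would have to deserialize a field just to find where the next one starts. With it, the comparator can skip to the next field once the current one compares equal.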



--
This message was sent by Atlassian JIRA
(v6.2#6252)