You are viewing a plain text version of this content. The canonical link for it is here.

Posted to common-dev@hadoop.apache.org by "Owen O'Malley (JIRA)" <ji...@apache.org> on 2008/10/20 07:36:44 UTC

[jira] Updated: (HADOOP-4276) The mapred.*ID classes are inefficient for hashCode and serialization

     [ https://issues.apache.org/jira/browse/HADOOP-4276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Owen O'Malley updated HADOOP-4276:
----------------------------------

    Attachment: h4276.patch

This patch:
  1. Removes the string generation during hashing of id objects.
  2. Reuses the id objects during readFields.
  3. Defines a protected field for SEPARATOR and removes UNDERLINE.
  4. Replace the toStringWOPrefix methods with addId that will reuse the same StringBuilder, which is more efficient.
  5. Store the jtIdentifier as Text so that it doesn't need to be encoded for sending across RPC.

> The mapred.*ID classes are inefficient for hashCode and serialization
> ---------------------------------------------------------------------
>
>                 Key: HADOOP-4276
>                 URL: https://issues.apache.org/jira/browse/HADOOP-4276
>             Project: Hadoop Core
>          Issue Type: Improvement
>            Reporter: Owen O'Malley
>             Fix For: 0.20.0
>
>         Attachments: h4276.patch
>
>
> Currently the ID classes call toString and hash the resulting string rather than computing a hash directly.
> The ID classes also create new instances of the higher level object in readFields (via read) rather than re-using the object via readFields.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.