Posted to common-dev@hadoop.apache.org by "ZhuGuanyin (JIRA)" <ji...@apache.org> on 2009/05/07 08:50:30 UTC

[jira] Updated: (HADOOP-5779) KeyFieldBasedPartitioner would lose data if the specified field does not exist, and it should be encoding-free rather than supporting only UTF-8

     [ https://issues.apache.org/jira/browse/HADOOP-5779?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

ZhuGuanyin updated HADOOP-5779:
-------------------------------

        Fix Version/s:     (was: 0.20.1)
                       0.21.0
          Description: 
1) Currently, KeyFieldBasedPartitioner only supports UTF-8 encoded records; we should use the Text or BytesWritable data types.

2) When using KeyFieldBasedPartitioner, if the record doesn't contain the specified field, endChar would equal array.length, which throws an ArrayIndexOutOfBoundsException and loses that record!

  was:
1) Currently, KeyFieldBasedPartitioner only supports UTF-8 encoded records; we should use the Text or BytesWritable data types.
2) When using KeyFieldBasedPartitioner, if the record doesn't contain the specified field, endChar would equal array.length, which throws an ArrayIndexOutOfBoundsException and loses that record!

    Affects Version/s: 0.20.0
              Summary: KeyFieldBasedPartitioner would lose data if the specified field does not exist, and it should be encoding-free rather than supporting only UTF-8  (was: KeyFieldBasedPartitioner should be encoding-free and handle the ArrayIndexOutOfBounds exception!)

> KeyFieldBasedPartitioner would lose data if the specified field does not exist, and it should be encoding-free rather than supporting only UTF-8
> ------------------------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-5779
>                 URL: https://issues.apache.org/jira/browse/HADOOP-5779
>             Project: Hadoop Core
>          Issue Type: Bug
>          Components: mapred
>    Affects Versions: 0.20.0
>            Reporter: ZhuGuanyin
>             Fix For: 0.21.0
>
>         Attachments: encode-free-KeyFieldBasedPartitioner.patch
>
>
> 1) Currently, KeyFieldBasedPartitioner only supports UTF-8 encoded records; we should use the Text or BytesWritable data types.
> 2) When using KeyFieldBasedPartitioner, if the record doesn't contain the specified field, endChar would equal array.length, which throws an ArrayIndexOutOfBoundsException and loses that record!
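
The two points above can be illustrated with a small standalone sketch. This is not the attached patch and does not use Hadoop's classes; the class name KeyFieldPartitionSketch, its methods, and the fallback policy (hash the whole record when the requested field is missing) are assumptions made for illustration only. It partitions on raw bytes, so no character encoding is assumed, and it clamps the end offset so that a missing field cannot index past the end of the array.

```java
public class KeyFieldPartitionSketch {

    /** Hashes the bytes in [start, end] (inclusive), clamped to the array bounds. */
    public static int hashBytes(byte[] b, int start, int end, int seed) {
        int first = Math.max(start, 0);
        int last = Math.min(end, b.length - 1); // never read at b.length
        int h = seed;
        for (int i = first; i <= last; i++) {
            h = 31 * h + b[i];
        }
        return h;
    }

    /**
     * Finds the inclusive byte range of the 1-based field fieldNo.
     * Returns null when the record has fewer fields than requested.
     */
    public static int[] fieldBounds(byte[] rec, byte sep, int fieldNo) {
        int start = 0;
        for (int f = 1; f < fieldNo; f++) {
            int i = start;
            while (i < rec.length && rec[i] != sep) {
                i++;
            }
            if (i >= rec.length) {
                return null; // ran out of separators: field does not exist
            }
            start = i + 1;
        }
        int end = start;
        while (end < rec.length && rec[end] != sep) {
            end++;
        }
        return new int[] { start, end - 1 };
    }

    /** Picks a partition from the raw bytes of one field; numPartitions must be > 0. */
    public static int partition(byte[] rec, byte sep, int fieldNo, int numPartitions) {
        int[] bounds = fieldBounds(rec, sep, fieldNo);
        // Assumed policy: if the field is missing, hash the whole record
        // instead of throwing, so the record is not lost.
        int h = (bounds == null)
                ? hashBytes(rec, 0, rec.length - 1, 0)
                : hashBytes(rec, bounds[0], bounds[1], 0);
        return (h & Integer.MAX_VALUE) % numPartitions;
    }

    public static void main(String[] args) {
        byte sep = (byte) '\t';
        // Key bytes that are not valid UTF-8, followed by a tab and a value.
        byte[] rec = { (byte) 0xB0, (byte) 0xA1, '\t', 'v' };
        System.out.println("field 1 -> partition " + partition(rec, sep, 1, 4));
        // Field 3 does not exist; the sketch falls back rather than throwing.
        System.out.println("field 3 -> partition " + partition(rec, sep, 3, 4));
    }
}
```

Because the hash is computed over bytes, the same code path would serve keys held as Text or BytesWritable, provided the caller passes in the underlying byte array and its valid length.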

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.