Posted to common-dev@hadoop.apache.org by "ZhuGuanyin (JIRA)" <ji...@apache.org> on 2009/05/18 03:53:45 UTC
[jira] Issue Comment Edited: (HADOOP-5779) KeyFieldBasedPartitioner
would lose data if the specified field does not exist, and it should be
encoding-free, not only support UTF-8
[ https://issues.apache.org/jira/browse/HADOOP-5779?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12708734#action_12708734 ]
ZhuGuanyin edited comment on HADOOP-5779 at 5/17/09 6:51 PM:
-------------------------------------------------------------
create patch using svn diff instead of diff
was (Author: buptzhugy):
create patsh using svn diff instead of diff
> KeyFieldBasedPartitioner would lose data if the specified field does not exist, and it should be encoding-free, not only support UTF-8
> --------------------------------------------------------------------------------------------------------------------------------------
>
> Key: HADOOP-5779
> URL: https://issues.apache.org/jira/browse/HADOOP-5779
> Project: Hadoop Core
> Issue Type: Bug
> Components: mapred
> Affects Versions: 0.20.0
> Reporter: ZhuGuanyin
> Fix For: 0.21.0
>
> Attachments: encode-free-KeyFieldBasedPartitioner-v1.patch, encode-free-KeyFieldBasedPartitioner.patch
>
>
> 1) Currently, KeyFieldBasedPartitioner only supports UTF-8 encoded records; it should work on the raw bytes of Text or BytesWritable data types instead.
> 2) When using KeyFieldBasedPartitioner, if the record doesn't contain the specified field, endChar would be equal to array.length, which throws an ArrayIndexOutOfBoundsException, and that record is lost!
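
The two points above can be illustrated with a minimal sketch. It is not the attached patch and not the actual KeyFieldBasedPartitioner code; the class and method names (SafeKeyFieldHash, hashRange, partition) are hypothetical. It assumes the partitioner hashes a byte range of the key, so it works on raw bytes without decoding them as UTF-8, and it clamps the end index so that a missing field cannot push it past the end of the array.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper, for illustration only; not part of Hadoop.
class SafeKeyFieldHash {

    // Hash bytes[start..end] (inclusive) without decoding them as text.
    // The range is clamped to the array bounds, so end == b.length
    // (the "field not found" case described above) cannot overflow.
    static int hashRange(byte[] b, int start, int end, int hash) {
        int first = Math.max(start, 0);
        int last = Math.min(end, b.length - 1);
        for (int i = first; i <= last; i++) {
            hash = 31 * hash + b[i];
        }
        return hash;
    }

    // Map the hash of the selected byte range to a partition number.
    static int partition(byte[] key, int start, int end, int numPartitions) {
        int hash = hashRange(key, start, end, 0);
        return (hash & Integer.MAX_VALUE) % numPartitions;
    }

    public static void main(String[] args) {
        byte[] key = "a\tb".getBytes(StandardCharsets.UTF_8);
        // end == key.length mimics the failing case; it is clamped here.
        System.out.println(partition(key, 0, key.length, 4));
        // Bytes that are not valid UTF-8 are hashed like any others.
        byte[] raw = {(byte) 0xff, (byte) 0xfe, 0x09, 0x01};
        System.out.println(partition(raw, 0, 1, 4));
    }
}
```

Clamping is only one possible policy; a real fix might instead treat a missing field as an empty key range or fall back to hashing the whole key.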