Posted to mapreduce-issues@hadoop.apache.org by "Luke Lu (Commented) (JIRA)" <ji...@apache.org> on 2012/01/10 03:37:39 UTC

[jira] [Commented] (MAPREDUCE-232) TextInputFormat should support character encoding settings

    [ https://issues.apache.org/jira/browse/MAPREDUCE-232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13183025#comment-13183025 ] 

Luke Lu commented on MAPREDUCE-232:
-----------------------------------

The latest patch lgtm for the 1.x branch. Thanks, Nomura. Would you mind updating the patch for trunk and the 0.23 and 1.0 branches?
                
> TextInputFormat should support character encoding settings
> ----------------------------------------------------------
>
>                 Key: MAPREDUCE-232
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-232
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>         Environment: Windows XP SP3
>            Reporter: NOMURA Yoshihide
>         Attachments: Hadoop-3481.patch
>
>
> I need to read text files in character encodings other than UTF-8,
> but I think TextInputFormat doesn't support such encodings.
> I suggest that TextInputFormat support an encoding setting like this:
>   conf.set("io.file.defaultEncoding", "MS932");
> I will submit a patch candidate.
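
For illustration only, here is a rough sketch of how a reader might honor the setting proposed in the description. It is not the attached patch and does not use the Hadoop API: java.util.Properties stands in for the job configuration, the class and method names (EncodingSketch, configuredCharset, decodeLine) are hypothetical, and "io.file.defaultEncoding" is simply the key suggested in the issue, not a confirmed Hadoop property. The demo uses ISO-8859-1 because it is available on every JVM, whereas MS932 depends on the JDK's extended charsets.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;
import java.util.Properties;

public class EncodingSketch {
    // Key as proposed in the issue description; an assumption, not a confirmed Hadoop property.
    public static final String ENCODING_KEY = "io.file.defaultEncoding";

    // Resolve the configured charset, falling back to UTF-8 when the key is unset.
    // Charset.forName throws an unchecked exception if the name is unknown or malformed.
    public static Charset configuredCharset(Properties conf) {
        String name = conf.getProperty(ENCODING_KEY);
        return (name == null) ? StandardCharsets.UTF_8 : Charset.forName(name);
    }

    // Decode the raw bytes of one line using the configured charset.
    public static String decodeLine(byte[] raw, Properties conf) {
        return new String(raw, configuredCharset(conf));
    }

    public static void main(String[] args) {
        Properties conf = new Properties(); // stand-in for the job configuration
        conf.setProperty(ENCODING_KEY, "ISO-8859-1");

        // The bytes for "caf" followed by 0xE9, which is e-acute in ISO-8859-1
        // but not a valid sequence on its own in UTF-8.
        byte[] raw = {0x63, 0x61, 0x66, (byte) 0xE9};
        System.out.println(decodeLine(raw, conf));
    }
}
```

The point of the sketch is the fallback: with the key unset, the bytes are treated as UTF-8 as before, so existing jobs would see no change in behavior.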
