Posted to mapreduce-issues@hadoop.apache.org by "Luke Lu (Commented) (JIRA)" <ji...@apache.org> on 2012/01/10 03:37:39 UTC
[jira] [Commented] (MAPREDUCE-232) TextInputFormat should support character encoding settings
[ https://issues.apache.org/jira/browse/MAPREDUCE-232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13183025#comment-13183025 ]
Luke Lu commented on MAPREDUCE-232:
-----------------------------------
The latest patch lgtm for the 1.x branch. Thanks Nomura. Would you mind updating the patch for trunk and the 0.23 and 1.0 branches?
> TextInputFormat should support character encoding settings
> ----------------------------------------------------------
>
> Key: MAPREDUCE-232
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-232
> Project: Hadoop Map/Reduce
> Issue Type: Improvement
> Environment: Windows XP SP3
> Reporter: NOMURA Yoshihide
> Attachments: Hadoop-3481.patch
>
>
> I need to read text files in character encodings other than UTF-8,
> but I think TextInputFormat doesn't support such character encodings.
> I suggest that TextInputFormat support an encoding setting like this:
> conf.set("io.file.defaultEncoding", "MS932");
> I will submit a patch candidate.
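For illustration only, the setting proposed in the description could drive line decoding roughly as sketched below. This is not the attached patch and not Hadoop's API: the class EncodingSketch and its methods are hypothetical, java.util.Properties stands in for the job configuration, and the UTF-8 fallback when the key is unset is an assumption. Only the property name io.file.defaultEncoding comes from the issue text.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;
import java.util.Properties;

// Hypothetical helper, not part of Hadoop or of the attached patch.
final class EncodingSketch {
    // Property name taken from the issue description.
    static final String ENCODING_KEY = "io.file.defaultEncoding";

    // Looks up the configured charset; falls back to UTF-8 when the key is unset
    // (the fallback is an assumption of this sketch).
    static Charset resolveCharset(Properties conf) {
        String name = conf.getProperty(ENCODING_KEY);
        return (name == null) ? StandardCharsets.UTF_8 : Charset.forName(name);
    }

    // Decodes the first `length` bytes of one raw line using the configured charset.
    static String decodeLine(byte[] buffer, int length, Properties conf) {
        return new String(buffer, 0, length, resolveCharset(conf));
    }
}
```

With the key set to "MS932", decodeLine would interpret the bytes as Windows Japanese text, provided the running JRE ships that charset; Charset.forName throws UnsupportedCharsetException otherwise.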
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira