You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-issues@hadoop.apache.org by "Allen Wittenauer (JIRA)" <ji...@apache.org> on 2014/09/04 01:46:52 UTC

[jira] [Updated] (HADOOP-9801) Configuration#writeXml uses platform defaulting encoding, which may mishandle multi-byte characters.

     [ https://issues.apache.org/jira/browse/HADOOP-9801?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Allen Wittenauer updated HADOOP-9801:
-------------------------------------
    Fix Version/s:     (was: 3.0.0)

> Configuration#writeXml uses platform defaulting encoding, which may mishandle multi-byte characters.
> ----------------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-9801
>                 URL: https://issues.apache.org/jira/browse/HADOOP-9801
>             Project: Hadoop Common
>          Issue Type: Bug
>          Components: conf
>    Affects Versions: 3.0.0, 1-win, 1.3.0, 2.1.1-beta
>            Reporter: Chris Nauroth
>            Assignee: Chris Nauroth
>             Fix For: 1-win, 1.3.0, 2.1.1-beta
>
>         Attachments: HADOOP-9801-branch-1.1.patch, HADOOP-9801-branch-1.2.patch, HADOOP-9801-trunk.1.patch, HADOOP-9801-trunk.2.patch
>
>
> The overload of {{Configuration#writeXml}} that accepts an {{OutputStream}} does not set encoding explicitly, so it chooses the platform default encoding.  Depending on the platform's default encoding, this can cause incorrect output data when encoding multi-byte characters.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)