Posted to issues@spark.apache.org by "Josh Rosen (JIRA)" <ji...@apache.org> on 2014/08/18 23:01:19 UTC

[jira] [Resolved] (SPARK-3103) Fix UTF8 encoding in PySpark saveAsTextFile().

     [ https://issues.apache.org/jira/browse/SPARK-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Josh Rosen resolved SPARK-3103.
-------------------------------

       Resolution: Fixed
    Fix Version/s: 1.1.0
         Assignee: Davies Liu

> Fix UTF8 encoding in PySpark saveAsTextFile().
> ----------------------------------------------
>
>                 Key: SPARK-3103
>                 URL: https://issues.apache.org/jira/browse/SPARK-3103
>             Project: Spark
>          Issue Type: Bug
>          Components: PySpark
>    Affects Versions: 1.0.2, 1.1.0
>            Reporter: Josh Rosen
>            Assignee: Davies Liu
>              Labels: starter
>             Fix For: 1.1.0
>
>
> This is a follow-up JIRA for https://github.com/apache/spark/pull/1914, where Ahir and Davies identified a bug in Python JsonRDD when trying to encode non-ASCII strings into unicode.
> The same underlying issue affects saveAsTextFile, so we should apply the same fix there, too, and search for any other code that needs to be updated (and maybe refactor this out into a utility function).
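The utility function suggested above could look roughly like the sketch below. It is not the code from the actual fix: the names to_utf8_bytes and encode_records are hypothetical, and it is written for Python 3, where text is str and encoded data is bytes. The idea is to normalize every record to UTF-8 bytes in one place before it is written out, so non-ASCII text does not depend on an implicit ASCII conversion.

```python
def to_utf8_bytes(value):
    """Return value as UTF-8 encoded bytes (hypothetical helper)."""
    if isinstance(value, bytes):
        # Already encoded: pass through unchanged.
        return value
    if not isinstance(value, str):
        # Non-string records (ints, tuples, ...) are converted to text first.
        value = str(value)
    return value.encode("utf-8")


def encode_records(records):
    """Lazily encode each record in an iterable to UTF-8 bytes."""
    for record in records:
        yield to_utf8_bytes(record)
```

A function like encode_records could be applied to each partition's iterator before the text is handed to the writer, which keeps the encoding logic in one shared place rather than repeating it at every call site.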


