You are viewing a plain text version of this content. The canonical link for it is here.

Posted to issues@spark.apache.org by "Xiangrui Meng (JIRA)" <ji...@apache.org> on 2015/01/13 21:51:34 UTC

[jira] [Resolved] (SPARK-5223) Use pickle instead of MapConvert and ListConvert in MLlib Python API

     [ https://issues.apache.org/jira/browse/SPARK-5223?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Xiangrui Meng resolved SPARK-5223.
----------------------------------
       Resolution: Fixed
    Fix Version/s: 1.2.1
                   1.3.0

Issue resolved by pull request 4023
[https://github.com/apache/spark/pull/4023]

> Use pickle instead of MapConvert and ListConvert in MLlib Python API
> --------------------------------------------------------------------
>
>                 Key: SPARK-5223
>                 URL: https://issues.apache.org/jira/browse/SPARK-5223
>             Project: Spark
>          Issue Type: Bug
>          Components: MLlib, PySpark
>            Reporter: Davies Liu
>            Priority: Critical
>             Fix For: 1.3.0, 1.2.1
>
>
> It will introduce problems if the object in dict/list/tuple can not support by py4j, such as Vector.
> Also, pickle may have better performance for larger object (less RPC).
> In some cases that the object in dict/list can not be pickled (such as JavaObject), we should still use MapConvert/ListConvert.
> discussion: http://apache-spark-developers-list.1001551.n3.nabble.com/Python-to-Java-object-conversion-of-numpy-array-td10065.html



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org