Posted to issues@spark.apache.org by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/06/21 06:31:02 UTC

[jira] [Updated] (SPARK-22216) Improving PySpark/Pandas interoperability

     [ https://issues.apache.org/jira/browse/SPARK-22216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Hyukjin Kwon updated SPARK-22216:
---------------------------------
    Labels:   (was: bulk-closed)

> Improving PySpark/Pandas interoperability
> -----------------------------------------
>
>                 Key: SPARK-22216
>                 URL: https://issues.apache.org/jira/browse/SPARK-22216
>             Project: Spark
>          Issue Type: Epic
>          Components: PySpark
>    Affects Versions: 2.2.0
>            Reporter: Li Jin
>            Assignee: Li Jin
>            Priority: Major
>
> This is an umbrella ticket tracking the general effort to improve performance and interoperability between PySpark and Pandas. The core idea is to use Apache Arrow as the serialization format to reduce the overhead between PySpark and Pandas.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org