You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Li Jin (JIRA)" <ji...@apache.org> on 2017/10/06 15:25:00 UTC

[jira] [Updated] (SPARK-22216) Improving PySpark/Pandas interop

     [ https://issues.apache.org/jira/browse/SPARK-22216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Li Jin updated SPARK-22216:
---------------------------
    Summary: Improving PySpark/Pandas interop  (was: PySpark/Pandas interop umbrella )

> Improving PySpark/Pandas interop
> --------------------------------
>
>                 Key: SPARK-22216
>                 URL: https://issues.apache.org/jira/browse/SPARK-22216
>             Project: Spark
>          Issue Type: Umbrella
>          Components: PySpark
>    Affects Versions: 2.2.0
>            Reporter: Li Jin
>
> This is an umbrella ticket tracking the general effect of improving performance and interoperability between PySpark and Pandas. The core idea is to Apache Arrow as serialization format to reduce the overhead between PySpark and Pandas.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org