Posted to issues@spark.apache.org by "Hyukjin Kwon (Jira)" <ji...@apache.org> on 2021/12/23 00:17:00 UTC
[jira] [Resolved] (SPARK-34544) pyspark toPandas() should return pd.DataFrame
[ https://issues.apache.org/jira/browse/SPARK-34544?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Hyukjin Kwon resolved SPARK-34544.
----------------------------------
Fix Version/s: 3.3.0
Resolution: Fixed
Fixed in https://github.com/apache/spark/pull/34927
> pyspark toPandas() should return pd.DataFrame
> ---------------------------------------------
>
> Key: SPARK-34544
> URL: https://issues.apache.org/jira/browse/SPARK-34544
> Project: Spark
> Issue Type: Sub-task
> Components: PySpark
> Affects Versions: 3.1.1
> Reporter: Rafal Wojdyla
> Assignee: Maciej Szymkiewicz
> Priority: Major
> Fix For: 3.3.0
>
>
> Right now {{toPandas()}} returns {{DataFrameLike}}, which is an incomplete "view" of pandas {{DataFrame}}. As a result, mypy reports that certain pandas methods are not present in {{DataFrameLike}}, even though they are valid methods on pandas {{DataFrame}}, which is the actual type of the object. Working around this requires type: ignore comments or asserts.
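
The problem described above can be sketched without Spark or pandas installed. This is only an illustration under stated assumptions: FrameLike, RealFrame, and to_frame are hypothetical stand-ins for DataFrameLike, pandas DataFrame, and toPandas() respectively, not the actual PySpark stubs. It shows how a return annotation narrower than the runtime type forces an assert or a cast before a type checker accepts a call that is valid at runtime.

```python
from typing import Dict, List, Protocol, cast


class FrameLike(Protocol):
    # Hypothetical narrow "view" type: declares only part of the real interface.
    def row_count(self) -> int: ...


class RealFrame:
    # Hypothetical stand-in for the actual runtime type.
    def __init__(self, rows: List[Dict[str, int]]) -> None:
        self.rows = rows

    def row_count(self) -> int:
        return len(self.rows)

    def column_names(self) -> List[str]:
        # Present on the real object, but not declared on FrameLike.
        return sorted(self.rows[0]) if self.rows else []


def to_frame(rows: List[Dict[str, int]]) -> FrameLike:
    # Annotated with the narrow type, although a RealFrame is returned.
    return RealFrame(rows)


frame = to_frame([{"b": 1, "a": 2}])

# frame.column_names()
# Works at runtime, but a static type checker is expected to reject it,
# because FrameLike does not declare column_names.

# Workaround 1: narrow the type with an assert.
assert isinstance(frame, RealFrame)
names_via_assert = frame.column_names()

# Workaround 2: cast to the real type.
names_via_cast = cast(RealFrame, to_frame([{"b": 1, "a": 2}])).column_names()
```

If to_frame were annotated to return RealFrame directly, neither workaround would be needed, which is the kind of change the issue title asks for.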
--
This message was sent by Atlassian Jira
(v8.20.1#820001)