Posted to reviews@spark.apache.org by GitBox <gi...@apache.org> on 2019/05/20 14:12:28 UTC

[GitHub] [spark] dvogelbacher opened a new pull request #24650: [SPARK-27778][PySpark] Fix toPandas conversion using arrow for DFs with no partitions

dvogelbacher opened a new pull request #24650: [SPARK-27778][PySpark] Fix toPandas conversion using arrow for DFs with no partitions
URL: https://github.com/apache/spark/pull/24650
 
 
   ## What changes were proposed in this pull request?
   https://github.com/apache/spark/pull/22275 introduced a performance improvement: partitions are sent to Python out of order and then, as a last step, the partition order is sent as well.
   However, if there are no partitions, the partition order is never sent and the Python side fails with an "EOFError".
   This PR fixes this by sending the partition order even when there are no partitions.
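
The failure mode described above can be illustrated with a small, self-contained toy model. This is only a sketch under stated assumptions, not Spark's actual wire format or code: the helper names (`write_int`, `read_int`, `serve`, `collect`), the `-1` end-of-batches marker, and the `always_send_order` flag are all hypothetical and exist only for illustration. The sender writes batches in arrival order followed by the partition order; the receiver always expects the order, so skipping it for zero partitions produces an `EOFError`.

```python
import io
import struct


def write_int(value, stream):
    """Write a 4-byte big-endian signed integer (hypothetical framing)."""
    stream.write(struct.pack("!i", value))


def read_int(stream):
    """Read a 4-byte integer, raising EOFError if the stream ends early."""
    data = stream.read(4)
    if len(data) < 4:
        raise EOFError
    return struct.unpack("!i", data)[0]


def serve(arrival, stream, always_send_order):
    """Toy sender. `arrival` is a list of (partition_index, payload) pairs
    in the order the partitions happened to finish."""
    for _, payload in arrival:
        write_int(len(payload), stream)
        stream.write(payload)
    write_int(-1, stream)  # hypothetical end-of-batches marker
    # The old behaviour in this model: the order is only sent if at least
    # one partition was sent. The fix: send it unconditionally.
    if arrival or always_send_order:
        write_int(len(arrival), stream)
        for index, _ in arrival:
            write_int(index, stream)


def collect(stream):
    """Toy receiver. Reads all batches, then the partition order, and
    returns the payloads rearranged into partition order."""
    batches = []
    while True:
        length = read_int(stream)
        if length == -1:
            break
        batches.append(stream.read(length))
    count = read_int(stream)  # raises EOFError if the order was never sent
    order = [read_int(stream) for _ in range(count)]
    result = [None] * count
    for position, index in enumerate(order):
        result[index] = batches[position]
    return result


def round_trip(arrival, always_send_order):
    """Send `arrival` through an in-memory stream and collect the result."""
    buf = io.BytesIO()
    serve(arrival, buf, always_send_order)
    buf.seek(0)
    return collect(buf)
```

In this model, `round_trip([], always_send_order=False)` raises `EOFError` because the receiver waits for an order that never arrives, while `round_trip([], always_send_order=True)` returns an empty list.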
   
   ## How was this patch tested?
   New unit test added.
   
   
