Posted to issues@spark.apache.org by "Uri Goren (JIRA)" <ji...@apache.org> on 2018/04/01 11:25:00 UTC

[jira] [Created] (SPARK-23840) PySpark error when converting a DataFrame to rdd

Uri Goren created SPARK-23840:
---------------------------------

             Summary: PySpark error when converting a DataFrame to rdd
                 Key: SPARK-23840
                 URL: https://issues.apache.org/jira/browse/SPARK-23840
             Project: Spark
          Issue Type: Bug
          Components: PySpark
    Affects Versions: 2.3.0
            Reporter: Uri Goren


I am running code in the `pyspark` shell on an EMR cluster and hitting an error I have never seen before.


This line works:

spark.read.parquet(s3_input).take(99)

While this line raises an exception:

spark.read.parquet(s3_input).rdd.take(99)

The exception is:

> TypeError: 'int' object is not iterable
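
Only the final line of the error is shown above. For anyone trying to reproduce this, here is a minimal sketch of one way to capture the full traceback so it can be attached to the issue. `capture_traceback` is a hypothetical helper, not part of PySpark, and the `iter(5)` call is only a stand-in that happens to raise the same message in plain Python; it says nothing about the actual cause inside Spark.

```python
import traceback


def capture_traceback(fn):
    """Call fn() and return the formatted traceback, or None if it succeeds."""
    try:
        fn()
    except Exception:
        return traceback.format_exc()
    return None


# Stand-in for the failing call (assumption: used here only so the sketch
# runs without a Spark session). In the pyspark shell one would pass
#   lambda: spark.read.parquet(s3_input).rdd.take(99)
# instead, where spark and s3_input are the names used in the report.
report = capture_traceback(lambda: iter(5))
print(report)
```

The returned string includes every stack frame, which should show whether the error is raised in the Python-side `take` path or elsewhere.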



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
