Posted to issues@spark.apache.org by "Uri Goren (JIRA)" <ji...@apache.org> on 2018/04/01 11:25:00 UTC
[jira] [Created] (SPARK-23840) PySpark error when converting a DataFrame to rdd
Uri Goren created SPARK-23840:
---------------------------------
Summary: PySpark error when converting a DataFrame to rdd
Key: SPARK-23840
URL: https://issues.apache.org/jira/browse/SPARK-23840
Project: Spark
Issue Type: Bug
Components: PySpark
Affects Versions: 2.3.0
Reporter: Uri Goren
I am running code in the `pyspark` shell on an `emr` cluster and am encountering an error I have never seen before.
This line works:
spark.read.parquet(s3_input).take(99)
But this line raises an exception:
spark.read.parquet(s3_input).rdd.take(99)
It fails with:
> TypeError: 'int' object is not iterable
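
Only the final error line is quoted above, so the full traceback would help narrow down where the failure occurs. A minimal sketch of how it could be captured follows. `capture_traceback` is a hypothetical helper written for illustration, not part of PySpark, and the `list(5)` call is only a plain-Python stand-in that raises the same kind of `TypeError`; it is not claimed to be the cause of this issue.

```python
import traceback


def capture_traceback(action):
    """Run a zero-argument callable and return (result, traceback_text).

    Hypothetical helper for bug reports: traceback_text is None on success,
    and result is None when the callable raises.
    """
    try:
        return action(), None
    except Exception:
        return None, traceback.format_exc()


# Plain-Python stand-in: iterating over an int raises the same kind of error.
result, tb = capture_traceback(lambda: list(5))
print(tb)

# In the pyspark shell, where `spark` and `s3_input` are already defined,
# the failing line from this report could be wrapped the same way:
# result, tb = capture_traceback(
#     lambda: spark.read.parquet(s3_input).rdd.take(99))
```

The returned text includes every stack frame, which can then be pasted into the issue in full.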