Posted to issues@spark.apache.org by "Burak KÖSE (JIRA)" <ji...@apache.org> on 2016/03/25 21:38:25 UTC
[jira] [Commented] (SPARK-14108) calling count() on empty dataframe throws java.util.NoSuchElementException
[ https://issues.apache.org/jira/browse/SPARK-14108?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15212363#comment-15212363 ]
Burak KÖSE commented on SPARK-14108:
------------------------------------
Please provide a test case that reproduces this.
> calling count() on empty dataframe throws java.util.NoSuchElementException
> --------------------------------------------------------------------------
>
> Key: SPARK-14108
> URL: https://issues.apache.org/jira/browse/SPARK-14108
> Project: Spark
> Issue Type: Bug
> Components: SQL
> Affects Versions: 1.6.1
> Environment: Tested in Hadoop 2.7.2 EMR 4.x
> Reporter: Krishna Shekhram
> Priority: Minor
>
> When count() is called on an empty DataFrame, Spark still tries to iterate through the empty iterator and throws java.util.NoSuchElementException.
> Stack trace:
> java.util.NoSuchElementException: next on empty iterator
> at scala.collection.Iterator$$anon$2.next(Iterator.scala:39)
> at scala.collection.Iterator$$anon$2.next(Iterator.scala:37)
> at scala.collection.IndexedSeqLike$Elements.next(IndexedSeqLike.scala:64)
> at scala.collection.IterableLike$class.head(IterableLike.scala:91)
> at scala.collection.mutable.ArrayOps$ofRef.scala$collection$IndexedSeqOptimized$$super$head(ArrayOps.scala:108)
> at scala.collection.IndexedSeqOptimized$class.head(IndexedSeqOptimized.scala:120)
> at scala.collection.mutable.ArrayOps$ofRef.head(ArrayOps.scala:108)
> at org.apache.spark.sql.DataFrame$$anonfun$count$1.apply(DataFrame.scala:1515)
> at org.apache.spark.sql.DataFrame$$anonfun$count$1.apply(DataFrame.scala:1514)
> at org.apache.spark.sql.DataFrame.withCallback(DataFrame.scala:2099)
> at org.apache.spark.sql.DataFrame.count(DataFrame.scala:1514)
> Code snippet:
> This code fails:
> if (this.df != null) {
>     long countOfRows = this.df.count();
> }
> If I add an emptiness check, it works:
> if (this.df != null && !this.df.rdd().isEmpty()) {
>     long countOfRows = this.df.count();
> }
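The stack trace above suggests that count() takes the head of a collection that turned out to be empty. The following is only a minimal plain-Java sketch of that failure mode and of a guarded alternative; it does not use Spark, and the class and method names (EmptyHeadSketch, unsafeFirst, firstOrZero) are hypothetical, chosen for illustration.

```java
import java.util.Collections;
import java.util.Iterator;
import java.util.List;
import java.util.NoSuchElementException;

// Hypothetical illustration only: not Spark code.
class EmptyHeadSketch {

    // Takes the first element without checking for emptiness,
    // which is the pattern the stack trace points at.
    static long unsafeFirst(List<Long> rows) {
        Iterator<Long> it = rows.iterator();
        return it.next(); // throws NoSuchElementException if rows is empty
    }

    // Guarded variant: an empty result is treated as a count of zero.
    static long firstOrZero(List<Long> rows) {
        Iterator<Long> it = rows.iterator();
        return it.hasNext() ? it.next() : 0L;
    }

    public static void main(String[] args) {
        List<Long> empty = Collections.emptyList();
        try {
            unsafeFirst(empty);
        } catch (NoSuchElementException e) {
            System.out.println("unsafe: NoSuchElementException");
        }
        System.out.println("guarded: " + firstOrZero(empty));
    }
}
```

The reporter's workaround applies the same idea from the caller's side by checking isEmpty() before calling count().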