You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Michael Armbrust (JIRA)" <ji...@apache.org> on 2015/10/21 00:59:27 UTC

[jira] [Commented] (SPARK-11220) SQL data source gives confusing error message when file not found

    [ https://issues.apache.org/jira/browse/SPARK-11220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14965927#comment-14965927 ] 

Michael Armbrust commented on SPARK-11220:
------------------------------------------

My hunch is this has something to do with file globbing logic somewhere along the way.

> SQL data source gives confusing error message when file not found
> -----------------------------------------------------------------
>
>                 Key: SPARK-11220
>                 URL: https://issues.apache.org/jira/browse/SPARK-11220
>             Project: Spark
>          Issue Type: Bug
>          Components: SQL
>            Reporter: Reynold Xin
>
> See the following
> {code}
> scala> sqlContext.read.json("asdasdf")
> java.io.IOException: No input paths specified in job
> 	at org.apache.hadoop.mapred.FileInputFormat.listStatus(FileInputFormat.java:198)
> 	at org.apache.hadoop.mapred.FileInputFormat.getSplits(FileInputFormat.java:270)
> 	at org.apache.spark.rdd.HadoopRDD.getPartitions(HadoopRDD.scala:201)
> 	at org.apache.spark.rdd.RDD$$anonfun$partitions$2.apply(RDD.scala:239)
> 	at org.apache.spark.rdd.RDD$$anonfun$partitions$2.apply(RDD.scala:237)
> 	at scala.Option.getOrElse(Option.scala:120)
> 	at org.apache.spark.rdd.RDD.partitions(RDD.scala:237)
> 	at org.apache.spark.rdd.MapPartitionsRDD.getPartitions(MapPartitionsRDD.scala:35)
> {code}
> This is not a problem in SparkContext/RDD:
> {code}
> scala> sqlContext.read.json("/asdasdf")
> java.io.IOException: No input paths specified in job
> 	at org.apache.hadoop.mapred.FileInputFormat.listStatus(FileInputFormat.java:198)
> 	at org.apache.hadoop.mapred.FileInputFormat.getSplits(FileInputFormat.java:270)
> 	at org.apache.spark.rdd.HadoopRDD.getPartitions(HadoopRDD.scala:201)
> 	at org.apache.spark.rdd.RDD$$anonfun$partitions$2.apply(RDD.scala:239)
> 	at org.apache.spark.rdd.RDD$$anonfun$partitions$2.apply(RDD.scala:237)
> 	at scala.Option.getOrElse(Option.scala:120)
> 	at org.apache.spark.rdd.RDD.partitions(RDD.scala:237)
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org