You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@spark.apache.org by kmader <ke...@gmail.com> on 2014/07/23 17:25:32 UTC
Re: wholeTextFiles not working with HDFS
I have the same issue
val a = sc.textFile("s3n://MyBucket/MyFolder/*.tif")
a.first
works perfectly fine, but
val d = sc.wholeTextFiles("s3n://MyBucket/MyFolder/*.tif") does not
work
d.first
Gives the following error message
java.io.FileNotFoundException: File /MyBucket/MyFolder.tif does not
exist.
--
View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/wholeTextFiles-not-working-with-HDFS-tp7490p10505.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.
Re: wholeTextFiles not working with HDFS
Posted by pierred <pi...@demartines.com>.
I forgot to say, I am using bin/spark-shell, spark-1.0.2
That host has scala version 2.10.4 (Java HotSpot(TM) 64-Bit Server VM, Java
1.8.0_11)
--
View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/wholeTextFiles-not-working-with-HDFS-tp7490p12678.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.
---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscribe@spark.apache.org
For additional commands, e-mail: user-help@spark.apache.org
Re: wholeTextFiles not working with HDFS
Posted by pierred <pi...@demartines.com>.
I had the same issue with spark-1.0.2-bin-hadoop*1*, and indeed the issue
seems related to Hadoop1. When switching to using
spark-1.0.2-bin-hadoop*2*, the issue disappears.
--
View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/wholeTextFiles-not-working-with-HDFS-tp7490p12677.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.
---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscribe@spark.apache.org
For additional commands, e-mail: user-help@spark.apache.org
Re: wholeTextFiles not working with HDFS
Posted by kmader <ke...@gmail.com>.
That worked for me as well, I was using spark 1.0 compiled against Hadoop
1.0, switching to 1.0.1 compiled against hadoop 2
--
View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/wholeTextFiles-not-working-with-HDFS-tp7490p10547.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.