You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@spark.apache.org by sprookie <cu...@gmail.com> on 2015/03/18 18:21:53 UTC
Null pointer exception reading Parquet
Hi All,
I am using Saprk version 1.2 running locally. When I try to read a paquet
file I get below exception, what might be the issue?
Any help will be appreciated. This is the simplest operation/action on a
parquet file.
//code snippet//
val sparkConf = new SparkConf().setAppName("
Testing").setMaster("local[10]")
val sc = new SparkContext(sparkConf)
val sqlContext = new org.apache.spark.sql.SQLContext(sc)
sqlContext.setConf("spark.sql.parquet.binaryAsString","true")
import sqlContext._
val temp = "local path to file"
val temp2 = sqlContext.parquetFile(temp)
temp2.printSchema
//end code snippet
//Exception trace
Exception in thread "main" java.lang.NullPointerException
at
parquet.format.converter.ParquetMetadataConverter.fromParquetStatistics(ParquetMetadataConverter.java:249)
at
parquet.format.converter.ParquetMetadataConverter.fromParquetMetadata(ParquetMetadataConverter.java:543)
at
parquet.format.converter.ParquetMetadataConverter.readParquetMetadata(ParquetMetadataConverter.java:520)
at parquet.hadoop.ParquetFileReader.readFooter(ParquetFileReader.java:426)
at parquet.hadoop.ParquetFileReader.readFooter(ParquetFileReader.java:389)
at
org.apache.spark.sql.parquet.ParquetTypesConverter$$anonfun$readMetaData$3.apply(ParquetTypes.scala:457)
at
org.apache.spark.sql.parquet.ParquetTypesConverter$$anonfun$readMetaData$3.apply(ParquetTypes.scala:457)
at scala.Option.map(Option.scala:145)
at
org.apache.spark.sql.parquet.ParquetTypesConverter$.readMetaData(ParquetTypes.scala:457)
at
org.apache.spark.sql.parquet.ParquetTypesConverter$.readSchemaFromFile(ParquetTypes.scala:477)
at
org.apache.spark.sql.parquet.ParquetRelation.<init>(ParquetRelation.scala:65)
at org.apache.spark.sql.SQLContext.parquetFile(SQLContext.scala:165)
//End Exception trace
--
View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Null-pointer-exception-reading-Parquet-tp22124.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.
Re: Null pointer exception reading Parquet
Posted by Akhil Das <ak...@sigmoidanalytics.com>.
How are you running the application? Can you try running the same inside
spark-shell?
Thanks
Best Regards
On Wed, Mar 18, 2015 at 10:51 PM, sprookie <cu...@gmail.com> wrote:
> Hi All,
>
> I am using Saprk version 1.2 running locally. When I try to read a paquet
> file I get below exception, what might be the issue?
> Any help will be appreciated. This is the simplest operation/action on a
> parquet file.
>
>
> //code snippet//
>
>
> val sparkConf = new SparkConf().setAppName("
> Testing").setMaster("local[10]")
> val sc = new SparkContext(sparkConf)
> val sqlContext = new org.apache.spark.sql.SQLContext(sc)
> sqlContext.setConf("spark.sql.parquet.binaryAsString","true")
>
> import sqlContext._
> val temp = "local path to file"
> val temp2 = sqlContext.parquetFile(temp)
>
> temp2.printSchema
>
>
> //end code snippet
>
>
>
> //Exception trace
>
> Exception in thread "main" java.lang.NullPointerException
> at
> parquet.format.converter.ParquetMetadataConverter.fromParquetStatistics(ParquetMetadataConverter.java:249)
> at
> parquet.format.converter.ParquetMetadataConverter.fromParquetMetadata(ParquetMetadataConverter.java:543)
> at
> parquet.format.converter.ParquetMetadataConverter.readParquetMetadata(ParquetMetadataConverter.java:520)
> at parquet.hadoop.ParquetFileReader.readFooter(ParquetFileReader.java:426)
> at parquet.hadoop.ParquetFileReader.readFooter(ParquetFileReader.java:389)
> at
> org.apache.spark.sql.parquet.ParquetTypesConverter$$anonfun$readMetaData$3.apply(ParquetTypes.scala:457)
> at
> org.apache.spark.sql.parquet.ParquetTypesConverter$$anonfun$readMetaData$3.apply(ParquetTypes.scala:457)
> at scala.Option.map(Option.scala:145)
> at
> org.apache.spark.sql.parquet.ParquetTypesConverter$.readMetaData(ParquetTypes.scala:457)
> at
> org.apache.spark.sql.parquet.ParquetTypesConverter$.readSchemaFromFile(ParquetTypes.scala:477)
> at
> org.apache.spark.sql.parquet.ParquetRelation.<init>(ParquetRelation.scala:65)
> at org.apache.spark.sql.SQLContext.parquetFile(SQLContext.scala:165)
>
> //End Exception trace
>
> ------------------------------
> View this message in context: Null pointer exception reading Parquet
> <http://apache-spark-user-list.1001560.n3.nabble.com/Null-pointer-exception-reading-Parquet-tp22124.html>
> Sent from the Apache Spark User List mailing list archive
> <http://apache-spark-user-list.1001560.n3.nabble.com/> at Nabble.com.
>