Posted to user@spark.apache.org by shamu <pr...@hotmail.com> on 2016/10/06 01:58:23 UTC

CodecFactory does not return a codec for a file created by Spark

I have two files, both created by Spark.

1. names.snappy was written with Snappy compression.
CodecFactory(conf).getCodec() correctly returns
org.apache.hadoop.io.compress.SnappyCodec for it.

2. pairs-avro-snappy-compressed.avro was produced by reading an Avro file
and writing it back out with Snappy compression. The header of this file
does show the snappy codec, but CodecFactory(conf).getCodec() returns null.

How can I get the actual codec for the second file?
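
One likely explanation, assuming CodecFactory here refers to Hadoop's CompressionCodecFactory: getCodec(Path) picks the codec from the file name suffix, so names.snappy matches SnappyCodec while a name ending in .avro matches nothing and null comes back. An Avro container file is not compressed as a whole. Its data blocks are compressed individually, and the codec name is stored in the file header metadata under the key avro.codec. The Python sketch below is not from the original post. It reads that metadata entry using only the standard library; the function names are hypothetical helpers, and the sample header is synthetic.

```python
import io

AVRO_MAGIC = b"Obj\x01"  # first four bytes of every Avro object container file


def _read_long(stream):
    """Decode one Avro long (variable-length, zigzag encoded)."""
    shift = 0
    raw = 0
    while True:
        byte = stream.read(1)
        if not byte:
            raise EOFError("unexpected end of Avro header")
        raw |= (byte[0] & 0x7F) << shift
        if not byte[0] & 0x80:  # high bit clear marks the last byte
            break
        shift += 7
    return (raw >> 1) ^ -(raw & 1)  # undo zigzag encoding


def _read_bytes(stream):
    """Read a length-prefixed byte string (Avro 'bytes' and 'string')."""
    length = _read_long(stream)
    data = stream.read(length)
    if len(data) != length:
        raise EOFError("unexpected end of Avro header")
    return data


def read_avro_metadata(stream):
    """Return the header metadata map of an Avro container file."""
    if stream.read(4) != AVRO_MAGIC:
        raise ValueError("not an Avro object container file")
    meta = {}
    while True:
        count = _read_long(stream)
        if count == 0:  # a zero count terminates the map
            break
        if count < 0:  # negative count: a byte size follows, skip it
            count = -count
            _read_long(stream)
        for _ in range(count):
            key = _read_bytes(stream).decode("utf-8")
            meta[key] = _read_bytes(stream)
    return meta


def avro_codec(stream):
    """Name of the block codec; the Avro spec defaults to 'null' if absent."""
    return read_avro_metadata(stream).get("avro.codec", b"null").decode("utf-8")


# Synthetic header for illustration: two metadata entries, end-of-map marker,
# then a 16-byte sync marker (all zeros here).
sample = (b"Obj\x01\x04\x14avro.codec\x0csnappy"
          b"\x16avro.schema\x10\"string\"\x00" + bytes(16))
print(avro_codec(io.BytesIO(sample)))  # prints: snappy
```

For a real file, open it in binary mode, for example open("pairs-avro-snappy-compressed.avro", "rb"), and pass the file object to avro_codec. From Java or Scala, Avro's DataFileReader should expose the same entry through getMetaString("avro.codec"), which avoids parsing the header by hand.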

pairs-avro-snappy-compressed.avro
<http://apache-spark-user-list.1001560.n3.nabble.com/file/n27846/pairs-avro-snappy-compressed.avro>  


----- pairs-avro-snappy.avro looks like this -----

Objavro.codec
snappyavro.schema�{"type":"record","name":"topLevelRecord","fields":[{"name":"left","type":["string","null"]},{"name":"right","type":["string","null"]}]}
W=/l�9Ϊ W=�<\ a 1 c 2 b 3 b 26d�hW=/l�9Ϊ W=�


