You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@avro.apache.org by 徐凡 <ti...@hotmail.com> on 2015/01/13 20:18:51 UTC

When use AvroUtf8InputFormat, how to remove unknown symbols(BytePrefix)

Hi All,
My folder has avro files with multiple different avro schemas.
So I use AvroUtf8InputFormat to receive data for mapper()
But I got several unknown symbols, which results to be Utf8 version BytePrefix.
Anyone has ideas on how to filter these BytePrefix?
Thanks a lot,
Fan