You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2021/08/19 11:05:00 UTC

[jira] [Commented] (HUDI-2320) Add support ByteArrayDeserializer in AvroKafkaSource

    [ https://issues.apache.org/jira/browse/HUDI-2320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17401619#comment-17401619 ] 

ASF GitHub Bot commented on HUDI-2320:
--------------------------------------

dongkelun opened a new pull request #3502:
URL: https://github.com/apache/hudi/pull/3502


   ## What is the purpose of the pull request
   Add support ByteArrayDeserializer in AvroKafkaSource
   
   
   
   ## Verify this pull request
   
   When the 'value.serializer' of Kafka Avro Producer is 'org.apache.kafka.common.serialization.ByteArraySerializer',Use the following configuration
   
   --source-class org.apache.hudi.utilities.sources.AvroKafkaSource \
   --schemaprovider-class org.apache.hudi.utilities.schema.JdbcbasedSchemaProvider \
   --hoodie-conf "hoodie.deltastreamer.source.kafka.value.deserializer.class=org.apache.kafka.common.serialization.ByteArrayDeserializer"
   
   For now,It will throw an exception:
   java.lang.ClassCastException: [B cannot be cast to org.apache.avro.generic.GenericRecord
   
   After support ByteArrayDeserializer,Use the configuration above,It works properly.And there is no need to provide 'schema.registry.url',For example, we can use the JdbcbasedSchemaProvider to get the sourceSchema
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


> Add support ByteArrayDeserializer in AvroKafkaSource
> ----------------------------------------------------
>
>                 Key: HUDI-2320
>                 URL: https://issues.apache.org/jira/browse/HUDI-2320
>             Project: Apache Hudi
>          Issue Type: Improvement
>          Components: DeltaStreamer
>            Reporter: 董可伦
>            Assignee: 董可伦
>            Priority: Major
>             Fix For: 0.10.0
>
>
> When the 'value.serializer' of Kafka Avro Producer is 'org.apache.kafka.common.serialization.ByteArraySerializer',Use the following configuration
> {code:java}
> --source-class org.apache.hudi.utilities.sources.AvroKafkaSource \
> --schemaprovider-class org.apache.hudi.utilities.schema.JdbcbasedSchemaProvider \
> --hoodie-conf "hoodie.deltastreamer.source.kafka.value.deserializer.class=org.apache.kafka.common.serialization.ByteArrayDeserializer"
> {code}
> For now,It will throw an exception::
> {code:java}
> java.lang.ClassCastException: [B cannot be cast to org.apache.avro.generic.GenericRecord{code}
> After support ByteArrayDeserializer,Use the configuration above,It works properly.And there is no need to provide 'schema.registry.url',For example, we can use the JdbcbasedSchemaProvider to get the sourceSchema



--
This message was sent by Atlassian Jira
(v8.3.4#803005)