You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "Hive QA (Jira)" <ji...@apache.org> on 2020/02/21 09:42:00 UTC

[jira] [Commented] (HIVE-21218) KafkaSerDe doesn't support topics created via Confluent Avro serializer

    [ https://issues.apache.org/jira/browse/HIVE-21218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17041710#comment-17041710 ] 

Hive QA commented on HIVE-21218:
--------------------------------



Here are the results of testing the latest attachment:
https://issues.apache.org/jira/secure/attachment/12958362/HIVE-21218.2.patch

{color:red}ERROR:{color} -1 due to build exiting with an error

Test results: https://builds.apache.org/job/PreCommit-HIVE-Build/20763/testReport
Console output: https://builds.apache.org/job/PreCommit-HIVE-Build/20763/console
Test logs: http://104.198.109.242/logs/PreCommit-HIVE-Build-20763/

Messages:
{noformat}
Executing org.apache.hive.ptest.execution.TestCheckPhase
Executing org.apache.hive.ptest.execution.PrepPhase
Tests exited with: NonZeroExitCodeException
Command 'bash /data/hiveptest/working/scratch/source-prep.sh' failed with exit status 1 and output '+ date '+%Y-%m-%d %T.%3N'
2020-02-21 09:40:35.488
+ [[ -n /usr/lib/jvm/java-8-openjdk-amd64 ]]
+ export JAVA_HOME=/usr/lib/jvm/java-8-openjdk-amd64
+ JAVA_HOME=/usr/lib/jvm/java-8-openjdk-amd64
+ export PATH=/usr/lib/jvm/java-8-openjdk-amd64/bin/:/usr/local/bin:/usr/bin:/bin:/usr/local/games:/usr/games
+ PATH=/usr/lib/jvm/java-8-openjdk-amd64/bin/:/usr/local/bin:/usr/bin:/bin:/usr/local/games:/usr/games
+ export 'ANT_OPTS=-Xmx1g -XX:MaxPermSize=256m '
+ ANT_OPTS='-Xmx1g -XX:MaxPermSize=256m '
+ export 'MAVEN_OPTS=-Xmx1g '
+ MAVEN_OPTS='-Xmx1g '
+ cd /data/hiveptest/working/
+ tee /data/hiveptest/logs/PreCommit-HIVE-Build-20763/source-prep.txt
+ [[ false == \t\r\u\e ]]
+ mkdir -p maven ivy
+ [[ git = \s\v\n ]]
+ [[ git = \g\i\t ]]
+ [[ -z master ]]
+ [[ -d apache-github-source-source ]]
+ [[ ! -d apache-github-source-source/.git ]]
+ [[ ! -d apache-github-source-source ]]
+ date '+%Y-%m-%d %T.%3N'
2020-02-21 09:40:35.490
+ cd apache-github-source-source
+ git fetch origin
From https://github.com/apache/hive
   f826283..ffee30e  master     -> origin/master
+ git reset --hard HEAD
HEAD is now at f826283 HIVE-22816 : QueryCache: Queries using views can have them cached after CTE expansion (Gopal V via Jesus Camacho Rodriguez)
+ git clean -f -d
Removing standalone-metastore/metastore-server/src/gen/
+ git checkout master
Already on 'master'
Your branch is behind 'origin/master' by 1 commit, and can be fast-forwarded.
  (use "git pull" to update your local branch)
+ git reset --hard origin/master
HEAD is now at ffee30e HIVE-21164: ACID: explore how we can avoid a move step during inserts/compaction (Marta Kuczora, reviewed by Peter Vary)
+ git merge --ff-only origin/master
Already up-to-date.
+ date '+%Y-%m-%d %T.%3N'
2020-02-21 09:40:37.273
+ rm -rf ../yetus_PreCommit-HIVE-Build-20763
+ mkdir ../yetus_PreCommit-HIVE-Build-20763
+ git gc
+ cp -R . ../yetus_PreCommit-HIVE-Build-20763
+ mkdir /data/hiveptest/logs/PreCommit-HIVE-Build-20763/yetus
+ patchCommandPath=/data/hiveptest/working/scratch/smart-apply-patch.sh
+ patchFilePath=/data/hiveptest/working/scratch/build.patch
+ [[ -f /data/hiveptest/working/scratch/build.patch ]]
+ chmod +x /data/hiveptest/working/scratch/smart-apply-patch.sh
+ /data/hiveptest/working/scratch/smart-apply-patch.sh /data/hiveptest/working/scratch/build.patch
Trying to apply the patch with -p0
error: patch failed: kafka-handler/README.md:25
Falling back to three-way merge...
Applied patch to 'kafka-handler/README.md' with conflicts.
error: patch failed: kafka-handler/pom.xml:114
Falling back to three-way merge...
Applied patch to 'kafka-handler/pom.xml' cleanly.
Going to apply patch with: git apply -p0
/data/hiveptest/working/scratch/build.patch:10: trailing whitespace.
If you use Confluent Avro serialzier/deserializer with schema registry you may want to remove 5 bytes from beginning that represents magic byte + schema ID from registry. 
error: patch failed: kafka-handler/README.md:25
Falling back to three-way merge...
Applied patch to 'kafka-handler/README.md' with conflicts.
error: patch failed: kafka-handler/pom.xml:114
Falling back to three-way merge...
Applied patch to 'kafka-handler/pom.xml' cleanly.
U kafka-handler/README.md
warning: 1 line adds whitespace errors.
+ result=1
+ '[' 1 -ne 0 ']'
+ rm -rf yetus_PreCommit-HIVE-Build-20763
+ exit 1
'
{noformat}

This message is automatically generated.

ATTACHMENT ID: 12958362 - PreCommit-HIVE-Build

> KafkaSerDe doesn't support topics created via Confluent Avro serializer
> -----------------------------------------------------------------------
>
>                 Key: HIVE-21218
>                 URL: https://issues.apache.org/jira/browse/HIVE-21218
>             Project: Hive
>          Issue Type: Bug
>          Components: kafka integration, Serializers/Deserializers
>    Affects Versions: 3.1.1
>            Reporter: Milan Baran
>            Assignee: Milan Baran
>            Priority: Major
>              Labels: pull-request-available
>         Attachments: HIVE-21218.2.patch, HIVE-21218.patch
>
>          Time Spent: 4h 40m
>  Remaining Estimate: 0h
>
> According to [Google groups|https://groups.google.com/forum/#!topic/confluent-platform/JYhlXN0u9_A] the Confluent avro serialzier uses propertiary format for kafka value - <magic_byte 0x00><4 bytes of schema ID><regular avro bytes for object that conforms to schema>. 
> This format does not cause any problem for Confluent kafka deserializer which respect the format however for hive kafka handler its bit a problem to correctly deserialize kafka value, because Hive uses custom deserializer from bytes to objects and ignores kafka consumer ser/deser classes provided via table property.
> It would be nice to support Confluent format with magic byte.
> Also it would be great to support Schema registry as well.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)