You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "Ethan Guo (Jira)" <ji...@apache.org> on 2022/03/16 22:48:00 UTC

[jira] [Assigned] (HUDI-3610) Validate Hudi Kafka Connect Sink writing to S3

     [ https://issues.apache.org/jira/browse/HUDI-3610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ethan Guo reassigned HUDI-3610:
-------------------------------

    Assignee: Raymond Xu  (was: Ethan Guo)

> Validate Hudi Kafka Connect Sink writing to S3
> ----------------------------------------------
>
>                 Key: HUDI-3610
>                 URL: https://issues.apache.org/jira/browse/HUDI-3610
>             Project: Apache Hudi
>          Issue Type: Task
>            Reporter: Ethan Guo
>            Assignee: Raymond Xu
>            Priority: Critical
>             Fix For: 0.11.0
>
>
> From community:
> Hi guys, I'm trying to implement this architecture with hudi
> db table --- Debezium --> kafka ---Hudi sink connector --> S3 bucket
> My setting
> Kafka version 2.4
> Hudi version 0.10.1
> Hdf sink connector version 10.1.4
> I'm encountering this error
> ERROR WorkerSinkTask\{id=<XXX>} Task threw an uncaught and unrecoverable exception. Task is being killed and will not recover until manually restarted (org.apache.kafka.connect.runtime.WorkerTask)
> java.lang.NoClassDefFoundError: org/apache/hadoop/fs/FSDataInputStream
> 	at org.apache.hudi.connect.HoodieSinkTask.start(HoodieSinkTask.java:80)
> 	at org.apache.kafka.connect.runtime.WorkerSinkTask.initializeAndStart(WorkerSinkTask.java:312)
> 	at org.apache.kafka.connect.runtime.WorkerTask.doRun(WorkerTask.java:186)
> 	at org.apache.kafka.connect.runtime.WorkerTask.run(WorkerTask.java:243)
> 	at java.base/java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:515)
> 	at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)
> 	at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
> 	at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
> 	at java.base/java.lang.Thread.run(Thread.java:829)
> Caused by: java.lang.ClassNotFoundException: org.apache.hadoop.fs.FSDataInputStream
> 	at java.base/java.net.URLClassLoader.findClass(URLClassLoader.java:476)
> 	at java.base/java.lang.ClassLoader.loadClass(ClassLoader.java:589)
> 	at org.apache.kafka.connect.runtime.isolation.PluginClassLoader.loadClass(PluginClassLoader.java:103)
> 	at java.base/java.lang.ClassLoader.loadClass(ClassLoader.java:522)
> 	... 9 more



--
This message was sent by Atlassian Jira
(v8.20.1#820001)