Posted to jira@kafka.apache.org by "Jordan Moore (Jira)" <ji...@apache.org> on 2022/01/19 23:44:00 UTC
[jira] [Commented] (KAFKA-13486) Kafka Connect: Failed to start task due to NPE
[ https://issues.apache.org/jira/browse/KAFKA-13486?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17479033#comment-17479033 ]
Jordan Moore commented on KAFKA-13486:
--------------------------------------
You seem to be missing part of the stack trace. The workers need to know about each other via their advertised REST listener settings. I believe the Strimzi Operator's KafkaConnect CRD, for example, configures this correctly.
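As a rough sketch of what that could look like in a distributed worker's properties file (the hostname is a hypothetical placeholder for an address that stays stable across pod restarts, and 8083 is the default REST port; adjust both to your deployment):

{code}
# Address the worker binds its REST server to inside the pod
listeners=http://0.0.0.0:8083
# Address handed to the other workers for forwarding REST requests.
# Placeholder hostname: use a name that survives pod rescheduling
# rather than the pod IP.
rest.advertised.host.name=connect-0.connect.example.internal
rest.advertised.port=8083
rest.advertised.listener=http
{code}

Each worker would need its own advertised host name. Whether this is related to the NPE is not something I can confirm without the rest of the trace.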
> Kafka Connect: Failed to start task due to NPE
> ----------------------------------------------
>
> Key: KAFKA-13486
> URL: https://issues.apache.org/jira/browse/KAFKA-13486
> Project: Kafka
> Issue Type: Bug
> Components: KafkaConnect
> Affects Versions: 2.7.1
> Environment: Kubernetes, custom docker image
> Reporter: Geliba Uilte
> Priority: Major
>
> I have a Kafka Connect cluster with three workers running on Kubernetes. The workers communicate with each other using each pod's IP (internal IP 192.X.X.X). Sometimes, pods are rescheduled onto a different node. I am not sure if it has anything to do with the issue, but I think this causes the pod's IP to change, and Kafka Connect then needs to rebalance.
> Occasionally, tasks fail due to an NPE.
> From the connectors/:connector/status REST API, I can see this trace:
>
> {code:java}
> at org.apache.kafka.connect.runtime.Worker.startTask(Worker.java:517)
> at org.apache.kafka.connect.runtime.distributed.DistributedHerder.startTask(DistributedHerder.java:1258)
> at org.apache.kafka.connect.runtime.distributed.DistributedHerder.access$1700(DistributedHerder.java:127)
> at org.apache.kafka.connect.runtime.distributed.DistributedHerder$10.call(DistributedHerder.java:1273)
> at org.apache.kafka.connect.runtime.distributed.DistributedHerder$10.call(DistributedHerder.java:1269)
> at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)
> at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
> at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
> at java.base/java.lang.Thread.run(Thread.java:834){code}
>
> It looks like the issue is similar to KAFKA-10323, and it seems the NPE is thrown from [here|https://github.com/apache/kafka/blob/2.7.1/connect/runtime/src/main/java/org/apache/kafka/connect/runtime/Worker.java#L517].
>
>