Posted to jira@kafka.apache.org by "Jordan Moore (Jira)" <ji...@apache.org> on 2022/01/19 23:44:00 UTC

[jira] [Commented] (KAFKA-13486) Kafka Connect: Failed to start task due to NPE

    [ https://issues.apache.org/jira/browse/KAFKA-13486?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17479033#comment-17479033 ] 

Jordan Moore commented on KAFKA-13486:
--------------------------------------

You seem to be missing part of the stack trace. The workers need to know about each other via their advertised REST listener settings. I believe the Strimzi Operator's KafkaConnect CRD, for example, configures this correctly.
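
As a rough illustration of the advertised REST settings mentioned above, here is a minimal Java sketch that assembles the relevant worker properties. The keys listeners, rest.advertised.host.name and rest.advertised.port are standard Kafka Connect worker settings. The class name, the POD_IP environment variable, the placeholder address and port 8083 are assumptions made for this example and do not come from the reporter's setup. Whether to advertise a pod IP or a stable DNS name depends on the deployment.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper, for illustration only; not part of Kafka Connect.
class AdvertisedRestConfig {

    // Builds the REST-related worker settings for a single Connect worker.
    // advertisedHost must be an address the other workers can reach.
    static Map<String, String> restSettings(String advertisedHost, int port) {
        Map<String, String> props = new LinkedHashMap<>();
        // Address the worker's REST server binds to inside the container.
        props.put("listeners", "http://0.0.0.0:" + port);
        // Address and port the other workers use to forward REST requests here.
        props.put("rest.advertised.host.name", advertisedHost);
        props.put("rest.advertised.port", String.valueOf(port));
        return props;
    }

    public static void main(String[] args) {
        // POD_IP is an assumed variable name, e.g. injected via the Kubernetes
        // downward API; 192.0.2.10 is only a documentation placeholder.
        String host = System.getenv().getOrDefault("POD_IP", "192.0.2.10");
        restSettings(host, 8083).forEach((k, v) -> System.out.println(k + "=" + v));
    }
}
```

The printed key=value lines can be compared against each worker's properties file to check that every worker advertises an address its peers can actually reach.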

> Kafka Connect: Failed to start task due to NPE
> ----------------------------------------------
>
>                 Key: KAFKA-13486
>                 URL: https://issues.apache.org/jira/browse/KAFKA-13486
>             Project: Kafka
>          Issue Type: Bug
>          Components: KafkaConnect
>    Affects Versions: 2.7.1
>         Environment: Kubernetes, custom docker image
>            Reporter: Geliba Uilte
>            Priority: Major
>
> I have a Kafka Connect cluster with three workers running on Kubernetes. The workers communicate with each other using the pods' IPs (internal IPs, 192.X.X.X). Sometimes pods are rescheduled to a different node. I am not sure whether this is related to the issue, but I think it causes the pod's IP to change, so Kafka Connect needs to rebalance.
> Occasionally, tasks fail due to an NPE.
> From the connectors/:connector/status REST API, I can see this trace:
>  
> {code:java}
> at org.apache.kafka.connect.runtime.Worker.startTask(Worker.java:517)
> at org.apache.kafka.connect.runtime.distributed.DistributedHerder.startTask(DistributedHerder.java:1258)
> at org.apache.kafka.connect.runtime.distributed.DistributedHerder.access$1700(DistributedHerder.java:127)
> at org.apache.kafka.connect.runtime.distributed.DistributedHerder$10.call(DistributedHerder.java:1273)
> at org.apache.kafka.connect.runtime.distributed.DistributedHerder$10.call(DistributedHerder.java:1269)
> at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)
> at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
> at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
> at java.base/java.lang.Thread.run(Thread.java:834){code}
>  
> It looks like the issue is similar to KAFKA-10323, and it seems the NPE is thrown from [here|https://github.com/apache/kafka/blob/2.7.1/connect/runtime/src/main/java/org/apache/kafka/connect/runtime/Worker.java#L517].
>  
>  



--
This message was sent by Atlassian Jira
(v8.20.1#820001)