Posted to commits@samza.apache.org by "Xinyu Liu (JIRA)" <ji...@apache.org> on 2018/01/04 20:37:00 UTC

[jira] [Updated] (SAMZA-1301) Investigate ZKClient exceptions in ZkJobCoordinator.

     [ https://issues.apache.org/jira/browse/SAMZA-1301?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Xinyu Liu updated SAMZA-1301:
-----------------------------
    Fix Version/s:     (was: 0.14.0)
                   0.15.0

> Investigate ZKClient exceptions in ZkJobCoordinator.
> ----------------------------------------------------
>
>                 Key: SAMZA-1301
>                 URL: https://issues.apache.org/jira/browse/SAMZA-1301
>             Project: Samza
>          Issue Type: Bug
>            Reporter: Shanthoosh Venkataraman
>            Priority: Minor
>             Fix For: 0.15.0
>
>
> The ProcessorChange event handler in ZkJobCoordinator fails with the following exception, which halts JobModel generation in some executions. One possible sequence that triggers it:
>     LocalApplicationRunner runner = new LocalApplicationRunner(config);
>     runner.run(streamApp1);
>     runner.kill(streamApp1);
>     runner = new LocalApplicationRunner(config);
>     runner.run(streamApp1);
> 8056 [debounce-thread-0] ERROR org.apache.samza.zk.ZkJobCoordinator - Received exception from in JobCoordinator Processing!
> java.lang.IllegalStateException: ZkClient already closed!
> 	at org.I0Itec.zkclient.ZkClient.retryUntilConnected(ZkClient.java:987)
> 	at org.I0Itec.zkclient.ZkClient.getChildren(ZkClient.java:676)
> 	at org.I0Itec.zkclient.ZkClient.getChildren(ZkClient.java:672)
> 	at org.apache.samza.zk.ZkUtils.getSortedActiveProcessorsZnodes(ZkUtils.java:132)
> 	at org.apache.samza.zk.ZkUtils.getSortedActiveProcessorsIDs(ZkUtils.java:160)
> 	at org.apache.samza.zk.ZkJobCoordinator.generateNewJobModel(ZkJobCoordinator.java:192)
> 	at org.apache.samza.zk.ZkJobCoordinator.doOnProcessorChange(ZkJobCoordinator.java:125)
> 	at org.apache.samza.zk.ZkJobCoordinator$LeaderElectorListenerImpl.lambda$onBecomingLeader$18(ZkJobCoordinator.java:239)
> 	at org.apache.samza.zk.ZkJobCoordinator$LeaderElectorListenerImpl$$Lambda$53/731465117.run(Unknown Source)
> 	at org.apache.samza.zk.ScheduleAfterDebounceTime.lambda$scheduleAfterDebounceTime$0(ScheduleAfterDebounceTime.java:89)
> 	at org.apache.samza.zk.ScheduleAfterDebounceTime$$Lambda$54/1552634391.run(Unknown Source)
> 	at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
> 	at java.util.concurrent.FutureTask.run(FutureTask.java:266)
> 	at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$201(ScheduledThreadPoolExecutor.java:180)
> 	at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293)
> 	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
> 	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
> 	at java.lang.Thread.run(Thread.java:745)
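
The trace shows a task on the debounce thread calling getChildren on a ZkClient that has already been closed. One plausible reading, not confirmed in this issue, is that work scheduled before the kill outlives the client it depends on. The sketch below reproduces that ordering in isolation. Everything in it is hypothetical: FakeClient stands in for the real client, and ClosedClientSketch, countChildren, and failsWithClosedClient are illustrative names, not Samza or ZkClient APIs. It is not the fix adopted for this issue.

```java
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.ScheduledFuture;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicBoolean;
import java.util.concurrent.atomic.AtomicReference;

public class ClosedClientSketch {

    /** Hypothetical stand-in for a ZooKeeper client; not the real ZkClient. */
    static final class FakeClient {
        private final AtomicBoolean closed = new AtomicBoolean(false);

        void close() {
            closed.set(true);
        }

        int countChildren() {
            // Mirrors the failure mode in the trace: any call after close() throws.
            if (closed.get()) {
                throw new IllegalStateException("client already closed");
            }
            return 0;
        }
    }

    /**
     * Schedules delayed work that uses the client, then closes the client.
     * Returns true if the delayed work ran against the closed client and failed.
     */
    static boolean failsWithClosedClient(boolean cancelPendingWork) {
        FakeClient client = new FakeClient();
        ScheduledExecutorService debounce = Executors.newSingleThreadScheduledExecutor();
        AtomicReference<Throwable> failure = new AtomicReference<>();

        // Delayed action, analogous to a debounced "processor change" handler.
        ScheduledFuture<?> pending = debounce.schedule(() -> {
            try {
                client.countChildren();
            } catch (IllegalStateException e) {
                failure.set(e);
            }
        }, 200, TimeUnit.MILLISECONDS);

        if (cancelPendingWork) {
            // Guarded shutdown: drop the pending action before closing the client.
            pending.cancel(false);
        }
        client.close();

        // shutdown() still lets an uncancelled delayed task run (the default policy).
        debounce.shutdown();
        try {
            debounce.awaitTermination(5, TimeUnit.SECONDS);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        return failure.get() != null;
    }

    public static void main(String[] args) {
        System.out.println("unguarded shutdown fails: " + failsWithClosedClient(false));
        System.out.println("guarded shutdown fails:   " + failsWithClosedClient(true));
    }
}
```

The design point is only about ordering: pending debounced actions are cancelled before the resource they use is closed. Whether that ordering is the right remedy for ZkJobCoordinator is left to the investigation this issue asks for.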


