You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@curator.apache.org by "Sergey Nuyanzin (Jira)" <ji...@apache.org> on 2022/08/16 10:24:00 UTC

[jira] [Updated] (CURATOR-649) Background exception was not retry-able or retry gave up

     [ https://issues.apache.org/jira/browse/CURATOR-649?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sergey Nuyanzin updated CURATOR-649:
------------------------------------
    Description: 
This is is probably a kind of reopen of CURATOR-538
the case is we have 3 flink-nodes which under the hood using curator.
Then just add one node and remove another one. 
After that some nodes start having NPE.
{noformat}
java.lang.NullPointerException: null
at org.apache.curator.utils.Compatibility.getHostAddress(Compatibility.java:116) 
at org.apache.curator.framework.imps.EnsembleTracker.configToConnectionString(EnsembleTracker.java:185)
at org.apache.curator.framework.imps.EnsembleTracker.processConfigData(EnsembleTracker.java:206) 
at org.apache.curator.framework.imps.EnsembleTracker.access$300(EnsembleTracker.java:50)
at org.apache.curator.framework.imps.EnsembleTracker$2.processResult(EnsembleTracker.java:150)
at org.apache.curator.framework.imps.CuratorFrameworkImpl.sendToBackgroundCallback(CuratorFrameworkImpl.java:926) 
at org.apache.curator.framework.imps.CuratorFrameworkImpl.processBackgroundOperation(CuratorFrameworkImpl.java:683)
at org.apache.curator.framework.imps.WatcherRemovalFacade.processBackgroundOperation(WatcherRemovalFacade.java:152)
at org.apache.curator.framework.imps.GetConfigBuilderImpl$2.processResult(GetConfigBuilderImpl.java:222)
at org.apache.zookeeper.ClientCnxn$EventThread.processEvent(ClientCnxn.java:598)
at org.apache.zookeeper.ClientCnxn$EventThread.run(ClientCnxn.java:510) 
{noformat}
We've got same issue and trace is very close it fails with NPE trying get Address
the only difference is that it is trying to get Address at {{org.apache.curator.utils.Compatibility#getHostAddress}}
at line 
{noformat}
return (address != null) ? address.getAddress().getHostAddress() : "unknown";
{noformat}
We do not use telepresence mentioned in comments

  was:
This is is probably a kind of reopen of CURATOR-538
the case is we have 3 flink-nodes which under the hood using curator.
Then just add one node and remove another one. 
After that some nodes start having NPE.
We've got same issue and trace is very close it fails with NPE trying get Address
the only difference is that it is trying to get Address at {{org.apache.curator.utils.Compatibility#getHostAddress}}
at line 
{noformat}
return (address != null) ? address.getAddress().getHostAddress() : "unknown";
{noformat}
We do not use telepresence mentioned in comments


> Background exception was not retry-able or retry gave up
> --------------------------------------------------------
>
>                 Key: CURATOR-649
>                 URL: https://issues.apache.org/jira/browse/CURATOR-649
>             Project: Apache Curator
>          Issue Type: Bug
>          Components: Client
>    Affects Versions: 5.2.0
>            Reporter: Sergey Nuyanzin
>            Priority: Major
>
> This is is probably a kind of reopen of CURATOR-538
> the case is we have 3 flink-nodes which under the hood using curator.
> Then just add one node and remove another one. 
> After that some nodes start having NPE.
> {noformat}
> java.lang.NullPointerException: null
> at org.apache.curator.utils.Compatibility.getHostAddress(Compatibility.java:116) 
> at org.apache.curator.framework.imps.EnsembleTracker.configToConnectionString(EnsembleTracker.java:185)
> at org.apache.curator.framework.imps.EnsembleTracker.processConfigData(EnsembleTracker.java:206) 
> at org.apache.curator.framework.imps.EnsembleTracker.access$300(EnsembleTracker.java:50)
> at org.apache.curator.framework.imps.EnsembleTracker$2.processResult(EnsembleTracker.java:150)
> at org.apache.curator.framework.imps.CuratorFrameworkImpl.sendToBackgroundCallback(CuratorFrameworkImpl.java:926) 
> at org.apache.curator.framework.imps.CuratorFrameworkImpl.processBackgroundOperation(CuratorFrameworkImpl.java:683)
> at org.apache.curator.framework.imps.WatcherRemovalFacade.processBackgroundOperation(WatcherRemovalFacade.java:152)
> at org.apache.curator.framework.imps.GetConfigBuilderImpl$2.processResult(GetConfigBuilderImpl.java:222)
> at org.apache.zookeeper.ClientCnxn$EventThread.processEvent(ClientCnxn.java:598)
> at org.apache.zookeeper.ClientCnxn$EventThread.run(ClientCnxn.java:510) 
> {noformat}
> We've got same issue and trace is very close it fails with NPE trying get Address
> the only difference is that it is trying to get Address at {{org.apache.curator.utils.Compatibility#getHostAddress}}
> at line 
> {noformat}
> return (address != null) ? address.getAddress().getHostAddress() : "unknown";
> {noformat}
> We do not use telepresence mentioned in comments



--
This message was sent by Atlassian Jira
(v8.20.10#820010)