Posted to dev@slider.apache.org by "Billie Rinaldi (JIRA)" <ji...@apache.org> on 2017/01/19 18:07:26 UTC
[jira] [Updated] (SLIDER-1189) Agent never connects to new AM
[ https://issues.apache.org/jira/browse/SLIDER-1189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Billie Rinaldi updated SLIDER-1189:
-----------------------------------
Attachment: SLIDER-1189.1.patch
> Agent never connects to new AM
> ------------------------------
>
> Key: SLIDER-1189
> URL: https://issues.apache.org/jira/browse/SLIDER-1189
> Project: Slider
> Issue Type: Bug
> Components: agent
> Reporter: Billie Rinaldi
> Assignee: Billie Rinaldi
> Fix For: Slider 1.0.0
>
> Attachments: SLIDER-1189.1.patch
>
>
> While testing RM and AM failure scenarios, I killed my RM, killed the AM, waited a bit, then restarted the RM. The AM is restarted, but running agents never connect to the new AM. The agent re-reads the AM data from the ZK registry once, when the heartbeat retry threshold is reached, and then tries to re-register with the AM. However, if the AM data is still stale at that point, the agent never re-reads it from the ZK registry and keeps retrying registration with the nonexistent AM forever (until the new AM times the agent out due to heartbeat loss and kills it).
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)