You are viewing a plain text version of this content. The canonical link for it is here.
Posted to yarn-dev@hadoop.apache.org by "Botong Huang (JIRA)" <ji...@apache.org> on 2017/08/04 21:26:00 UTC

[jira] [Created] (YARN-6955) Concurrent registerAM thread in Federation Interceptor

Botong Huang created YARN-6955:
----------------------------------

             Summary: Concurrent registerAM thread in Federation Interceptor
                 Key: YARN-6955
                 URL: https://issues.apache.org/jira/browse/YARN-6955
             Project: Hadoop YARN
          Issue Type: Bug
            Reporter: Botong Huang
            Assignee: Botong Huang
            Priority: Minor


 The timeout between AM and AMRMProxy is shorter than the timeout + failOver between FederationInterceptor (AMRMProxy) and RM. When the first register thread in FI is blocked because of an RM failover, AM can timeout and resend register call, leading to two outstanding register call inside FI. 

Eventually when RM comes back up, one thread succeeds register and the other thread got an application already registered exception. FI should swallow the exception and return success back to AM in both threads. 



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-dev-unsubscribe@hadoop.apache.org
For additional commands, e-mail: yarn-dev-help@hadoop.apache.org