You are viewing a plain text version of this content. The canonical link for it is here.
Posted to yarn-issues@hadoop.apache.org by "Minni Mittal (Jira)" <ji...@apache.org> on 2021/07/08 10:52:00 UTC

[jira] [Updated] (YARN-10841) Fix token reset synchronization by making sure for UAM response token reset is done while in lock.

     [ https://issues.apache.org/jira/browse/YARN-10841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Minni Mittal updated YARN-10841:
--------------------------------
    Description: 
*2021-06-24T10:11:39,465* [ERROR] [AMRM Heartbeater thread] |impl.AMRMClientAsyncImpl|: Exception on heartbeat
org.apache.hadoop.yarn.exceptions.YarnException: org.apache.hadoop.yarn.exceptions.YarnRuntimeException: amrmToken from UAM cluster-0 should be null here
at org.apache.hadoop.yarn.server.nodemanager.amrmproxy.FederationInterceptor.allocate(FederationInterceptor.java:782)
 

 

*2021-06-24T10:10:12,608* INFO  [616916] FederationInterceptor: Received new UAM amrmToken with keyId 843616604 

Hearbeatcallback sets token to null. But because of synchroniztion issue, it happened after mergeAllocate is called. So, while allocate merge is happening the value should get set to null and should have happened Inside lock

> Fix token reset synchronization by making sure for UAM response token reset is done while in lock. 
> ---------------------------------------------------------------------------------------------------
>
>                 Key: YARN-10841
>                 URL: https://issues.apache.org/jira/browse/YARN-10841
>             Project: Hadoop YARN
>          Issue Type: Bug
>            Reporter: Minni Mittal
>            Assignee: Minni Mittal
>            Priority: Minor
>
> *2021-06-24T10:11:39,465* [ERROR] [AMRM Heartbeater thread] |impl.AMRMClientAsyncImpl|: Exception on heartbeat
> org.apache.hadoop.yarn.exceptions.YarnException: org.apache.hadoop.yarn.exceptions.YarnRuntimeException: amrmToken from UAM cluster-0 should be null here
> at org.apache.hadoop.yarn.server.nodemanager.amrmproxy.FederationInterceptor.allocate(FederationInterceptor.java:782)
>  
>  
> *2021-06-24T10:10:12,608* INFO  [616916] FederationInterceptor: Received new UAM amrmToken with keyId 843616604 
> Hearbeatcallback sets token to null. But because of synchroniztion issue, it happened after mergeAllocate is called. So, while allocate merge is happening the value should get set to null and should have happened Inside lock



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: yarn-issues-help@hadoop.apache.org